The Cambridge Encyclopedia of Language Science

The Cambridge Encyclopedia of the Language Sciences
Have you lost track of developments in generative linguistics, i nding yourself unsure about
the distinctive features of Minimalism? Would you like to know more about recent advances in
the genetics of language, or about right hemisphere linguistic operation? Has your interest in
narrative drawn you to question the relation between stories and grammars? he Cambridge
Encyclopedia of the Language Sciences addresses these issues, along with hundreds of others.
It includes basic entries for those unfamiliar with a given topic and more speciic entries for
those seeking more specialized knowledge. It incorporates both well-established i ndings and
cutting-edge research as well as classical approaches and new theoretical innovations. he vol-
ume is aimed at readers who have an interest in some aspect of language science but wish to
learn more about the broad range of ideas, i ndings, practices, and prospects that constitute
this rapidly expanding ield, a ield arguably at the center of current research on the human
mind and human society.
Patrick Colm Hogan is a professor in the Department of English and the Program in Cognitive
Science at the University of Connecticut. He is the author of ten books, including Cognitive
Science, Literature, and the Arts: A Guide for Humanists and he Mind and Its Stories: Narrative
Universals and Human Emotion (Cambridge University Press, 2003).
Advance Praise for
For both range and depth of exposition and commentary on the diverse disciplinary angles
that exist on the nature of language, there is no single volume to match this i ne work of
reference.
Akeel Bilgrami, Columbia University
he Cambridge Encyclopedia of the Language Sciences is a very welcome addition to the ield
of language sciences. Its comprehensiveness is praiseworthy, as is the quality of its entries and
discussions.
Seymour Chatman, University of California, Berkeley
h is ambitious and comprehensive work, and the very high quality of the editors and con-
tributors, ensure that it will be a valuable contribution to the understanding of language and
its uses, for both professionals and a more general audience.
Noam Chomsky, Massachusetts Institute of Technology
THE CAMBRIDGE ENCYCLOPEDIA OF
THE LANGUAGE SCIENCES

Edited by
PATRICK COLM HOGAN
University of Connecticut
C AM BRIDG E U N I VE RSI T Y PRE SS
Cambridge, New York, Melbourne, Madrid, Cape Town, Singapore,
So Paulo, Delhi, Dubai, Tokyo, Mexico City
Cambridge University Press

32 Avenue of the Americas, New York, NY 10013-2473, USA
www.cambridge.org
Information on this title: www.cambridge.org/9780521866897
Cambridge University Press 2011
his publication is in copyright. Subject to statutory exception

and to the provisions of relevant collective licensing agreements,
no reproduction of any part may take place without the written
permission of Cambridge University Press.
First published 2011
Printed in the United States of America
A catalog record for this publication is available from the British Library.
Library of Congress Cataloging in Publication data

he Cambridge encyclopedia of the language sciences / edited by Patrick Colm Hogan.
p. cm.
Includes bibliographical references and index.
ISBN 978-0-521-86689-7 (hardback)
1. Linguistics Encyclopedias. I. Hogan, Patrick Colm. II. Title.
P29.C28 2009
410.3dc22 2008041978
ISBN 978-0-521-86689-7 Hardback
Cambridge University Press has no responsibility for the persistence or

accuracy of URLs for external or third-party Internet Web sites referred to in
this publication and does not guarantee that any content on such Web sites is,
or will remain, accurate or appropriate.
GENERAL EDITOR
Patrick Colm Hogan
University of Connecticut, Storrs
ADVISORY EDITORIAL BOARD
Florian Coulmas Barbara Lust

German Institute of Japanese Studies Cornell University
and Duisberg-Essen University
William Croft Lee Osterhout

University of New Mexico University of Washington
Lyle Jenkins James Pustejovsky

Biolinguistics Institute Brandeis University
CONSULTING EDITORIAL BOARD
Mark Baker Howard Lasnik

Rutgers University University of Maryland
Deborah Cameron Loraine Obler

University of Oxford City University of New York
Nigel Fabb William OGrady

University of Strathclyde University of Hawaii
Carol Ann Fowler Susan Pintzuk

Haskins Laboratories University of York
and University of Connecticut
Ronald Geluykens Eleanor Rosch

University of Oldenburg University of California, Berkeley
Margaret Harris Jay Rueckl

Oxford Brookes University University of Connecticut
Zoltn Kvecses Mark Turner

Etvs Lornd University Case Western Reserve University
To the memory of B. N. Pandit (19162007)
philosopher, Sanskritist, father-in-law
Purua-artha-nyn gun pratiprasava kaivalya

sva-rpa-pratih v citi-aktir-iti
Patajali
CONTENTS
List of Entries page xi

A Note on Cross-References and the Alphabetization of the Entries xv
Preface: On the Very Idea of Language Sciences xvii
Acknowledgments xxiii
1 Language Structure in Its Human Context: New Directions

for the Language Sciences in the Twenty-First Century 1
William Croft
2 The Psychology of Linguistic Form 12

Lee Osterhout, Richard A. Wright, and Mark D. Allen
3 The Structure of Meaning 23

James Pustejovsky
4 Social Practices of Speech and Writing 35

Florian Coulmas
5 Explaining Language: Neuroscience, Genetics, and Evolution 46

Lyle Jenkins
6 Acquisition of Language 56
Barbara Lust
7 Elaborating Speech and Writing: Verbal Art 65

Patrick Colm Hogan
ENTRIES 77
List of Contributors 941

Index 953
ix
ENTRIES
A Biolinguistics Computational Linguistics

Abduction Birdsong and Human Language Concepts
Absolute and Statistical Universals Blended Space Conceptual Blending
Accessibility Hierarchy Blindness and Language Conceptual Development and Change
Acoustic Phonetics Bounding Conceptual Metaphor
Adaptation Brain and Language Conduit Metaphor
Ad Hoc Categories Brocas Area Connectionism and Grammar
Adjacency Pair Connectionism, Language Science, and
Age Groups C Meaning
Aging and Language Cartesian Linguistics Connectionist Models, Language
Agreement Case Structure, and Representation
Agreement Maximization Categorial Grammar Consciousness and Language
Alliteration Categorization Consistency, Truth, and Paradox
Ambiguity Causative Constructions Constituent Structure
Amygdala C-Command Constraints in Language Acquisition
Analogy Cerebellum Construction Grammars
Analogy: Synchronic and Diachronic Charity, Principle of Contact, Language
Analyticity Childrens Grammatical Errors Context and Co-Text
Anaphora Chirographic Culture Control Structures
Animacy Clitics and Cliticization Conversational Implicature
Animal Communication and Human Codeswitching Conversational Repair
Language Cognitive Architecture Conversation Analysis
Aphasia Cognitive Grammar Cooperative Principle
Areal Distinctness and Literature Cognitive Linguistics and Language Core and Periphery
Art, Languages of Learning Corpus Callosum
Articulatory Phonetics Cognitive Linguistics, Language Science, Corpus Linguistics
Artiicial Languages and Metatheory Creativity in Language Use
Aspect Cognitive Poetics Creoles
Auditory Processing Coherence, Discourse Critical Discourse Analysis
Autism and Language Coherence, Logical Critical Periods
Autonomy of Syntax Colonialism and Language Culture and Language
Color Classiication Cycle, he
B Communication
Babbling Communication, Prelinguistic D
Basal Ganglia Communicative Action Deconstruction
Basic Level Concepts Communicative Intention Dei nite Descriptions
Bilingual Education Comparative Method Deixis
Bilingualism, Neurobiology of Competence Descriptive, Observational, and
Bilingualism and Multilingualism Competence and Performance, Literary Explanatory Adequacy
Binding Compositionality Dhvani and Rasa
xi
List of Entries
Dialect Genes and Language Left Hemisphere Language Processing

Dialogism and Heteroglossia Gesture Legal Interpretation
Dif usion Government and Binding Lexical Acquisition
Digital Media Grammaticality Lexical-Functional Grammar
Diglossia Grammaticality Judgments Lexical Learning Hypothesis
Discourse Analysis (Foucaultian) Grammaticalization Lexical Processing, Neurobiology of
Discourse Analysis (Linguistic) Grooming, Gossip, and Language Lexical Relations
Discrete Ini nity Lexical Semantics
Disorders of Reading and Writing H Lexicography
Division of Linguistic Labor Habitus, Linguistic Linguistic Relativism
Dyslexia Head-Driven Phrase Structure Grammar Literacy
Hippocampus Literariness
E Historical Linguistics Literary Character and Character Types
Ellipsis Historical Reconstruction Literary Universals
Embodiment Holophrastic Stage, he Literature, Empirical Study of
Emergentism Homologies and Transformation Sets Logic and Language
Emergent Structure Logical Form
Emotion and Language I Logical Positivism
Emotion, Speech, and Writing Icon, Index, and Symbol
Emotion Words Ideal Speech Situation M
Emplotment Identity, Language and Mapping
Encoding Ideology and Language Markedness
nonc/Statement (Foucault) Idioms Market, Linguistic
Essentialism and Meaning Idle Talk and Authenticity Marxism and Language
Ethics and Language Ijtihd (Interpretive Efort) Meaning and Belief
Ethnolinguistic Identity I-Language and E-Language Meaning and Stipulation
Event Structure and Grammar Illocutionary Force and Sentence Types Meaning Externalism and Internalism
Evidentiality Image Schema Media of Communication
Evolutionary Psychology Implicational Universals Memes and Language
Exemplar Indeterminacy of Translation Memory and Language
Exemplar heory Indexicals Mental Models and Language
Extinction of Languages Inequality, Linguistic and Mental Space
Communicative Merge
F Infantile Responses to Language Metalanguage
Family Resemblance Information Structure in Discourse Metaphor
Feature Analysis Information heory Metaphor, Acquisition of
Felicity Conditions Innateness and Innatism Metaphor, Information Transfer in
Field (Bourdieu) Integrational Linguistics Metaphor, Neural Substrates of
Film and Language Intension and Extension Metaphor, Universals of
Filters Intentionality Meter
Focus Internal Reconstruction Methodological Solipsism
Foregrounding Interpretation and Explanation Methodology
Forensic Linguistics Interpretive Community Metonymy
Formal Semantics Intertextuality Minimalism
Forms of Life Intonation Mirror Systems, Imitation, and
Frame Semantics Irony Language
Framing Efects Modality
Frontal Lobe L Modern World-System, Language and
Functional Linguistics Language, Natural and Symbolic the
Language Acquisition Device (LAD) Modularity
G Language Change, Universals of Montague Grammar
Games and Language Language Families Mood
Gender and Language Language-Game Morpheme
Gender Marking Language-Learning Environment Morphological Change
Generative Grammar Language of hought Morphological Typology
Generative Poetics Language Policy Morphology
Generative Semantics Laws of Language Morphology, Acquisition of
Generic- and Speciic-Level Metaphors Learnability Morphology, Evolution and
xii
List of Entries
Morphology, Neurobiology of Poetic Language, Neurobiology of Rhythm

Morphology, Universals of Poetic Metaphor Right Hemisphere Language Processing
Motif Poetics Role and Reference Grammar
Movement Point of View Rule-Following
Music, Language and Politeness
Politics of Language S
N Possible Worlds Semantics Schema
Narrative, Grammar and Possible Worlds Semantics and Fiction Scripts
Narrative, Neurobiology of Pragmatic Competence, Acquisition of Second Language Acquisition
Narrative, Scientiic Approaches to Pragmatics Self-Concept
Narrative Universals Pragmatics, Evolution and Self-Organizing Systems
Narratives of Personal Experience Pragmatics, Neuroscience of Semantic Change
Narratology Pragmatics, Universals in Semantic Fields
Nationalism and Language Pragmatism and Language Semantic Memory
Natural Kind Terms Predicate and Argument Semantic Primitives (Primes)
Necessary and Suicient Conditions Preference Rules Semantics
Negation and Negative Polarity Prestige Semantics, Acquisition of
Network heory Presupposition Semantics, Evolution and
Neurochemistry and Language Primate Vocal Communication Semantics, Neurobiology of
Neuroimaging Priming, Semantic Semantics, Universals of
Number Principles and Parameters heory Semantics-Phonology Interface
Principles and Parameters heory and Semantics-Pragmatics Interaction
O Language Acquisition Semiotics
Occipital Lobe Print Culture Sense and Reference
Optimality heory Private Language Sentence
Oral Composition Projectibility of Predicates Sentence Meaning
Oral Culture Projection (Blending heory) Sexuality and Language
Ordinary Language Philosophy Projection Principle Signed Languages, Neurobiology of
Origins of Language Proposition Sign Language, Acquisition of
Overregularizations Propositional Attitudes Sign Languages
Prototypes Sinewave Synthesis
P Proverbs Situation Semantics
Parable Psychoanalysis and Language Socially Distributed Cognition
Paralanguage Psycholinguistics Sociolinguistics
Parameters Psychonarratology Source and Target
Parietal Lobe Psychophysics of Speech Speciic Language Impairment
Parsing, Human Speech-Acts
Parsing, Machine Q Speech Anatomy, Evolution of
Passing heories Qualia Roles Speech-Language Pathology
Performance Quantiication Speech Perception
Performative and Constative Quantitative Linguistics Speech Perception in Infants
Perisylvian Cortex Speech Production
Perlocution R Spelling
Person Radical Interpretation Spreading Activation
Philology and Hermeneutics Reading Standardization
Phoneme Realization Structure Standard heory and Extended Standard
Phonetics Rectiication of Names (Zheng Ming) heory
Phonetics and Phonology, Neurobiology Recursion, Iteration, and Stereotypes
of Metarepresentation Story and Discourse
Phonological Awareness Reference and Extension Story Grammar
Phonology Reference Tracking Story Schemas, Scripts, and Prototypes
Phonology, Acquisition of Register Stress
Phonology, Evolution of Regularization Structuralism
Phonology, Universals of Relevance heory Stylistics
Phrase Structure Religion and Language Stylometrics
Pidgins Representations Subjacency Principle
Pitch Rhetoric and Persuasion Suggestion Structure
Poetic Form, Universals of Rhyme and Assonance Syllable
xiii
List of Entries
Synchrony and Diachrony Transformational Grammar Verbal Reasoning

Syntactic Change Translation Verbal Reasoning, Development of
Syntax Truth Veriiability Criterion
Syntax, Acquisition of Truth Conditional Semantics Verse Line
Syntax, Evolution of Two-Word Stage Voice
Syntax, Neurobiology of Typology Voice Interaction Design
Syntax, Universals of
Syntax-Phonology Interface U W
Underlying Structure and Surface Wernickes Area
T Structure Word Classes (Parts of Speech)
Teaching Language Universal Grammar Word Meaning
Teaching Reading Universal Pragmatics Word Order
Teaching Writing Universals, Nongenetic Word Recognition, Auditory
Temporal Lobe Usage-Based heory Word Recognition, Visual
Tense Use and Mention Words
Text Working Memory and Language
Text Linguistics V Processing
halamus Vagueness Writing, Origin and History of
hematic Roles Verbal Art, Evolution and Writing and Reading, Acquisition of
heory of Mind and Language Acquisition Verbal Art, Neuropsychology of Writing and Reading, Neurobiology of
Tone Verbal Display Writing Systems
Topicalization Verbal Humor
Topic and Comment Verbal Humor, Development of X
Traces Verbal Humor, Neurobiology of X-Bar heory
xiv
A NOTE ON CROSS-REFERENCES AND THE ALPHABETIZATION
OF THE ENTRIES
Cross-references are signaled by small capitals (boldface when implicit). hey are designed
to indicate the general relevance of the cross-referenced entry and do not necessarily imply
that the entries support one another. Note that the phrasing of the cross-references does not
always match the entry headings precisely. In order to minimize the disruption of reading,
entries often use shortened forms of the entry headings for cross-references. For example, this
process involves parietal structures points to the entry Parietal Lobe. In some cases, a
cross-reference may refer to a set of entries. For example, architectures of this sort are found
in connectionism alerts the reader to the presence of entries on connectionism generally,
rather than to a single entry. Finally, a cross-reference may present a heading in a diferent
word order. For example, the target entry for here we see another universal of phonol-
ogy would be listed as Phonology, Universals of.
In general, entries with multiword headings are alphabetized under their main language
term. hus, the entry for Universals of Phonology is listed as Phonology, Universals of. he
main exceptions to this involve the words language and linguistic or linguistics, where another
term in the heading seemed more informative or distinctive in the context of language sciences
(e.g., Linguistic Market is listed as Market, Linguistic).
xv
PREFACE: ON THE VERY IDEA OF LANGUAGE SCIENCES
A title referring to language sciences tacitly raises at least three a complex of principles that roughly dei ne scientiic method.
questions. First, what is a science? Second, what is language? hese principles do not apply in the same way to chemical
Finally, what is a language science? I cannot propose answers interactions and group relations and that is one reason why
to these questions in a short preface. Moreover, it would not be narrow demarcation criteria fail. However, they are the same
appropriate to give answers here. he questions form a sort of general principles across diferent domains. Very simply,
background to the essays and entries in the following pages, scientiic method involves inter alia the following practices:
essays and entries that often difer in their (explicit or implicit) 1) the systematic study of empirically ascertainable patterns
answers. However, a preface of this sort can and should in a given area of research; 2) the formulation of general prin-
indicate the general ideas about science and language that ciples that govern those patterns; 3) the attempt to uncover
governed the development of he Cambridge Encyclopedia of cases where these principles do not govern observed pat-
the Language Sciences. terns; 4) the attempt to eliminate gaps, vagueness, ambiguity,
and the like from ones principles and from the sequences of
principles and data that produce particular explanations; and
WHAT IS SCIENCE?
5) the attempt to increase the simplicity of ones principles
Philosophers of science have often been concerned to dei ne a and particular explanations. Discourses are scientiic to the
demarcation criterion, separating science from nonscience. I extent that they routinely involve these and related practices.
have not found any single criterion, or any combination of cri- Note that none of this requires, for example, strict falsiica-
teria, compelling in the sense that I have not found any argu- tion or detailed prediction. For example, social phenomena
ment that, to my satisfaction, successfully provides necessary are most often too complex to allow for signiicant prediction,
and sufficient conditions for what constitutes a science. in part because one cannot gather all the relevant data before-
In many ways, ones acceptance of a demarcation criterion is hand. h is does not mean that they are closed to systematic
guided by what one already considers to be a science. More explanations after the fact, as more data become available.
exactly, ones formulation of a demarcation criterion tends to Of course, following such methodological guidelines is not
be a function of what one takes to be a paradigmatic science or, all there is to the actual practice of science. here are always
in some cases, an exemplary case of scientiic practice. multiple options for formulating general principles that it
he advocates of strict demarcation criteria meet their mir- the current data. he evaluation of simplicity is never entirely
ror opposites in writers who assert the social construction of straightforward. heories almost invariably encounter anom-
science, writers who maintain that the diference between alous data in some areas and fail to examine other areas.
science and nonscience is simply the diference between dis- Moreover, in many cases, the very status of the data is unclear.
tinct positions within institutions, distinct relations to power. Despite all this, we hierarchize theories. We teach some and do
Suppose we say that one discipline or theory is a science and not teach others. Agencies fund some and do not fund others.
another is not. h is is just to say that the former is treated as he very nature of the enterprise indicates that even in ideal cir-
science, while the latter is not. he former is given authority in cumstances, this cannot be purely meritocratic. Moreover, real
academic departments, in relevant institutions (e.g., banks, in circumstances are far from ideal. hus, in the real world, adher-
the case of economics), and so on. ence to methodological principles may be very limited (see, for
Again, this is not the place for a treatise on the philosophy example, Faust 1984, Mahoney 1977, and Peters and Ceci 1982).
of science. Here it is enough to note that I believe both sides h is is where social constructionism enters. It seems undeni-
are partially correct and partially incorrect. First, as already able that relations of institutional power, the political economy
noted, I do not believe that there is a strict, dei nitive demar- of professions, and ideologies of nation or gender guide what is
cation criterion for science. However, I do believe that there is institutionalized, valued, funded, and so forth.
xvii
Preface
In putting together a volume on science, then, I have tried

to incorporate the insights of both the more positive views Group Dynamics
of science and the more social constructionist views. Put in Society
Individual Interactions
a way that may seem paradoxical, I have tried to include all
approaches that it the loose criteria for science just mentioned.
I believe that these loose criteria apply not only to paradigmatic
sciences themselves but also to many social critiques of sci-
ence that stress social construction. I have therefore included a
wide range of what are sometimes called the human sciences.
Mental Representations
Indeed, the volume could be understood as encompassing the Mind
language-relevant part of the human sciences which leads to Intentions
our second question.
WHAT IS LANGUAGE?
Like science, ones dei nition of language depends to a great
extent on just what the word calls to mind. Ones view of lan- Body Associative Networks
guage is likely to vary if one has in mind syntax or seman- Brains
tics, hearers or speakers, dialogues or diaries, brain damage
or propaganda, storytelling or acoustic phonetics. A i rst Figure 1. Levels of cognitive analysis. Between the levels, black
impulse may be to see one view of language as correct and the arrows represent the direction of explanation, while hollow arrows repre-
others as false. And, of course, some views are false. However, sent the direction of interpretation. Within the levels, the superior items
I believe that our understanding of language can and, indeed, are more computationally tractable or algorithmically speciiable models
should sustain a great deal of pluralism. of the inferior items, either singly (in the case of brains and intentions)
In many ways, my own paradigm for human sciences is cog- or collectively (in the case of individual interactions). Tractability may be
nitive science. Cognitive science brings together work from a produced by simpliication (as in the case of bodily architectures), by
remarkable array of disciplines literally, from Anthropology systematic objectiication (as in the case of mental architectures), or by
to Zoology. Moreover, it sustains a range of cognitive archi- statistical abstraction (as in the case of social analysis).
tectures, as well as a range of theories within those architec-
tures. hus, it is almost by its very nature pluralistic. Of course,
some writers wish to constrain this pluralism, insisting that one most important architectures of the latter sort are found in
architecture is right and the others are wrong. Certainly, one connectionism.
can argue that particular architectures are wrong. However, As a wide range of writers have stressed, the distinctive fea-
perhaps the most noteworthy aspect of cognitive science is ture of mind our second level of analysis is intentional-
that it sustains a series of types of cognitive architecture. In ity. However, intentionality, as subjective and experiential, is
Cognitive Science, Literature, and the Arts (2003), I argued that often not well suited for scientiic study. Many theorists have
these types capture patterns at diferent levels of analysis. hus, therefore sought to systematize and objectify our understand-
all are scientiically valuable. ing of mind. Most cognitive treatments of the mental level
More exactly, we may distinguish three levels of cognitive have their roots in folk psychology, a minimal, common-
analysis: bodies, minds, and groups or societies. hese lev- sense objectiication of intention in terms of beliefs and aims.
els stand in a hierarchical relation such that bodies are more But these cognitive treatments draw on empirical research
explanatorily basic than minds, and minds are more explana- and principles of scientiic method to develop models of the
torily basic than groups or societies. Lower levels provide human mind that are sometimes very far removed from folk
causally necessary principles for higher levels. Minds do not psychology. Speciically, they most often replace belief by
operate without brains. People without minds do not interact mental representations and algorithmically speciiable
in groups. In other words, lower levels explain higher levels. operations on those representations. We may therefore refer to
However, higher-level patterns provide interpretive principles these models as representational. Representationalism serves
for understanding lower levels (see interpretation and to make intention more tractable through a mentalistic archi-
explanation). We explain the (mental) feeling of fear by ref- tecture that is precisely articulated in its structures, processes,
erence to the (bodily) operation of the amygdala. But, at the and contents.
same time, we understand amygdala activity as fear because Finally, our treatment of societies may be loosely divided
we interpret that activity in terms of the mental level. into the more intentional or mental pole of individual inter-
In the analysis of cognition, the most basic, bodily cogni- action and the less subjective, more broadly statistical pole of
tive architecture is provided by neurobiology. However, due group dynamics. (See Figure 1.)
to the intricate particularity of neurobiology, we often draw hese divisions apply to language no less than they apply
on more abstract associative models at this level. hese mod- to other areas of human science. We draw on our represen-
els serve to make the isolation and explanation of patterns tational account of syntax to understand certain brain pro-
less computationally complex and individually variable. he cesses in the perisylvian cortex . Conversely, we explain
xviii
Preface
the impairment of (mental) syntactic capacities by reference essays. However, it was not the only factor. In language sciences,
to (bodily) lesions in that area. For our purposes, the crucial and indeed in human sciences generally, we need to add two
part of this analysis is its implication that language includes further considerations. he preceding analysis implicitly treats
all three levels and that the sciences of language should language patterns as if they are comparable to any patterns
therefore encompass brains, associative networks, intentions, isolated in the natural sciences. However, there are two difer-
mental representations, individual interactions, and group ences between patterns in language and, say, the patterns iso-
dynamics. h is takes us to our third question. lated by physicists. First, language patterns are mutable. hey
are mutable in three ways at the level of groups, at the level of
individual minds or brains, and at the level of common genetic
WHAT IS A SCIENCE OF LANGUAGE?
inheritance. Insofar as language patterns change at the level
he preceding sections converge on a broad, pluralistic but of groups, this mutability is comprehended by group dynam-
not indiscriminate account of what constitutes a language ics and related processes (most obviously in historical lin-
science. Speciically, a language science is the application guistics). But mental and neurobiological theories do not
of general principles of scientiic method to language phe- necessarily treat the other two sorts of mutability, for such the-
nomena at any level. At the level of brain, we have neuro- ories tend to focus on steady states of language. We therefore
linguistics (see brain and language). At the level of account for changes in the individual mind or brain by refer-
associative networks, we have connectionism. Intentionalism ence to development or acquisition (see phonology, acqui-
leads us to certain forms of ordinary language philos- sition of; syntax, acquisition of ; and so on). We account
ophy. Representational architectures are particularly well for changes in common genetic properties through the evolu-
developed, including Noam Chomskys various theories (see, tion of language (see phonology, evolution of; syntax,
for example, minimalism), cognitive linguistics, and evolution of; and so on).
other approaches. Personal interaction and group dynamics he second diference between patterns in language and pat-
are taken up in pragmatics, linguistic discourse analy- terns isolated by physicists is related to the i rst. Just as we may
sis, and sociolinguistics. Just as language encompasses be insuicient in language, we may be more than suicient. In
patterns at all these levels, language science necessarily other words, there is a diference between ordinary usage and
includes systematic accounts of language at all these levels. skilled usage. Rocks do not fall well or badly. hey simply fall,
Again, the levels of language are interrelated without being and they do so at the same rate. People, however, speak well
reducible. Similarly, the various sciences are interrelated or badly, efectively or inefectively, in a manner that is clichd
systematically interrelated through upward explanation or unusually creative (see creativity in language use).
and downward understanding or interpretation without he point is most obvious in verbal art which leads us to the
being reducible. most sweet and pleasing sciences of poetry, as Cervantes put
he preceding points should serve to clarify something that it (1950, 426).
is obvious, but rather vague, in ordinary speech: Language In keeping with the preceding analysis, then, the main top-
science is not the same as language. Language science is a ics in language science are treated initially in a series of seven
systematic treatment of language that seeks to provide both overview essays. he i rst essay provides a general introduc-
explanation and understanding. hus, an encyclopedia of tion to the study of language. Its purpose is to orient readers
the language sciences does not present the same material as toward the ield as a whole. he second and third essays turn to
an encyclopedia of language. It presents the current state of the mental level of language since this is the most widely ana-
theoretical explanation and understanding (along with some lyzed. Due to the amount of work in this area, and due to the
historical background that is important for contextualizing diversity of approaches, the treatment of this level is divided
current theories). It does not present the current state of knowl- into two chapters. he i rst addresses formal aspects of lan-
edge about particular features of particular languages except guage syntax, phonology, and so forth. he second takes up
insofar as these features enter into research programs that aim meaning. he fourth and i fth chapters address the other two
toward broader explanatory accounts or principles of more levels of language society (at the top) and the brain (at the
general understanding. hus, the phonology of Urdu, the bottom). he latter also addresses the topics of genetics and
morphology of Quechua, the metrical principles of English evolution, integrating these with the treatment of the brain. he
verse lines, and the story and discourse structure of sixth chapter takes up language acquisition. hus, it turns from
Chinese narratives enter into the following essays and entries the evolution of the general language capacities of the human
only insofar as they enter into larger theoretical concerns. brain to the development of the particular language compe-
Of course, to say this is only to mark out a general area for an tence of individual human minds. Finally, the seventh chap-
encyclopedia of the language sciences. It does not determine ter considers the nonordinary use of language in verbal art.
precisely what essays and/or entries should make up such a he subsequent entries specify, elaborate, augment, and
work. h is leads to our i nal concern. revise the ideas of these essays. Here, of course, the editor of
a volume on language sciences faces the problem of just what
entries should be included. In other words, if language sciences
THE STRUCTURE OF THE VOLUME
encompass the language-related part of neuroscience, social
he preceding view of language science guided the formulation science, and so forth, just what is that language-related part?
of topics for the entries and the organization of the introductory What does it include, and what does it exclude? One might dei ne
xix
Preface
this part very narrowly as including only phenomena that are be important things to say, not only about the neurobiology
necessarily bound up with oral speech, sign languages, of syntax, but also about the neurobiology of pragmat-
or writing. More theoretically, one might dei ne this part as ics and the neuropsychology of verbal art. he i rst has
comprising neurobiological, mental, or social phenomena that certainly been more fully researched than the second or third.
occur only in connection with distinctive properties of speech, But that is only more reason to stress the importance of the sec-
signing, or writing. ond and third, to bring together what research has been done,
Certainly, an encyclopedia treating language will focus and to point to areas where this research could be productively
on phenomena that are inseparable from speech, sign lan- extended.
guages, and/or writing and on such distinctive aspects of natu- While it is possible to be systematic with research areas, one
ral language as syntax. However, here, too, I believe it would cannot be systematic with theories. heories are more idiosyn-
be a mistake to coni ne language sciences within a narrowly cratic. hey difer from one another along many axes and can-
dei ned domain. herefore, I have adopted a looser criterion. not be generated as a set of logical possibilities. I have sought to
he volume centrally addresses distinctive properties of natu- represent theories that have achieved some level of acceptance
ral language. However, it takes up a wider range of phenomena in scientiic communities. Given limitations of space, decisions
that are closely connected with the architectural or, even more on this score have often been diicult particularly because
importantly, the functional features of speech, sign languages, social constructionist and related analyses show that accep-
and writing. tance in scientiic communities is by no means a straightfor-
here are several cognitive operations for which speech, ward function of objective scientiic value.
signing, and writing appear to have direct functional conse- h is leads us once again to the issue of the validity of theo-
quences. One is referential the speciication, compilation, ries. It should come as no surprise that my view of the issue in
and interrelation of intentional objects (see the entries on efect combines a pluralistic realism with a roughly Lakatosian
reference). Here I have in mind phenomena ranging from advocacy of research programs and a Feyerabend-like practical
the division of the color spectrum to the elaboration of causal anarchism (Feyerabend 1975; Lakatos 1970). Speciically, I take
relations. A second area is mnemonic the facilitation and par- it that some theories are true and others are not. However, I do
tial organization of memory (see, for example, encoding). A not believe that only one theory is true. Diferent theories may
third is inferential the derivation of logical implications. A organize the world in diferent ways. here is no correct way of
fourth is imaginative the expansion and partial structuring organizing the world (though some ways will be more useful
of simulation. One could think of the i rst and second func- than others for particular purposes). On the other hand, once
tions as bearing on direct, experiential knowledge of present the world is organized in a certain way, then certain accounts of
or past objects and events. he third and fourth functions bear, the world are correct and certain accounts are incorrect. To take
rather, on indirect knowledge of actual or possible objects and a simple example, we may divide the color spectrum in difer-
events. Two other functions are connected with action rather ent ways (see color classification). No division is correct
than with knowledge. he i rst is motivational the extension or incorrect. But once we have a division, there are facts about
or elaboration of the possibilities for emotional response (see the color of particular objects. (h is view is related to Donald
emotion and language). A i nal area is interpersonal the Davidsons (1984) argument that truth is not relative to a con-
communication of referential intents, memories, inferences, ceptual scheme, though it is, of course, relative to the mean-
simulations, and motivations. ing of ones words. It also may have some similarity to Hilary
In determining what should be included in the volume, Putnams (1981) internal realism, depending on how that is
I have taken these functions into account, along with archi- interpreted.)
tectural considerations. hus I see issues of interpretation Moreover, even for one organization of the world, we can
and emplotment (one of the key ways in which we organize never dei nitively say that a given theory is or is not true. Note
causal relations) as no less important than phonology or syn- that this means we cannot even strictly falsify a theory. We can
tactic structure. Of course, we have more i rmly established refer to the ongoing success of a research program and that is
and systematic knowledge in some areas than in others. hus important. Yet I do not share Imre Lakatoss (1970) optimism
some entries will necessarily be more tentative, and make refer- about research programs. To some extent, research programs
ence to a broader variety of opinion or a more limited research appear to succeed insofar as they have powerful institutional
base. But that is not a reason to leave such entries aside. Again, support, often for not very good intellectual reasons. Here, then,
the purpose of an encyclopedia of language science is not to I agree with Paul Feyerabend (1975) that orthodoxy in theori-
present a compilation of well-established particular facts, but zation is wrong. It is wrong not only in explicitly or implicitly
rather to present our current state of knowledge with respect to identifying institutional support with validity. hus, it is wrong
explanation and understanding. not only for social constructionist reasons. It is wrong also for,
In keeping with this, when generating the entries (e.g., so to speak, positivist reasons. It is wrong in diminishing the
Phonology, Syntax, Neurobiology of Phonology, likelihood of intellectual progress, the likelihood of increasing
Neurobiology of Syntax, Acquisition of Phonology, and so the validity of our theories, which is to say the scope of explana-
on), I have tried to be as systematic as possible. hus the vol- tion and understanding produced by these theories.
ume includes some topics that have been under-researched Whether or not this very brief sketch points toward a
and under-theorized. For example, if neurobiology does in fact good philosophy of science, it does, I believe, point toward
provide a causal substrate for higher levels, then there should a good philosophy for an encyclopedia of science perhaps
xx
Preface
particularly language science. I have tried to follow this phi- WORKS CITED AND SUGGESTIONS FOR FURTHER READING
losophy throughout the volume. Speciically, I have sought to Bakhtin, M. M. 1981. he Dialogic Imagination: Four Essays. Ed.
present a range of theoretical ideas (as well as more theory- Michael Holquist. Trans. Caryl Emerson and Michael Holquist.
independent topics), placing them together in such a way as Austin: University of Texas Press.
to encourage a mutual sharpening of ideas and insights. To Cervantes Saavedra, Miguel de. 1950. he Adventures of Don Quixote.
borrow M. M. Bakhtins terms (1981), I have not set out to pro- Trans. J. M. Cohen . New York : Penguin.
vide a monological source of authoritative discourse. Rather, I Davidson, Donald. 1984. On the very idea of a conceptual scheme.
have sought to present a heteroglot volume with which read- In Inquiries into Truth and Interpretation, 18398. Oxford: Oxford
ers may interact dialogically (see dialogism and hetero- University Press.
glossia) hopefully, to produce more intellectually adequate Faust, David. 1984. he Limits of Scientiic Reasoning. Minneapolis:
University of Minnesota Press.
theories later. Toward this end, I have encouraged authors to
Feyerabend, Paul. 1975. Against Method: Outline of an Anarchistic
be open about their own judgments and attitudes. here is a
heory of Knowledge. London: Verso.
common view that a piece of writing is biased if the speaker Hogan, Patrick Colm. 2003. Cognitive Science, Literature, and the Arts:
frankly advocates one point of view. But, in fact, the opposite is A Guide for Humanists. New York : Routledge.
the case. A piece of writing is biased if a speaker acts as though Lakatos, Imre. 1970. Falsiication and the methodology of scientiic
he or she is simply reporting undisputed facts, when in fact he research programmes. In Criticism and the Growth of Knowledge,
or she is articulating a partisan argument. Being open, dialogi- ed. Imre Lakatos and Alan Musgrave, 91195. Cambridge: Cambridge
cal, and multivocal does not mean being bland. Indeed, insight University Press.
is more likely to be produced through the tension among ideas Mahoney, Michael. 1977. Publication prejudices: An experimental
and hypotheses that are clearly delineated in their diferences. study of coni rmatory bias in the peer review system. Cognitive
herapy and Research 1: 16175.
h is is no less true in the language sciences than elsewhere.
Peters, Douglas, and Stephen Ceci. 1982. Peer-review practices of psy-
Indeed, that is one reason why this volume treats language sci-
chological journals: he fate of published articles, submitted again.
ences, not the science of language.
Behavioral and Brain Sciences 5.2: 18795.
Patrick Colm Hogan Putnam, Hilary. 1981. Reason, Truth, and History. Cambridge:
Cambridge University Press.
xxi
ACKNOWLEDGMENTS
First of all, I must thank Phil Laughlin, who (inspired by the exemplary MIT Encyclopedia of
the Cognitive Sciences) suggested the project initially and invited me to make a more formal
proposal. Without Phils initial idea and subsequent encouragement, this volume would not
now exist. Eric Schwartz took over from Phil, then Simina Calin took over from Eric; both were
supportive and helpful, as were the several editorial assistants, most recently April Potenciano,
Christie Polchowski, and Jeanie Lee. Regina Paleski and Mark Fox ably shepherded this com-
plex project through the production process; Phyllis Berk worked with devotion on copy edit-
ing the manuscript; and Robert Swanson took on the tough job of indexing.
he members of the editorial board kindly provided comments on the list of entries and
suggested possible authors. hey also served as second readers for most of the entries. I am
indebted to them all. It is diicult and unrewarding work, but extremely valuable. Some entries
were evaluated by specialists not on the editorial board. I am deeply grateful to the following
scholars who agreed to read and comment on entries: J. Abutalebi, E. Ahlsn, A. Aikhenvald,
S. Anderson, A. Atkin, S. Barker, J. Beall, D. Beaver, H. Bejoint, H. Ben-Yami, A. Berger,
D. Bickerton, A. Bilgrami, S. Blackmore, J. Blommaert, C. Bowern, E. Brenowitz, J. Bybee,
J. Carroll, T. Deacon, M. DeGraf, J.-L. Dessalles, A. Edgar, C. Elgin, R. Ferrer i Cancho, J. Field,
H. Filip, D. Finkelstein, J. Forrester, R. Gibbs, R. Gibson, R. Giora, R. Gleave, K. Gluer-Pagin,
M. Goral, M. Hashimoto, J. Heath, D. Herman, R. Hilpinen, J. Hintikka, K. Hof man, K. Holyoak,
P. Hutchinson, J. Hyun, P. Indefrey, M. Israel, K. Johnson, M. Johnson, J. Kane, P. Kay, A. Kibort,
S. Kiran, C. Kitzinger, W. Labov, B. Laford, C. Larkosh, A. Libert, P. Livingston, K. Ludwig,
M. Lynch, J. Magnuson, G. Marcus, R. May, J. McGilvray, A. Mehler, S. Mills, D. Moyal-Sharrock,
K. Oatley, B. OConnor, L. Pandit, B. Partee, J. Pennebaker, P. Portner, C. Potts, J. Robinson,
S. Rosen, S. Ross, J. Saul, R. Schleifer, M. Shibatani, R. Skousen, S. Small, W. Snyder, M. Solms,
F. Staal, P. Stockwell, L. Talmy, H. Truckenbrodt, J. P. Van Bendeghem, W. van Peer, S. Wheeler,
and L. Zhang. hanks to M. Cutter for help with the illustrations.
For some time, a graduate assistant, Karen Renner, took care of many secretarial duties.
h is work was supported by the English Department at the University of Connecticut, with
some added funding from the University of Connecticut Research Foundation. he support of
the English Department was due to the kindness and commitment of our department head,
Bob Tilton without his help, this project would not have been possible.
Work for the entry on Ad Hoc Categories was supported by National Science Foundation
Grant BCS-0212134 and DARPA Contract FA865005-C-7256 to Lawrence W. Barsalou.
he entry on Dyslexia was prepared with support from a British Academy Research
Readership to Margaret J. Snowling.
Preparation of the manuscript for Speech Production was supported by NIDCD A-93 and
NIDCD grant DC-03782, both to Haskins Laboratories.
Research for Paisley Livingstons entries beneited from i nancial support from the Research
and Postgraduate Studies Committee of Lingnan University, Hong Kong.
xxiii
the sum of individual actions performed by separate persons;
in particular, each individual involved must take into consid-
eration the other individuals beliefs, intentions, and actions in
a way that can be described as cooperative. A shared coopera-
1 tive activity between two individuals can be dei ned in terms
of a set of attitudes held by the cooperating individuals and as
a way of carrying out the individual action (Bratman 1992). he
attitudes are as follows:
LANGUAGE STRUCTURE IN ITS HUMAN
CONTEXT: NEW DIRECTIONS FOR THE LANGUAGE (a) Each individual participant intends to perform the
joint action. hat is, each participants intention is not
SCIENCES IN THE TWENTY-FIRST CENTURY directed simply toward his/her individual action but
toward the joint action that is carried out by both partici-
William Croft pants together.
(b) Each participant intends to perform the joint action
in accordance with and because of each ones meshing
subplans. hat is, each participants individual actions are
intended to mesh with the other participants actions in
he science of language in the twenty-i rst century is likely to order to successfully achieve the joint action.
expand its scope compared to that of the twentieth century. (c) Neither participant is coercing the other.
he twentieth-century science of language focused its atten- (d) Each participant has a commitment to mutual sup-
tion on the analysis of language structure: the sound system port. hat is, each one will help the other to carry out the
of a language (phonology) and its grammatical system subplans; each participant is thus responsible for more than
(morphology and syntax). he analysis of linguistic struc- just execution of his/her own subplan.
ture, or form, is central to the science of language. After the
(e) All of (a)(d) are common ground, or shared knowl-
middle of the twentieth century, however, greater attention
edge between the individuals. he concept of common
was placed on the relationship between language form and its
ground plays a central role in understanding the function
psychological and social context.
of language in social interaction; it is discussed more fully
he analysis of linguistic structure will remain central to
toward the end of this essay.
the science of language. However, understanding language
in context will undoubtedly be a crucial feature of language Finally, in addition to these mental attitudes on the part
science in the twenty-i rst century. h is essay focuses on the of the participants, there must be mutual responsiveness in
basic principles that have emerged in research on language in action. hat is, the participants will coordinate their individual
its social and cognitive context, the ways that this context con- actions as they are executed in order to ensure that they mesh
strains language structure and use, and the new directions in with each other and, hence, that the joint action will be suc-
research implied by the integration of language structure and cessfully carried out (to the best of their abilities). Coordination
context. h is essay is necessarily selective in the topics cov- is essential in carrying out joint actions successfully, and this is
ered, and the selection represents a particular way to integrate where language plays a central role in joint actions.
language form and its context. It also brings together theories he social cognitive abilities necessary for shared cooper-
that have originated in philosophy, psychology, and sociology, ative activity appear to be unique to humans, providing what
as well as diferent branches of linguistics. Such efort is neces- Michael Tomasello (2008) calls the social cognitive infrastruc-
sary in order to treat language as a unitary phenomenon, and ture necessary for the evolution of the capacity for modern
also to relate central questions of linguistic analysis to other human language. Other species than humans have a capacity
scientiic domains. Language structure cannot be fully under- for imitative learning of complex vocalizations (see animal
stood without situating it with respect to current theories of communication and human language). h is has not
joint action, social cognition, conceptualization of experience, been suicient to lead to the evolution of human-like language
memory and learning, cultural transmission and evolution, among these species. Nonhuman primates have the ability to
shared knowledge and practice in communities, and demo- plan actions and to recognize regularities in behavior of other
graphic processes in human history. creatures, enough to manipulate their behavior. hese abilities
are preconditions for executing complex actions such as joint
actions, but they are not suicient for doing so.
WHY TALK? THE PRAGMATICS OF LANGUAGE
Research on primate behavior in natural and experimen-
Why do we talk? Why does language exist? It is only by answer- tal settings suggest that some primates even have the ability
ing these questions that we can understand how language its to recognize conspeciics as beings with intentional states like
in its context. he answer is that language plays an essential themselves in some circumstances (Tomasello, 2008; this abil-
role in social interaction, fundamentally at the level of joint ity develops in humans only at around nine months of age).
action between two or more individuals (Clark 1996; Tomasello, Nevertheless, it has not been demonstrated that nonhuman pri-
2008). What makes a joint action joint is that it is more than just mates have the ability to engage in shared cooperative activity
1
as already dei ned. Tomasello (ibid.) suggests that in particu- butterly) or a grammatical construction (such as the Modiier-
lar, helpfulness, Michael Bratmans condition (d), may be criti- Head construction for English noun phrases) emerges as a con-
cal to the evolution of the ability to carry out joint actions. vention when it becomes a regularly used means for solving the
he i nal condition for joint action is that the individual recurrent coordination problem of referring to a speciic expe-
actions must be coordinated in accordance with the shared rience that is to be communicated.
attitudes of the participants. Any joint action poses coordi- Linguistic convention actually operates at two levels: the
nation problems between the participants (Lewis 1969). Any grammatical level of words and constructions, at which the
means that is used to solve the coordination problem on a speakers intentions are formulated; and the phonological level
particular occasion is a coordination device. here are vari- of the articulation and perception of the sounds that make up
ous coordination devices that human beings use to solve the the grammatical units. h is is the phenomenon described as
coordination problems of joint actions, of which the simplest is duality of patterning in language (Hockett 1960). One could
joint attention to jointly salient properties of the environment imagine in principle that linguistic convention possessed only
(Tomasello 1999, 2003). But by far the most efective coordina- one level: perceivable sounds (or gestures or written images,
tion device is for the participants to communicate with each depending on the medium), corresponding to part (i) in the
other: By communicating their mental states, the participants dei nition of convention, that directly conveyed the speakers
greatly facilitate the execution of any joint action. intention (the recurrent coordination problem) as a whole, cor-
communication is itself a joint action, however. he responding to part (v) in the dei nition of convention. hese
speaker and hearer must converge on a recognition of the exist in interjections with speciic functions such as Hello and
speakers intention by the hearer (see communicative hanks. However, most linguistic expressions are complex,
intention; see also cooperative principle). h is is H. consisting of discrete, meaningful units. Complex linguistic
Paul Grices dei nition of meaning ([1948] 1989), or Herbert expressions evolved for two reasons: First, the number of difer-
Clarks informative act (Clark 1992; see the next section). And ent speaker intentions to be communicated grew to be indei-
this joint action poses coordination problems of its own. he nitely large; and second, a speakers intended message came to
essential problem for the joint action of communication is that be broken down into recurrent conceptual parts that could be
the participants cannot read each others minds. Language is recombined to produce the indei nite variety of messages.
the primary coordination device used to solve the coordina- Again, one could imagine that each conventional linguis-
tion problem of communication, which is in turn used to solve tic unit consisted of a unique sound (gesture, image). But lan-
the coordination problem for joint actions in general. Indeed, guages have distinct meaningful units that are made up of
that is the ultimate purpose of language: to solve the coordina- diferent sequences of the same sounds: bat, sat, Sam, same,
tion problem for joint actions, ranging from the mundane to the tame, time, etc. h is system has evolved for the same two rea-
monumental (Clark 1999). h is fact is essential for understand- sons: the increasing number of meaningful units (even the
ing the structure of discourse and the linguistic expressions recurring ones) necessary to convey the indei nitely large num-
used in it, as Clark (1992, 1996) has shown for many aspects of ber of speaker intentions, and an ability to break down a sound
conversational interaction, and it also accounts for many fun- signal (or gesture, or image) into parts that can be recombined
damental properties of linguistic structure. as a sequence of sounds (or gestures or images). hus, the dual-
ity of patterning characteristic of human language has evolved
to accommodate the huge number of speaker intentions that
LANGUAGE, COMMUNICATION, AND CONVENTION
people want to convey, and to exploit the facts that intentions
A language can be provisionally described as a conventional can be broken down into recombinable conceptual units and
system for communication (this dei nition is modiied later in that the medium of expression can be broken down into recom-
this section). David Lewis (1969) and Clark (1996, Chapter 5) binable units as well.
dei ne convention as follows: Language is therefore a joint action that operates simulta-
neously at four levels (Clark 1996). he higher-numbered levels
(i) A regularity in behavior
are dependent on the lower-numbered levels; the individual
(ii) that is partly arbitrary (that is, we could have equally actions of the interlocutors are given in italics:
chosen an alternative regularity of behavior),
(4) proposing and taking up a joint project (joint action);
(iii) that is common ground in the community,
(3) signaling and recognizing the communicative intention;
(iv) as a coordination device
(2) formulating and identifying the proposition;
(v) for a recurrent coordination problem.
(1) producing and attending to the utterance.
In other words, conventions can emerge when members of
the community have shared knowledge that a certain repeated he highest level corresponds to the illocutionary
behavior can act among them as a coordination device for a force in speech-act theory (Austin 1962); the next level to
recurrent coordination problem. h is dei nition of convention Gricean meaning, or the informative act (Clark 1992); the next
is general: It applies to conventions such as shaking hands (or level to the propositional act (Searle 1969); and the lowest level to
kissing on the cheek) for greeting, or driving on the right (left) the utterance act (Austin 1962; Searle 1969). Each level enables
side of the road. he dei nition also applies straightforwardly to the level(s) above it, and succeeds only if the level(s) below has
language: A string of sounds (i.e., a word or morpheme, such as been successfully achieved (e.g., one cannot recognize the
2
Language Structure in Its Human Context
communicative intention if one did not pay attention to the diferent recurrent coordination problems. For example, patient
utterance produced). is ambiguous between the linguistic semantic role (he patient
in sentence 25 is Roland ) and a role in the domain of medicine
(he patient in room 25 is Roland ). Linguistic convention alone
THE INCOMPLETENESS OF CONVENTION
cannot tell which meaning is intended by the speaker. Only
he model of language as joint action describes the social cog- joint salience, provided in the example sentences by the mean-
nitive system that must have evolved in the human species for ings of the other words and the broader context of conversa-
modern human language to have emerged. It describes what tion, will successfully solve the coordination problem of what
appears to be a stable system that led to the emergence of highly is meant by patient.
complex cooperative activity among humans, namely, what is Indexicality and ambiguity are so pervasive in language that
called society or culture. But it is not a complete picture of the no utterance can be successfully conveyed without recourse to
nature of language in social interaction. nonconventional coordination devices. But convention itself is
Linguistic convention can function as a coordination device also incomplete. h is is because every situation being commu-
for communication because there are recurrent coordination nicated is unique and can be construed as the recurrence of
problems in communication: People have repeatedly wished diferent coordination problems. he simplest example of this
to convey similar intentions formulated in similar concepts. phenomenon is that diferent words can be used to describe
Convention, linguistic or otherwise, is a regularity of behav- the current situation, each representing a diferent construal
ior that emerges in a community or society. But convention of the current situation in comparison to prior situations. For
must emerge from previous successful communication events example, one can refer to an individual as the prime minister,
where a convention did not previously exist. In other words, Tony Blair, the Labour Party leader, my friend, that guy, he, etc.;
there must be a precedent: You and I use a coordination device each expression construes reference to the current person as
because we used it before (or observed it used before), and it the recurrence of a diferent coordination problem.
worked. Following a precedent is a coordination device, but it he need to use nonconventional coordination devices as
is not (yet) convention; it is based not on regular behavior that well as linguistic convention in communication is not generally
is mutually known in the community but only on previous suc- a problem for successful joint actions by cooperative human
cessful uses that we are aware of (Lewis 1969). beings. However, in some contexts, successful coordination
Following a precedent cannot be the ultimate root of con- is quite diicult. For example, scholarly discourse on abstract
vention either. It always requires a successfully coordinated theoretical concepts often leads to alternative construals of
communicative act as a precedent. he ultimate coordination what is intended by particular scholars. What do we take Plato
device is joint salience: Each participant can assume that in a to have meant? h is changes over time and across persons.
particular situation, certain features are salient to both partici- Alternative construals, not always accurately described as
pants (Lewis 1969). Joint salience is possible because humans misunderstandings, occur in more everyday circumstances
have the social cognitive capacity for joint attention to their as well, as readers can verify for themselves.
environment (Tomasello 2003). Joint attention forms a basis for In addition, human beings are not always cooperative. he
common ground, as discussed later in this article. complexity of language as joint action here leaves open many
Linguistic convention, however, is not perfect; it does not possible means of language abuse. For example, lying abuses
trump or replace the nonconventional coordination devices of linguistic convention in its role of helping coordinate a shared
precedent and joint salience in the act of communication. h is cooperative activity, namely, coming to a shared belief. Other
is partly because of the kind of conventions found in language, types of language abuse exploit nonconventional coordination
and partly because of the nature of convention itself. devices. For example, in one lawsuit, the courts ordered a gov-
Linguistic conventions are incomplete because of the phe- ernment agency to destroy certain documents, intending the
nomena of indexicality and ambiguity (Clark 1996). A lin- term to denote their information content; the agency destroyed
guistic convention such as hat or ind represents a type, but on the documents, that is, the physical objects, after making cop-
a particular occasion of use, we often intend to convey a partic- ies of them (Bolinger 1980). Here, the ambiguity of documents
ular token of the category. hus, I found the hat communicates requires recourse to joint salience, but the agency abused this
a particular taking event involving a speciic hat. In order to nonconventional coordination device (the lawsuit was about
identify which i nding event and which hat, the interlocutors privacy of information). Finally, the fact that a current situation
must rely on joint salience in the context, facilitated in part by can be construed as an instance of diferent recurrent coordi-
the past tense of ind and the article the combined with hat, to nation problems leads to alternative framings of the situation,
coordinate successfully on the right i nding event and the right such as referring to an entity as a fetus or an unborn baby. hese
hat. Linguistic shifters, such as the pronoun I, more explicitly alternative framings bias the conceptualization of the current
require joint salience, namely, who is the speaker in the con- situation in ways that invite certain inferences and courses of
text. Proper names denote tokens, but even a proper name such action, rather than others.
as William Croft may be (and is) used for more than one indi-
vidual, for example, the contemporary linguist and the English
THE LINGUISTIC SYSTEM IN CONTEXT
Baroque musical composer.
Most words are also highly ambiguous; that is, the same In the preceding sections, language is described as a conven-
regularity of behavior is used as a coordination device to solve tional system for communication, and the role of convention
3
in language and of language in communication was discussed. by phonological categories, and syntagmatic contrasts by the
In this section, the linguistic system is described in broad out- phonological structure of words and larger prosodic units.
line. Linguistic structure has been intensively studied over the he syntagmaticparadigmatic distinction is the most basic
past century ever since Ferdinand de Saussure inaugurated way to describe the fact that the linguistic system allows a (re-)
the modern analysis of linguistic structure, Structuralism combination of meaningful units in diferent ways. he adap-
(Saussure [1916] 1966). h is section focuses on those aspects of tive motivation for the emergence of such a communication
linguistic structure that are generally agreed upon and shows system was described previously: he number of intentions to
the extent to which they emerge from the principles that have be communicated is so great that a set of simple (atomic) sym-
been presented in the preceding section. bols will not suice, but experience is such that it can be broken
he most fundamental structuralist principle is the central- down into recurrent parts for which conventional linguistic
ity of the linguistic sign or symbol, that is, the notion that lan- expressions can develop. he same motivations gave rise to the
guage pairs form and meaning, and that particular linguistic syntagmaticparadigmatic distinction in phonology as well.
forms convey particular meanings. h is principle its directly Paradigmatic principles of structure in grammar and pho-
with the dei nition of convention. he regularity in behavior in nology are represented in terms of linguistic categories, phono-
part (i) of the dei nition of convention is the expression of a lin- logical and grammatical. hese abstract linguistic categories
guistic form by a speaker; the recurrent coordination problem can be mapped onto the substantive categories of the actual
in part (v) of the dei nition is the communication of a meaning phonetic realization (for phonology) and of utterance meaning
between the interlocutors. (for grammar). Linguistic typology (Comrie 1989; Croft 2003),
Also central to the structural analysis of language is the which takes a cross-linguistic perspective on grammatical
arbitrariness of the linguistic sign. hat is, arbitrariness exists analysis, has demonstrated that the ways in which phonologi-
in the particular form and meaning that are paired. h is con- cal categories are mapped onto phonetic space, and grammati-
forms with part (ii) of the dei nition of convention, namely, cal or lexical categories are mapped onto conceptual space, are
that the convention is partly arbitrary. Arbitrariness is usu- not unlimited. For example, phonetic similarities and concep-
ally dei ned in structuralist analysis as the principle that one tual similarities constrain the structure of phonological and
cannot entirely predict the form used from the meaning that is grammatical categories, respectively.
intended. From a communicative point of view, arbitrariness Syntagmatic principles of structure are represented in var-
means that another choice could have served approximately ious ways, but all such representations relect another basic
equally well. For example, the choice of producing the string principle, the hierarchical organization of the structure of
of sounds butterly for a particular meaning could have been utterances. Sentences are organized in a hierarchical struc-
replaced with the choice of producing the string of sounds ture, representing groupings of words at diferent levels. So
Schmetterling a choice made by members of the German he cat sat on the mat is not just a string of roles that contrast
speech community. Two diferent choices are communica- syntagmatically, as in [Determiner Noun Copula Preposition
tively equivalent in that neither choice is preferred for the Determiner Noun]. Instead, it is a set of nested groupings of
meaning intended and that is usually because the choice of words: [[Determiner Noun] [Copula] [Preposition [Determiner
one expression over the other is arbitrary in the structuralist Noun]]]. he nested groupings are frequently represented in a
sense. variety of ways, such as the syntactic trees of phrase (constitu-
Another principle that can be traced back to Saussure is ent) structure analysis. hey can also be represented as depen-
the distinction between the paradigmatic and syntagmatic dency diagrams (for example, the determiner is related to the
contrast of linguistic units. In a complex (multiword or multi- noun as its modiier, which in turn is related to the copula as
morpheme) grammatical construction, such as he cat sat on its subject), and representations combining constituency and
the mat, each word enters into two diferent types of contrast. dependency also exist.
For example, the i rst word the contrasts with the word cat in he structure of a construction often appears to be motivated,
that the s role in the construction (determiner) contrasts with though not entirely predicted, by the structure of the meaning
cat s role (head noun). h is is a syntagmatic contrast. But the that it is intended to convey. For example, the syntactic group-
also contrasts with another possible i ller of the same role in ings in [[he cat] is [on [the mat]]] are motivated semantically;
the construction, such as a in A cat sat on the mat ; and cat con- the in the cat modiies cat semantically as well as syntactically
trasts with hamster, parakeet, etc. in the same way. hese are (indicating that the cats identity is known to both speaker and
paradigmatic contrasts. hearer). he (partial) motivation of syntactic structure by its
More recent grammatical theories represent paradigmatic meaning is captured by general principles in diferent theories.
contrast in terms of a set of elements belonging to a grammati- hese principles can be described as variants of the broader
cal category. hus, the and a belong to the category determiner, principle of diagrammatic iconicity (Peirce 1932): roughly, that
and cat, hamster, parakeet, etc. belong to the category noun. the abstract structure of the linguistic expression parallels the
Syntagmatic contrasts are represented by contrasting roles in abstract structure of the meaning intended, to a great extent.
the syntactic structure or constructions used in the utterance. It is diicult to evaluate the structure of meaning indepen-
For example, the determiner category is functioning as a modi- dently of the structure of linguistic form. However, diferent
ier of the noun category in a noun phrase construction. Finally, speech communities settle on a similar range of constructions
the syntagmaticparadigmatic distinction also applies to pho- to express the same complex meaning the regularities dis-
nology (sound structure): Paradigmatic contrast is represented covered in linguistic typology (see, for example, the studies
4
published in Typological Studies in Language and the Oxford content words, such as nouns and verbs. In this way, the speaker
Studies in Typology and Linguistic heory). h is fact suggests has transformed the unique whole of the original experience
that there are regularities in the meaning to be conveyed that into parts that can be expressed by language.
are then relected in the grammatical constructions used to h is is not the end of the verbalization process. Content
express them. words denote only general categories of parts of the experi-
ence to be verbalized. In order to communicate the original
experience, the speaker must tie down the categories to the
GRAMMAR AND THE VERBALIZATION OF EXPERIENCE
unique instances of objects, events, and so forth in the expe-
he preceding sections have described the general context of rience, and the speaker must assemble the parts into a struc-
language use and the basic principles of language structure. ture representing the original whole that the speaker intends
he grammars of particular languages conform to the basic to verbalize. hat is to say, corresponding to the categorizing
principles of language structure in the preceding section. But step in verbalizing the parts of the experience, there is a par-
the grammars of particular languages, while diverse in many ticularizing step that indicates the unique parts; and corre-
ways, are similar to a much greater degree than would be pre- sponding to the steps of propositionalizing and subchunking
dicted from the general principles in the preceding section, or are integrative steps of structuring and cohering, respectively
even the context of language use described in the earlier sec- (Croft 2007). hese latter three steps give rise to grammar in the
tions. For example, all languages have structures like clauses sense of grammatical constructions, inlections, and particles,
in which some concept (prototypically an action concept, and the semantic commonalities among grammatical catego-
usually labeled a verb) is predicated on one or more concepts ries across languages.
that are referred to (prototypically an object or person, usually he particularizing step takes a category (a type) and selects
labeled a noun). he noun-like expressions are in turn orga- an instance (token) or set of tokens, and also identiies it by sit-
nized into phrases with modiiers. Clauses are related to each uating it in space and time. For object concepts, selecting can
other by varying degrees of grammatical integration. Certain be accomplished via the inlectional category of number, and
semantic categories are repeatedly expressed across languages via the grammatical categories of number and quantiication
as grammatical inlections or function words (e.g., articles, (three books, an ounce of gold ). For action concepts, selecting
prepositions, auxiliaries) that combine with the major lexical is done via grammatical aspect, which helps to individuate
categories of words in sentences. events in time (ate vs. was eating), and via agreement with sub-
hese universal patterns in grammar are attributable to the ject and/or object, since events are also individuated by the
way that experience is verbalized by human beings. he fun- participants in them (I read the paper and She read the maga-
damental problem of verbalization is that each experience that zine describe diferent reading events). Objects and events can
a speaker wishes to verbalize is a unique whole. But a linguis- be situated in space via deictic expressions and other sorts of
tic utterance is unlike an experience: An utterance is broken locative expressions (this book, the book on the table). Events
down into parts, and these parts are not unique; they have and some types of objects can be situated in time via tense and
been used before in other utterances. (h is latter point is the temporal expressions (I ate two hours ago; ex-mayor). Events
fact of convention; a particular linguistic form is used regularly and objects can also be situated relative to the mental states
and repeatedly for a recurrent coordination problem.) of the interlocutors: he article in the book indicates that the
he process by which the unique whole of experience is particular object is known to both speaker and hearer, and the
turned into a linguistic utterance made up of reusable parts has modal auxiliary in She should come indicates that the event
been described by Wallace Chafe (1977). he i rst step is that exists not in the real world but in the attitude of obligation in
the speaker subchunks the experience into smaller parts, each the mind of the speaker.
also a unique Gestalt similar in this way to the original experi- he structuring step takes participants and the predicated
ence. he subchunking process may be iterated (in later work, event in a clause and puts them together, reassembling the
Chafe emphasizes how consciousness shifts from one chunk predicate and the argument(s) into the subchunk from which
to another in the experience to be verbalized). A subchunk of they were derived by propositionalizing. Grammatically this is
the experience is then propositionalized; this is the second a complex area. It includes the expression of grammatical rela-
step. Propositionalizing involves breaking up an experience by tions in what is called the argument structure of a predicate, so
extracting certain entities that are (at least prototypically) per- that She put the clock on the mantle indicates which referent is
sistent, existing across subchunks. hese entities are the refer- the agent (the subject), which the thing moved (the object), and
ents that function as arguments of the predicate; the predicate which the destination of the motion (the prepositional phrase).
is what is left of the subchunk after the arguments have been But it also includes alternative formulations of the same event,
separated. Propositionalizing therefore breaks down the expe- such as he clock was put on the mantle (the passive voice con-
rience into parts arguments and the predicate that are not of struction) and It was the mantle where she put the clock (a cleft
the same type as the original experience (i.e., not a Gestalt). construction). he alternative constructions function to pres-
Once the whole has been broken down into these parts, ent the information in the proposition in diferent ways to the
the parts must be categorized, that is, assigned a category that hearer, depending on the way the discourse is unfolding; they
relates the parts of the current experience to similar parts of are referred to as information structure or discourse function.
prior experiences. Categorizing is the third step in the verbali- Finally, the cohering step takes the clauses (subchunks) and
zation process. hese categories are what are expressed by reassembles them into a whole that evokes the original whole
5
experience for the hearer. h is step can be accomplished by i rst and then recent proposals to apply it to grammar (syntax
various clause-linking devices, including subordination of var- and semantics) are examined.
ious kinds, coordination, and other clause-linking construc- One of the major results of instrumental phonetics is the
tions found in the worlds languages. Coherence of clauses discovery that phonetic variation in speech is ubiquitous.
in discourse is also brought about by discourse particles and Variation in the realization of phonemes is found not just
reference tracking, that is, grammatical devices, such as across speakers but also in the speech of a single speaker. here
pronouns or ellipsis, which show that an event is related to are at least two reasons why such variation in the speech signal
another event via a shared participant (Harry illed out the form would exist. Presumably, the level of neuromuscular control
and _ mailed it to the customs oice). over articulatory gestures needed for identical (invariant) pro-
he three steps of particularizing, structuring, and cohering ductions of a phoneme is beyond a speakers ability. At least as
result in a grammatical structure that evokes a reconstituted important, the variation in the speech signal does not prevent
version of the original unique whole. hese six steps in verbal- successful communication (or not enough of the time to lead
ization are not necessarily processed sequentially or indepen- to the evolution of even i ner neuromuscular control abilities
dently. he steps in the verbalization process are dependent on in humans).
the grammatical resources available in the language, which here is evidence, moreover, that the mental representa-
constrain the possibilities available to the speaker. For exam- tion of phonological categories includes the representation of
ple, when a speaker takes a subchunk and extracts participants individual tokens of sounds and the words that contain them.
from it, there must be a construction available in the language Speakers retain knowledge of i ne-grained phonetic detail
to relate the participants to the predicate, as with put in the (Bybee 2001; Pierrehumbert 2003). Also, there are many fre-
earlier example. hus, subchunking must be coordinated with quency efects on phonological patterns (Bybee 2001). For
propositionalizing and structuring. Also, the steps may not be example, higher-frequency forms tend to have more reduced
overtly expressed by grammatical inlections or particles. For phonetic realizations of phonemes than lower-frequency
example, he book fell does not overtly express the singular forms.
number of book, or that the event is situated in the real world Finally, human beings are extremely good pattern detectors
rather than a nonreal mental space of the speaker. from infancy on into adulthood. Infants are able to develop
Finally, the reconstituted experience evoked by the linguis- sensitivity to subtle statistical patterns of the phonetic signals
tic utterance is not the same as the unique whole with which they are exposed to. h is type of learning, which occurs with-
the speaker began. he cognitive processes humans use in ver- out actively attending to the stimulus or an intention to learn
balization do not simply carry out one or more of the six steps is called implicit learning (Vihman and Gathercole, unpub-
described. hey also conceptualize the experience in diferent lished manuscript). It contrasts with explicit learning, which
ways, depending on the speakers choices. hese choices range takes place under attention from the learner particularly
from the subtle diference between describing something as joint attention between an infant learning language and an
leaves or foliage, or the more dramatic framing diferences adult and is involved in the formation of categories and sym-
between fetus and unborn baby referred to previously. here bolic processing. here is neuroscientiic evidence that implicit
are a wide range of conceptualization processes or construal learning is associated with the neocortex and explicit learning
operations that have been identiied in language (see, e.g., with the hippocampus (ibid.).
Langacker 1987; Talmy 2000). he construal operations can be A number of researchers have proposed a usage-based or
accounted for by processes familiar from cognitive psychol- exemplar model of phonological representation to account
ogy: attention, comparison, perspective, and Gestalt (Croft for these patterns (Bybee 2001; Pierrehumbert 2003). In this
and Cruse 2004, Chapter 4). hese psychological processes are model, phonological categories are not represented by speciic
part of the meaning of all linguistic units: words, inlections, phonetic values for the phoneme in the language, but by a clus-
and constructions. As a consequence, every utterance presents ter of remembered tokens that form a density distribution over
a complex conceptualization of the original experience that a space of phonetic parameters. he phonetic space represents
the speaker intends to verbalize for the hearer. he conven- the phonetic similarities of tokens of the phonological category.
tionalized conceptualizations embodied in the grammatical h is model includes properties of implicit learning (the clus-
resources of a language represent cultural traditions of ways to ter of individual tokens) and explicit learning (the labeling of
verbalize experience in the speech community. the density distribution as representing tokens of, say, /e/ and
not /i/). Consolidation of token memories also takes place
individual tokens decay in memory, highly similar tokens are
VARIATION AND THE USAGE - BASED MODEL
merged, and the distribution of tokens can be restructured
One of the results of recent research on language structure but new tokens are constantly being incorporated into the rep-
and language use is the focus on the ubiquity of variation in resentation and inluencing it.
language use, that is, in the verbalization of experience and its Marilyn Vihman and S. Kunnari (2006) propose three types
phonetic realization. he ubiquity of variation in language use of learning for an exemplar model. First, there is an initial
has led to new models of the representation of linguistic knowl- implicit learning of statistical regularities of the sensory input.
edge in the mind that incorporate variation as an essential Second, explicit learning of linguistic categories, such as the
characteristic of language. hese models are more developed in words that are templates containing the sound segments, takes
phonetics and phonology. he phonological model is described place. Finally, a second layer of implicit learning of statistical
6
regularities gives rise to probability distributions for each lin- linguistic comparison of the meanings expressed by grammat-
guistic phonological and lexical category. he result of this ical categories and constructions. he typological approach to
last layer of learning is the exemplar or usage-based model grammar has constructed conceptual spaces for a number of
described by Janet Pierrehumbert and Joan Bybee. semantic domains using techniques such as the semantic map
he application of the usage-based/exemplar model to model (see Haspelmath 2003 for a survey of recent studies) and
grammar is more complex. Most research in this area has multidimensional scaling (Croft and Poole 2008).
compared the range of uses of a particular word or grammati- To sum up, the usage-based/exemplar model can be applied
cal construction. However, this does not represent the process to both phonological patterns in words and grammatical struc-
of language production (that is, verbalization), analogous to tures in constructions. A speakers knowledge of language
the phonetic variation found in the production of phonemes. is the result of the interplay between two learning processes.
Studies of parallel verbalizations of particular scenes demon- One learning process is the tallying of statistical regularities of
strate that variation in the verbalization of the same scene by tokens of words and constructions with a particular phonetic
speakers in similar circumstances is ubiquitous, much like the realization, performing a particular communicative act in a
phonetic realization of phonological categories (Croft 2010). speciic social interaction. he other is the organization of these
here is also substantial evidence for frequency efects in tokens into categories and the formation of generalizations that
grammar. For example, English has a grammatical category of allow the reuse or replication of these grammatical structures
auxiliary verb that has distinctive syntax in negation (I ca nt to solve future coordination problems in communication.
sing vs. I didnt sing), questions (Can he sing? vs. Did he sing?).
hese syntactic patterns are actually a relic of an earlier stage
VARIATION AND CHANGE: AN EVOLUTIONARY APPROACH
of English when word order was freer; it has survived in the
auxiliaries of modern English because of their higher token he view of language described in the preceding sections roots
frequency (Bybee and hompson 1997), as well as their seman- both language structure and a speakers linguistic knowledge
tic coherence. Frequency plays a central role in the historical in the individual acts of linguistic behavior that a speaker has
process of Grammaticalization (Hopper and Traugott 2003), in engaged in and will engage in. It is a dynamic view of language
which certain constructions develop a grammatical function in that linguistic behavior is essentially historical: a tempo-
(more precisely, they are recruited to serve the particularizing, ral series of utterances, each one linked to prior utterances as
structuring, and cohering steps of the verbalization process). repeated behavior to solve recurrent coordination problems in
Part of the grammaticalization process is that the construction social interaction. Each member of a speech community has a
increases in frequency; it therefore undergoes grammatical history of his or her participation in linguistic events, either as
and phonological changes, such as i xation of word order, loss speaker or hearer. h is history is remembered in the exemplar-
of syntactic lexibility, and phonetic reduction (Bybee 2003). A based representation of that members linguistic knowledge,
well-known example is the recruitment of the go + Ini nitive but also consolidated and organized in such a way that each
construction for the future tense: She is going (to Sears) to buy unique experience is broken down and categorized in ways
a food processor becomes future Shes going to buy a food pro- that allow for reuse of words and constructions in future com-
cessor, with no possibility of inserting a phrase between go and munication events.
the ini nitive, and is i nally reduced to Shes gonna buy a food Each time a speaker produces an utterance, he or she rep-
processor. licates tokens of linguistic structures sounds, words, and
Finally, early syntactic acquisition is driven by implicit constructions based on the remembering of prior tokens of
learning of patterns in the linguistic input (Tomasello 2003). linguistic structures, following the principles of convention
he process of syntactic acquisition is very gradual and induc- and verbalization described earlier. However, the replication
tive, involving an interplay between detection of statistical reg- process is never perfect: Variation is generated all of the time,
ularities and the formation of categories that permit productive as described in the preceding section. he variation generated
extension of grammatical constructions. Children occasionally in the process of language use can be called i rst-order varia-
produce overregularization errors, and these are also sensitive tion. Variation in replication is the starting point for language
to frequency (more frequent forms are more likely to be pro- change. Language change is an instance of change by replica-
duced correctly, and less frequent forms are more likely to be tion (rather than inherent change); change by replication is the
subject to regularization). domain of an evolutionary model of change (Hull 1988; Croft
A usage-based model of grammatical form and meaning 2000).
is gradually emerging from this research. An exemplar model Change by replication is a two-step process. he i rst step is
of grammatical knowledge would treat linguistic meanings as the generation of variation in replication. h is requires a repli-
possessing a frequency distribution of tokens of remembered cator and a process of replication by which copies are produced
constructions used for that meaning. hose constructions that preserve much of the structure of the original. In bio-
would be organized in a multidimensional syntactic space logical evolution, the canonical replicator is the gene, and the
organized by structural similarity (e.g., Croft 2001, Chapter 8) process of replication takes place in meiosis (which in sexual
and whose dimensions are organized by the function played organisms occurs in sexual reproduction). Copies of the gene
by the construction in the verbalization process. he mean- are produced, preserving much of the structure of the original
ings of constructions are themselves organized in a conceptual gene. Variation is generated by random mutation processes
space whose structure can be inferred empirically via cross- and by recombination in sexual reproduction.
7
In language, replication occurs in language use. he repli- conventions of the speech community. Each speakers system-
cators are tokens of linguistic structures in utterances (called atic knowledge of his or her language is diferent, because of
linguemes in Croft 2000). hese tokens are instances of lin- diferences in the range of language use to which each speaker
guistic behavior. he process of language change is therefore is exposed.
an example of cultural transmission, governed by principles
of evolutionary change. he replication process in language
SPEECH COMMUNITIES AND COMMON GROUND
change is governed by the principle of convention. As we have
seen in the preceding section, variation is generated in the pro- Language in this revised sense is the product of a speech com-
cess of verbalization, including the recombination of linguistic munity: the utterances produced by communicative interac-
forms. h is represents innovation in language change. First- tions among speakers. A speech community is dei ned by its
order variation is the source of language change. Experiments social interactions involving language: Members of the speech
in phonological perception and production indicate that sound community communicate with one another, and the commu-
change is drawn from a pool of synchronic variation (the title nity is dei ned by communicative isolation from other com-
of Ohala 1989). Indeterminacy in the interpretation of a communities. Communicative isolation is relative, of course, and
plex acoustic signal can lead to reanalysis of the phonological in fact the structure of human speech communities is far more
categories in that signal. Likewise, it appears that grammati- complex than the structure of biological populations.
cal change is also drawn from a pool of synchronic variation, Two related phenomena serve to dei ne communities: com-
namely, variation in verbalization. here is an indeterminacy mon ground and shared practice. Common ground plays an
in the understanding of the meaning of a word or construction essential role in dei ning joint action and convention, both cen-
because we cannot read each others minds, our knowledge of tral to understanding the nature of language. Common ground
linguistic conventions difers because we have been exposed consists of knowledge, beliefs, and attitudes presumed by two or
to diferent exemplars, and every situation is unique and can more individuals to be shared between them. Common ground
be construed in diferent ways. h is indeterminacy gives rise can be divided into two types: personal common ground and
to variation in verbalization (Croft 2010), and can lead to the communal common ground (Clark 1996, Chapter 4). Personal
reanalysis of the mapping of function into grammatical form common ground is shared directly in face-to-face interaction
(Croft 2000). by the persons. Personal common ground has two bases. he
he second step of the evolutionary process is the selection i rst is the perceptual basis: We share knowledge of what is in
of variants. Selection requires an entity other than the repli- our shared perceptual ield. he perceptual basis is provided
cator, namely, the interactor. he interactor interacts with its by virtue of joint attention and salience, as mentioned earlier.
environment in such a way that this interaction causes repli- A shared basis for common ground has the following prop-
cation to be diferential (Hull 1988). In biological evolution, the erties: he shared basis provides information to the persons
canonical interactor is the organism. he organism interacts involved that the shared basis holds; the shared basis indicates
with its environment. In natural selection, some organisms to each person that it provides information to every person that
survive to reproduce and therefore replicate their genes while the shared basis holds; and the shared basis indicates the prop-
others do not; this process causes diferential replication. osition in the common ground (Clark 1996, 94). A basis for com-
In language, selection occurs in language use as well. he mon ground varies in how well it is justiied; hence, we may not
interactor is the speaker. he speaker has variant linguistic always be certain of what is common ground or not.
forms available and chooses one over others based on his or he second basis for personal common ground is a dis-
her environment. In language, the most important environ- course basis. When I report on situations I have experienced
mental interaction is the social relationship between speaker to you in conversation, and vice versa, these become part of
and hearer and the social context of the speech event. h is is, of our personal common ground. Although we did not expe-
course, the realm of sociolinguistics (see, e.g., Labov 2001, and rience them perceptually together, we did experience the
the following section). Selection goes under the name of propa- reporting of them linguistically together. he discourse basis
gation in language change. thus involves joint attention (to the linguistic signal), as well as
Selection (propagation) is a function of the social value that the common ground of a shared language. he discourse basis
variants acquire in language use. First-order variation does and the perceptual basis both require direct interaction by the
not have a social value. Socially conditioned variation is sec- interlocutors. hey correspond to social networks, which are
ond-order variation. Once a variant is propagated in a speech instrumental in language maintenance and change (Milroy
community, it can lead to third-order variation, that is, varia- 1987).
tion in linguistic conventions across dialects and languages. he other type of common ground is communal common
Linguistic diversity is the result of language change. ground. Communal common ground is shared by virtue of
he evolutionary model requires a revision to the dei ni- common community membership. A person can establish
tion of language ofered near the beginning of this essay. In the common ground with a stranger if they both belong to a com-
evolutionary model, a language is a population of utterances, mon community (e.g., Americans, linguists, etc.). Some com-
the result of the employment of linguistic conventions in a munities are quite specialized while other communities are
speech community. he linguistic system is the result of the very broad and even all-encompassing, such as the community
ways in which speakers have consolidated the uses of language of human beings in this world, which gives rise to the possibil-
in which they have participated into their knowledge of the ity of communication in the i rst place.
8
Clark argues that the basis of communal common ground leading to distinct dialects and eventually to mutually unintel-
is shared expertise. tienne Wenger, on the other hand, dei nes ligible languages.
communities of practice in terms of shared practice: Individuals h is ubiquitous demographic process is relected in the fam-
engage in joint actions together, and this gives them common ily trees of languages that have been constructed by linguists
ground and creates a community (Wenger 1998). Wengers working on genetic classiication. hese family trees allow for
dei nition of a community of practice, therefore, requires face- the possibility of reconstructing not just protolanguages but
to-face interaction, like personal common ground. However, also the underlying social processes that are traced in them.
shared practice can be passed on as new members enter the Even sociolinguistic situations that obscure family trees leave
community and share practice with remaining current mem- linguistic evidence of other social processes. Extensive bor-
bers. h is is cultural transmission and can lead to individuals rowing indicates a period of intensive social contact. Diiculty
being members of the same community through a history of in separating branches of a linguistic family tree indicates an
shared practice, even if they do not interact directly with every expansion through a new area but continued low-level con-
other member of the community. tact between the former dialects. hese can be seen in the dia-
Since communities are dei ned by shared practice, and lect continua found in much of Europe, where the Romance,
human beings engage in a great variety of joint actions with Germanic, and Slavic peoples expanded over a mostly continu-
diferent groups of people, the community structure of human ous terrain (Chambers and Trudgill 1998). Shared typological
society is very complex. Every society is made up of multiple (structural) traits may be due to intimate contact between lan-
communities. Each person in the society is a member of mul- guages with continued language maintenance, or to a major
tiple communities, depending on the range of shared activities language shift by a social group, resulting in a large proportion
he or she engages in. he diferent communities have only par- of non-native speakers at one point in a languages history.
tially overlapping memberships. he spread of human beings across the globe led to the cre-
As a consequence, the structure of a language is equally ation of a huge number of distinct societies whose languages
complex. A linguistic structure a pronunciation, a word, a diverged. he number of distinct languages that have survived
construction is associated with a particular community, or until the beginning of the twenty-i rst century is about 6,000.
set of communities, in a society. A pronunciation is recog- Most linguists generally accept the hypothesis that modern
nized as an accent characteristic of a particular community. human language evolved just once in human history, proba-
Words will have diferent meanings in diferent communi- bly no later than 70,000 to 100,000 years ago. So in principle,
ties (e.g., subject is a grammatical relation for linguists but all modern human languages may have originated in a single
a person in an experiment for psychologists). he same con- common ancestor. Tracing back the actual lineages of con-
cept will have diferent forms in diferent communities (e.g., temporary languages deep into human prehistory appears to
Zinfandel for the general layperson, Zin to a wine aicionado). be extremely diicult, if not impossible. Nevertheless, there is
hus, a linguistic convention is not just a symbol a pairing no doubt that contemporary linguistic diversity is extremely
of form and meaning but includes a third part, the commu- ancient in human history. What we can discover about linguis-
nity in which it is conventional. h is is part (iii) of the dei ni- tic history by the comparison of existing languages can poten-
tion of convention given in an earlier section. Finally, each tially shed important light on human history and prehistory.
individual has a linguistic repertoire that relects his or her here are linguistic descriptions of a small propor-
knowledge and exposure to the communities in which he or tion of existing human languages, though descriptive work
she acts. has increased and the overall quality of descriptions has
he choice of a linguistic form on the part of a speaker is improved dramatically, thanks to advances in linguistic sci-
an act of identiication with the community that uses it. h is ence throughout the twentieth century. It would be safe to say
is the chief mechanism for selection (propagation) in language that the diversity of linguistic structure, and how that struc-
change: Ihe propagation of variants relects the dynamics of ture is manifested in phonetic reality on the one hand and in
social change. More recent work in sociolinguistics has argued the expression of meaning on the other, is truly remarkable
that linguistic acts of social identity are not always pas- and often unexpected. Many proposed universals of language
sive: Individuals institute linguistic conventions to construct have had to be revised or even abandoned as a consequence,
an identity as well as to adopt one (Eckert 2000). although systematic analysis of existing linguistic descriptions
by typologists have revealed many other language universals
that appear to be valid. Linguistic diversity has revealed alter-
LANGUAGE DIVERSITY AND ITS ENDANGERMENT
native ways of conceptualizing experience in other societies, as
Variation in language can lead to language change if it is prop- well as alternative methods of learning and alternative means
agated through a speech community. Social processes over for communication for the accomplishment of joint actions.
human history have led to the enormous linguistic diversity But the single most important fact about the diversity of
we i nd today a diversity that newer social processes also human language is that it is severely endangered. Of the 6,000
threaten. he basic social process giving rise to linguistic diferent languages extant today, 5,000 are spoken by fewer
diversity is the expansion and separation of populations into than 100,000 people. he median number of speakers for a
distinct societies. As groups of people divide for whatever rea- language is only 6,000 (Crystal 2000). Many languages are no
son, they become communicatively isolated, and the common longer spoken by children in the community, and therefore
language that they once spoke changes in diferent directions, will go extinct in another generation. he loss for the science
9
of language, and more generally for our understanding of Clark , Herbert H. 1992. Arenas of Language Use. Chicago and
human history, human thought, and human social behav- Stanford: University of Chicago Press and the Center for the Study of
ior, is immense. But the loss is at least as great for the speak- Language and Information.
ers themselves. Language use is a mode of social identity, not . 1996. Using Language. Cambridge: Cambridge University Press.
Clark , Herbert H.. 1999. On the origins of conversation. Verbum
just in terms of identifying with a speech community but as
21: 14761.
the vehicle of cultural transmission. he loss of languages, like
Comrie, Bernard. 1989. Language Universals and Linguistic Typology.
other linguistic phenomena, is a relection of social processes. 2d ed. Chicago: University of Chicago Press.
he most common social processes leading to language loss are Croft, William. 2000. Explaining language change: An evolutionary
disruption, dislocation, or destruction of the society (language approach. Harlow, Essex : Longman.
loss rarely occurs via genocide of its speakers). he enormous . 2001. Radical Construction Grammar: Syntactic heory in
consequences of language loss has led to a shift in linguistic Typological Perspective. Oxford: Oxford University Press.
ieldwork from mere language description and documentation . 2003. Typology and Universals. 2d ed. Cambridge: Cambridge
to language revitalization in collaboration with members of University Press.
the speech community. But reversing language shift ultimately . 2007. he origins of grammar in the verbalization of experi-
requires a change in the social status of the speech community ence. Cognitive Linguistics 18: 33982.
. 2010. he origins of grammaticalization in the verbalization of
in the local and global socioeconomic system.
experience. Linguistics 48: 148.
Croft, William, and D. Alan Cruse. 2004. Cognitive Linguistics.
SUMMARY Cambridge: Cambridge University Press.
Croft, William, and Keith T. Poole. 2008. Inferring universals from
he scientiic study of language in its pragmatic, cognitive, grammatical variation: Multidimensional scaling for typological
and social context beginning in the latter half of the twenti- analysis. heoretical Linguistics 34: 137.
eth century is converging on a new perspective on language Crystal, David. 2000. Language Death. Cambridge: Cambridge
in the twenty-i rst century. Linguistic conventions coordinate University Press.
communication, which in turn coordinates joint actions. he Eckert, Penelope. 2000. Linguistic Variation as Social Practice: he
fragility of social interaction by individuals leads to creativity, Linguistic Construction of Identity in Belten High . Oxford: Blackwell.
variation, and dynamism in the verbalization and vocalization Grice, H. Paul. [1948] 1989. Meaning. In Studies in the Way of Words,
of language. Individual linguistic knowledge (the linguistic 21323. Cambridge: Harvard University Press.
Haspelmath, Martin. 2003. he geometry of grammatical mean-
system) relects the remembered history of language use and
ing: Semantic maps and cross-linguistic comparison. In he New
mediates processes of language change. he continually chang-
Psychology of Language. Vol. 2. Ed. Michael Tomasello, 21142.
ing structure of society, dei ned by common ground emerging
Mahwah, NJ: Lawrence Erlbaum Associates.
from shared practices (joint actions), guides the evolution of Hockett, Charles F. 1960. he origin of speech. Scientiic American
linguistic conventions throughout its history. Human history 203: 8896.
in turn has spawned tremendous linguistic diversity, which Hopper, Paul, and Elizabeth Traugott. 2003. Grammaticalization. 2d
relects the diversity of human social and cognitive capacity. ed. Cambridge: Cambridge University Press.
But the unchecked operation of contemporary social forces is Hull, David L. 1988. Science as a Process: An Evolutionary Account of the
leading to the destruction of speech communities and the mass Social and Conceptual Development of Science. Chicago: University
extinction of human languages today. of Chicago Press.
Labov, William. 2001. Principles of Linguistic Change. Vol. 2. Social
Factors. Oxford: Blackwell.
WORKS CITED AND SUGGESTIONS FOR FURTHER READING
Langacker, Ronald W. 1987. Foundations of Cognitive Grammar. Vol. 1.
Austin, J. L. 1962. How to Do hings with Words. Cambridge: Harvard heoretical Prerequisites. Stanford: Stanford University Press.
University Press. Lewis, David. 1969. Convention. Cambridge, MA : MIT Press.
Bolinger, Dwight. 1980. Language, the Loaded Weapon. London: Milroy, Lesley. 1987. Language and Social Networks. 2d ed. Oxford: Basil
Longmans. Blackwell.
Bratman, Michael. 1992. Shared cooperative activity. Philosophical Ohala, John. 1989. Sound change is drawn from a pool of syn-
Review 101: 32741. chronic variation. In Language Change: Contributions to the Study
Bybee, Joan L. 2001. Phonology and Language Use. Cambridge of its Causes, ed. Leiv Egil Breivik and Ernst Hkon Jahr, 17398.
Cambridge University Press. Berlin: Mouton de Gruyter.
. 2003. Mechanisms of change in grammaticalization: he role Peirce, Charles Sanders. 1932. Ground, object and interpretant. In
of frequency. In Handbook of Historical Linguistics, ed. Brian Joseph Collected Papers of Charles Sanders Peirce. Vol. 2: Elements of Logic,
and Richard Janda, 60223. Oxford: Blackwell. ed. Charles Hartshorne and Paul Weiss, 13455. Cambridge: Harvard
Bybee, Joan L ., and Sandra A. hompson. 1997. h ree frequency efects University Press.
in syntax. In Proceedings of the 23rd Annual Meeting of the Berkeley Pierrehumbert, Janet B. 2003. Probabilistic phonology: discrimination
Linguistics Society, ed. Matthew L. Juge and Jeri O. Moxley, 37888. and robustness. In Probabilistic Linguistics, ed. Rens Bod, Jennifer
Berkeley : Berkeley Linguistics Society. Hay, and Stefanie Jannedy, 177228. Cambridge, MA: MIT Press.
Chafe, Wallace. 1977. he recall and verbalization of past experi- Saussure, Ferdinand de. [1916] 1966. Cours de linguistique gnrale. Ed.
ence. In Current Issues in Linguistic heory, ed. Peter Cole, 21546. Ch. Bally and A. Sechehaye. (Course in General Linguistics. Trans.
Bloomington: Indiana University Press. Wade Baskin. New York : McGraw-Hill.)
Chambers, J. K., and Peter Trudgill. 1998. Dialectology. 2d ed. Searle, John R. 1969. Speech Acts: An Essay in the Philosophy of Language.
Cambridge: Cambridge University Press. Cambridge: Cambridge University Press.
10
Talmy, Leonard. 2000. Toward a Cognitive Semantics. Vol. 1. Concept Vihman, Marilyn M., and V. M. Gathercole. Language Development.
Structuring Systems Cambridge, MA : MIT Press. Unpublished manuscript.
Tomasello, Michael. 1999. he Cultural Origins of Human Cognition. Vihman, Marilyn M., and S. Kunnari. 2006. he sources of pho-
Cambridge: Harvard University Press. nological knowledge: A crosslinguistic perspective. Recherches
. 2003. Constructing a Language: A Usage-Based heory of Linguistiques de Vincennes 35: 13364.
Language Acquisition. Cambridge: Harvard University Press. Wenger, tienne. 1998. Communities of Practice: Learning, Meaning
. 2008. he Origins of Human Communication. Cambridge, and Identity. Cambridge: Cambridge University Press.
MA : MIT Press.
11
with a speakers (or hearers) semantic and contextual knowl-
edge. Here, we review some of what we have learned about the
psychology of linguistic form, as it pertains to sounds, words,
and sentences.
2
SOUNDS
Sound units. Since the advent of speech research, one of the
THE PSYCHOLOGY OF LINGUISTIC FORM most intensively pursued topics in speech science has been
the search for the fundamental sound units of language. Many
Lee Osterhout, Richard A. Wright, and Mark D. Allen
researchers have found evidence for phonological units that
are abstract (i.e., generalizations across any number of heard
utterances, rather than memories of speciic utterances) and
componential (constituent elements that operate as part of a
combinatorial system). However, there is other evidence for
less abstract phonological forms that may be stored as whole
words. As a result, two competing hypotheses about phonolog-
ical units have emerged: an abstract componential one versus
Humans can generate and comprehend a stunning variety of a holistic one.
conceptual messages, ranging from sophisticated types of men- he more widespread view is the componential one. It pos-
tal representations, such as ideas, intentions, and propositions, its abstract units that typically relate either to abstract versions
to more primal messages that satisfy demands of the immedi- of the articulatory gestures used to produce the speech sounds
ate environment, such as salutations and warnings. In order (Liberman and Mattingly 1985, Browman and Goldstein 1990),
for these messages to be transmitted and received, however, or to ones derived from descriptive units of phonological the-
they must be put into a physical form, such as a sound wave ory. Such descriptive units include the feature (see feature
or a visual marking. As noted by the Swiss linguist Ferdinand analysis), an abstract subphonemic unit of contrast; the pho-
de Saussure ([2002] 2006), the relationship between mental neme, an abstract unit of lexical contrast that is either a conso-
concepts and physical manifestations of language is almost nant or a vowel; the phone or allophone, a surface variant of the
always arbitrary. he words cat, sat, and mat are quite similar phoneme; the syllable, a timing unit that is made up of a vowel
in terms of how they sound but are very dissimilar in meaning; and one or more of its lanking consonants; the prosodic word ,
one would expect otherwise if the relationship between sound the rhythmic structure that relates to patterns of emphasized
and meaning was principled instead of arbitrary. Although the syllables; or various structures that related to tone, the lexi-
relationship between linguistic form and meaning is arbitrary, cally contrastive use of the voices pitch, and intonation, the
it is also highly systematic. For example, changing a phoneme pitch-based tune that relates to the meaning of a sentence (for
in a word predictably also changes its meaning (as in the cat, reviews, see Frazier 1995; Studdert-Kennedy 1980).
sat, and mat example). In the holistic view, the word is the basic unit, whereas
Human language is perhaps unique in the complexity of other smaller units are considered to be epiphenomenal (e.g.,
its linguistic forms (and, by implication, the system underly- Goldinger, Pisoni, and Logan, 1991). Instance-speciic mem-
ing these forms). Human language is compositional; that is, ory traces of particular spoken words are often referred to as
every sentence is made up of smaller linguistic units that have episodes. Proponents of this view point out that while abstract
been combined in highly constrained ways. A standard view units are convenient for description and relate transparently
(Chomsky 1965; Pinker 1999) is that units and rules of combina- to segment-based writing systems, such as those based on
tion exist at the levels of sound (phonemes and phonology), the alphabet, there is evidence that listeners draw on a vari-
words (morphemes and morphology), and sentences (words ety of highly detailed and instance-speciic aspects of a words
and phrases and syntax). Collectively, these rules comprise a pronunciation in making lexical decisions (for reviews, see
grammar that dei nes the permissible linguistic forms in the Goldinger and Azuma 2003; Nygaard and Pisoni 1995).
language. hese forms are systematically related to, but dis- Some researchers have proposed hybrid models in which
tinct from, linguistic meaning (semantics). there are two layers of representation: the episodic layer, in which
Linguistic theories, however, are based on linguistic highly detailed memory traces are stored, and an abstract layer
description and observation and therefore have an uncertain organized into features or phones (Scharenborg et al. 2005). he
relation to the psychological underpinnings of human lan- proponents of hybrid models try to capture the instance-spe-
guage. Researchers interested in describing the psychologically ciic efects in perception that inspire episodic approaches, as
relevant aspects of linguistic form require their own meth- well as the highly abstracted lexical contrast efects.
ods and evidence. Furthermore, psychological theories must
describe not only the relevant linguistic forms but also the pro- PROCESSES. speech production refers to the process by
cesses that assemble these forms (during language production) which the sounds of language are produced. he process nec-
and disassemble them (during language comprehension). Such essarily involves both a planning stage, in which the words and
theories should also explain how these forms are associated other linguistic units that make up an utterance are assembled
12
The Psychology of Linguistic Form
in some fashion, and an implementation stage, in which the its roots in research that dates back over a century. Prior to
various parts of the vocal tract, for example the articulators, the advent of instrumental and experimental methods in the
execute a motor plan to generate the acoustic signal. See Carol late nineteenth century, it was commonly accepted that the
A. Fowler (1995) for a detailed review of the stages involved in basic units of speech were discrete segments that were alpha-
speech production. It is worth noting here that even if abstract betic in nature and serially ordered. While it was recognized
phonological units such as features are involved in planning that speech sounds varied systematically depending on the
an utterance, at some point the linguistic string must be imple- phonetic context, the variants themselves were thought to be
mented as a motor plan and a set of highly coordinated move- static allophones of an abstract and lexically contrastive sound
ments. h is has motivated gestural representations that include unit, that is, a phoneme. Translated into modern terminology,
movement plans, rather than static featural ones (Browman phonological planning involved two stages: 1) determining the
and Goldstein 1990; Fowler 1986, 1996; Saltzman and Munhall contextually determined set of discrete surface variants, given
1989; Stetson 1951). a particular lexical string, and 2) concatenating the resulting
speech perception is the process by which human lis- allophones. he physiological implementation of the concat-
teners identify and interpret the sounds of language. It, too, enated string was thought to result in a series of articulatory
necessarily involves at least two stages: 1) the conversion of the steady states, or postures. he only continuous aspects of sound
acoustic signal into an electrochemical response at the audi- production were believed to be brief transitional periods cre-
tory periphery and 2) the extraction of meaning from the neu- ated by articulatory transitions from one state to the next. he
rophysiological response at the cortical levels. Brian C. J. Moore transitional movements were thought to be wholly predictable
(1989) presents a thorough review of the physiological processes and determined by the physiology of a particular speakers
and some of the issues involved in speech perception. A funda- vocal tract. Translated again into modern terminology, percep-
mental point of interest here is perceptual constancy in the face tion (when considered) was thought to be simply the process
of a massively variable signal. Restated as a question, how is of translating the allophones back into their underlying pho-
it that a human listener is able to perceive speech sounds and nemes for lexical access. he earliest example of the phoneme-
understand the meaning of an utterance, given the massive allophone relationship is attributed to Pini, around 500 b.c.e.
variability created by physiological idiosyncrasies and con- whose sophisticated system of phonological rules and relation-
textual variation? he various answers to this question involve ships inluenced structuralist linguists of the early twentieth
positing some sort of perceptual units, be they individual seg- century, as well as generative linguists of the late twentieth
ments, subsegmental features, coordinated speech gestures, or century (for a review, see Anderson 1985; Kiparsky 1979).
higher-level units like syllables, morphemes, or words. he predominant view at the end of the nineteenth century
It is worth noting here that the transmission of linguistic was typiied by Alexander M. Bells (1867) descriptive work on
information does not necessarily rely exclusively on the audi- English pronunciation. In it, he presented a set of alphabet-in-
tory channel; the visible articulators, the lips and to a lesser spired symbols whose shapes and orientations were intended
degree the tongue and jaw, also transmit information. A listener to encode both the articulatory steady states and their result-
presented with both auditory and visual stimuli will integrate ing steady-state sounds. A fundamental assumption in the
the two signals in the perceptual process (e.g., Massaro 1987). endeavor was that all sounds of human language could be
When the information in the visual signal is unambiguous (as encoded as a sequence of universal articulatory posture com-
when the lips are the main articulators), the visual signal may plexes whose subcomponents were shared by related sounds.
even dominate the acoustic one (e.g., McGurk and Macdonald For example, all labial consonants (p, b, m, f, v, w, etc.) shared
1976). Moreover, writing systems convey linguistic informa- a letter shape and orientation, while all voiced sounds (b, d, g,
tion, albeit in a low-dimensional fashion. Most strikingly, sign v, z, etc.) shared an additional mark to distinguish them from
languages are fully as powerful as speech-based communica- their voiceless counterparts (p, t, k, f, s, etc.). Bells formaliza-
tion systems and are restricted to the visual domain. Despite tion of a set of universal and invariant articulatory constituents,
the diferences between signed and spoken languages in terms aligned as an alphabetic string, inluenced other universal
of the articulators and their perceptual modalities, they draw transcription systems such as Henry Sweets (1881) Romic
on the same sorts of linguistic constituents, at least so far as alphabet, which laid the foundation for the development of the
the higher-level units are concerned: syllable, morpheme, International Phonetic Alphabet (Passy 1888). It also foreshad-
word, sentence, and prosodic phrase (e.g., Brentari 1998). Some owed the use of articulatory features, such as those proposed
linguists have also proposed the decomposition of signed lan- by Noam Chomsky and Morris Halle (1968) in modern phonol-
guages into smaller units using manual analogs of phonologi- ogy, in that each speech sound, and therefore each symbol, was
cal features, despite the obvious diferences in the articulators made up of a set of universal articulatory components.
and the transmission media (for a review see Emmory 2002). A second way in which Bells work presaged modern research
he parallel of signed and spoken language structure despite was the connection between perception and production.
the diferences in transmission modalities is often interpreted Implicit in his system of writing was the belief that perception
as evidence for abstract phonological units at the level of the of speech sounds was the process of extracting the articulations
mental lexicon (Meier, Cormier, and Quinto-Pozos 2002). that produced them. Later perceptual models would incorpo-
rate this relationship in one way or another (Chistovich 1960;
THE HISTORY OF THE DEBATE: EARLY PHONOLOGICAL UNITS. he Dudley 1940; Fowler 1986, 1996; Joos 1948; Ladefoged and
current debate about how to characterize speech sounds has McKinney 1963; Liberman and Mattingly 1985; Stetson 1951).
13
THE HISTORY OF THE DEBATE: EARLY EXPERIMENTAL rediscovered and improved on the earlier observation that the
RESEARCH. Prior to the introduction of experimental meth- speech signal and the articulations that create it are continu-
ods into phonetics, the dominant methodologies were intro- ous, dynamic, and overlapping. Stetson (1951) can be seen as
spection about ones own articulations and careful but responsible for introducing kinematics into research on speech
subjective observations of others speech, and the measure- production. His research introduced the notions of coproduc-
ment units were letter-based symbols. hus, the observer and tion, in which articulatory gestures were initiated simulta-
the observed were inextricably linked while the resolution of neously, and gestural masking, in which the closure of one
the measurement device was coarse. h is view was challenged articulatory gesture hides another, giving rise to the auditory
when a handful of phoneticians and psychologists adopted percept of deletion. Stetsons work provided the foundation for
the scientiic method and took advantage of newly available current language models that incorporate articulatory ges-
instrumentation, such as the kymograph, in the late 1800s. tures and their movements as the fundamental phonological
hey discovered that there were no segmental boundaries in units (e.g., Browman and Goldstein 1990; Byrd and Saltzman
the speech stream and that the pronunciation of a particular 2003; Saltzman and Munhall 1989).
sound varied dramatically from one instance to the next (for In the perceptual and acoustic domains, the identiication
a review of early experimental phonetics, see Khnert and of perceptual cues to consonants and vowels raised a series
Nolan 1999 and Miniie 1999). In the face of the new instru- of questions that remain at the heart of the debate to this day.
mental evidence, some scholars, like Eduard Sievers (1876), P.-J. he coextensive and covarying movements that produce the
Rousselot (1897), and Edward Wheeler Scripture (1902), pro- speech signal result in acoustic information that exhibits a
posed that the speech stream, and the articulations that pro- high degree of overlap and covariance with information about
duced it, were continuous, overlapping, and highly variable, adjacent units (e.g., Delattre, Liberman, and Cooper 1955). Any
rather than being discrete, invariant, and linear. For them, the single perceptual cue to a particular speech sound can also be a
fundamental sound units were the syllable or even the word or cue to another speech sound. For example, the onset of a vowel
morpheme. Rousselots research (18971901) revealed several immediately following a consonant provides the listener with
articulatory patterns that were coni rmed by later work (e.g., cues that identify both the consonant and vowel (Liberman et
Stetson 1951). For example, he observed that when sounds that al. 1954). At the same time, multiple cues may identify a single
are transcribed as sequential are generated by independent speech sound. For example, the duration of a fricative (e.g., s),
articulators (such as the lips and tongue tip), they are initi- the fricatives noise intensity, and the duration of the preceding
ated and produced simultaneously. He also observed that one vowel all give information about whether the fricative is voiced
articulatory gesture may signiicantly precede the syllable it (e.g., z) or voiceless (e.g., s) (Soli 1982). Finally, the cues to
is contrastive in, thereby presenting an early challenge to the one phone may precede or follow cues to adjacent phones. he
notion of sequential ordering in speech. many-to-one, the one-to-many, and the nonlinear relation-
Laboratory researchers like Raymond H. Stetson (1905, ships between acoustic cues and their speech sounds pose a
1951) proposed that spoken language was a series of motor serious problem for perceptual models in which features or
complexes organized around the syllable. Stetson also i rst pro- phones are thought to bear a linear relationship to each other.
posed that perception was the process of perceiving the articu- More recently, researchers studying perceptual learning have
latory movements that generate the speech signal. However, discovered that listeners encode speaker-speciic details and
outside of the experimental phonetics laboratory, most speech even utterance-speciic details when they are learning new
researchers, particularly such phonologists as Leonard speech sounds (Goldinger and Azuma 2003). he latest set of
Bloomield (1933), continued to use phonological units that i ndings poses a problem for models in which linguistic sounds
remained abstract, invariant, sequential, and letter-like. h ree are stored as abstract units.
events that occurred in the late 1940s and early 1950s changed In distinctive feature theory, each phoneme is made up
this view dramatically. he i rst event was the application to of a matrix of binary features that encodes both the distinc-
speech research of modern acoustic tools like the spectrogram tions and the similarities between one class of sounds and
(Potter 1945), sophisticated models of vocal tract acoustics the next in a particular language (Jakobson, Fant, and Halle
(e.g., House and Fairbanks 1953), reliable articulatory instru- 1952; Chomsky and Halle, 1968). he features are thought to be
mentation, such as high-speed X-ray cinelourography (e.g., drawn from a language universal set, and thus allow linguists
Delattre and Freeman 1968), and electromyographic studies of to observe similarities across languages in the patterning of
muscle activation (Draper, Ladefoged, and Whitteridge 1959). sounds. Moreover, segmenting the speech signal into units
he second was the advent of modern perception research in that are hierarchically organized permits a duality of pattern-
which researchers discovered complex relationships between ing of sound and meaning that is thought to give language its
speech perception and the acoustic patterns present in the sig- communicative power. hat is, smaller units such as phonemes
nal (Delattre, Liberman, and Cooper 1955). he third was the may be combined according to language-speciic phonotactic
development of distinctive feature theory in which phonemes (sound combination) constraints into morphemes and words,
were treated as feature matrices that captured the relation- and words may be organized according to grammatical con-
ships between sounds (Jakobson 1939; Jakobson, Fant, and straints into sentences. h is means that with a small set of
Halle 1952). canonical sound units, together with recursion, the talker
When researchers began to apply modern acoustic and may produce and the hearer may decode and parse a virtually
articulatory tools to the study of speech production, they unbounded number of utterances in the language.
14
WORDS Using the lexical entry cat as an example, imagine a connec-

tionist system in which all the semantic features associated
In this section, we focus on those representations of form that
with cat, such as [whiskers], [domestic pet], and so on (which
encode meaning and other abstract linguistic content at the
are also shared with all other conceptual lexical entities bear-
most minimally analyzable units of analysis namely, words
ing those features, such as <lion>, <dog>, etc.), are directly
and morphemes. As such, we give a brief overview of the study
associated with the phonological units that comprise its word
of lexical morphology, investigations in morphological process-
form /k/, /ae/, /t/ (which are likewise shared with all other
ing, and theories about the structure of the mental lexicon.
word forms containing these phonemes) by means of individ-
ual association links that directly tie individual semantic fea-
LEXICAL FORM. What is the nature of a representation at the tures with individual phonological units (Rueckl et al. 1997).
level of lexical form? We limit our discussion here largely to One important consequence of this hypothetical arrangement
phonological codes, but recognize that a great many of the the- is that individual word forms do not exist as free-standing rep-
oretical and processing issues we raise apply to orthographic resentations. Instead, the entire lexical entry is represented as
codes as well. It is virtually impossible for the brain to store a vector of weighted links connecting individual phonemes to
exact representations for all possible physical manifestations of individual lexical semantic and syntactic features. It logically
linguistic tokens that one might encounter or produce. Instead, follows from this model, then, that if all or most of the semantic
representations of lexical form are better thought of as some- features of the word cat, for example, were destroyed or oth-
what abstract structured groupings of phonemes (or graph- erwise made unavailable to the processor, then the set of pho-
emes) that are stored as designated units in long-term memory, nological forms /k/ /ae/ /t/, having nothing to link to, would
either as whole words or as individual morpheme constituents, have no means for mental representation, and would therefore
and associated with any other sources of conceptual or linguis- not be available to the language processor. We will present
tic content encoded in the lexical entries that these form repre- experimental evidence against this model that, instead, favors
sentations map onto. As structured sequences of phonological models in which a full phonological word (e.g., /kaet/) is rep-
segments, then, these hypothesized representational units resented in a localist fashion and is accessible to the language
of lexical form must be able to account for essentially all the processor, even when access to its semantic features is partially
same meaning-to-form mapping problems and demands that or entirely disrupted.
individual phonological segments themselves encounter dur- Several of the most prominent theories of morphology and
ing on-line performance, due to idiosyncratic variation among lexical structure within formal linguistics make explicit claims
speakers and communicative environments. More speciically, about modularity of meaning and form. Ray Jackendof (1997),
representations of morphemes and words at the level of form for example, presents a theory that has a tripartite structure, in
must be abstract enough to accommodate signiicant variation which words have separate identities at three levels of represen-
in the actual physical energy proi les produced by the motor tation form, syntax, and meaning and that these three levels
systems of individual speakers/writers under various environ- are suicient to encode the full array of linguistic information
mental conditions. Likewise, in terms of language production, encoded by each word. His model provides further details in
units of lexical form must be abstract enough to accommo- which it is proposed that our ability to store, retrieve, and use
date random variation in the transient shape and status of the words correctly, as well as our ability to correctly compose
mouth of the language producer. morphemes into complex words, derives from a memorized
inventory of mapping functions that picks out the unique rep-
FORM AND MEANING: INDEPENDENT LEVELS OF LEXICAL REPRE- resentations or feature sets for a word at each level and asso-
SENTATION. he previous description of words and morphemes ciates these elements with one another in a given linguistic
to some degree rests on the assumption that lexical form is rep- structure.
resented independently from other forms of cognitive and lin- While most psycholinguistic models of language process-
guistic information, such as meaning and lexical syntax (e.g., ing have not typically addressed the mapping operations
lexical category, nominal class, gender, verbal subcategory, assumed by Jackendof, they do overlap signiicantly in terms
etc.). Many theories of the lexicon have crucially relied on the of addressing the psychological reality of his hypothetical tri-
assumption of separable levels of representation within the partite structure in the mental lexicon. Although most experi-
lexicon. In some sense, as explained by Allport and Funnell mental treatments of the multilevel nature of the lexicon have
(1981), this assumption follows naturally from the arbitrari- been developed within models of language production, as will
ness of mapping between meaning and form, and would thus be seen, there is an equally compelling body of evidence for
appear to be a relatively noncontroversial assumption. multilevel processing from studies of language comprehension
he skeptical scientist, however, is not inclined to simply as well.
accept assumptions of this sort at face value without consid- he most inluential lexical processing models over the
ering alternative possibilities. Imagine, for example, that the last two decades make a distinction between at least two lev-
various types of lexical information stored in a lexical entry els: the lemma level, where meaning and syntax are stored,
are represented within a single data structure of highly inter- and the lexeme level, where phonological and orthographic
connected, independent, distributed features. h is sort of descriptions are represented. hese terms and the functions
arrangement is easy to imagine within the architecture of a associated with them were introduced in the context of a com-
connectionist model (McClelland and Rumelhart 1986). putational production model by Gerard Kempen and Pieter
15
Huijbers (1983) and receive further rei nement with respect to one must recognize that in many other cases, such as those
human psycholinguistic performance in the foundational lexi- involving irregularly inlected words, such as taught, the sys-
cal production models of Merrill F. Garrett (1975) and Willem tem cannot store a stem and ai x at the level of form, as there
Levelt (1989). Much compelling evidence for a basic lemma/ are no clear morpheme boundaries to distinguish these con-
lexeme distinction has come from analyses of naturally occur- stituents, but must instead obligatorily store it as a whole word
ring speech errors generated by neurologically unimpaired at the lexeme level.
subjects, including tip-of-the-tongue phenomena (Meyer and Many prominent theories have favored the latter, non-
Bock 1992), as well as from systematic analyses of performance decompositional hypothesis for all words, including irregu-
errors observed in patients with acquired brain lesions. A more lar words like taught, as well as regular compositional words
common experimental approach, however, is the pictureword like wanted (Bybee 1988). Other inluential processing models
interference naming paradigm, in which it has been shown propose that complex words are represented as whole-word
that lemma- and lexeme-level information can be selectively units at the lexeme level, but that paradigms of inlectionally
disrupted during the course of speech production (Schriefers, related words (want, wants, wanted ) map onto a common rep-
Meyer, and Levelt 1990). resentation at the lemma level (Fowler, Napps, and Feldman
In terms of lexical comprehension models, perhaps the 1985). In addition to this, another class of models, which has
most straightforward sources of evidence for a meaning/form received perhaps the strongest empirical support, posits full
distinction have come from analyses of the performance of morphological decomposition at the lexeme level whenever
brain-damaged patients. A particularly compelling case for possible (Allen and Badecker 1999). According to these fully
the independence of meaning and form might be demon- decompositional models, a complex word like wanted is rep-
strated if an individual with acquired language pathology resented and accessed in terms of its decomposed constitu-
were to show an intact ability to access word forms in his/ ents want- and -ed at the level of form, such that the very same
her lexicon, yet remain unable to access meaning from those stem want- is used during the recognition of want, wants, and
form representations. h is is precisely the pattern observed in wanted. According to these models, then, the recognition rou-
patients designated as sufering from word meaning deafness. tines that are exploited by morphological decomposition at the
hese patients show a highly selective pattern of marked dei- level of form resemble those in theoretical approaches to sen-
cit in comprehending word meanings, but with perfect or near- tence processing, in which meaning is derived compositionally
perfect access to word forms. A good example is patient WBN by accessing independent units of representation of form and
as described in Mark D. Allen (2005), who showed an entirely combining the content that these forms access into larger lin-
intact ability to access spoken word-form representations. In guistic units, according to algorithms of composition speciied
an auditory lexical decision task, WBN scored 175/182 (96%) by the grammar.
correct, which shows that he could correctly distinguish real While there is compelling empirical support for decompo-
words from nonwords (e.g., lag vs. l ig), presumably relying sitional models of morphological processing, researchers are
on preserved knowledge of stored lexemes to do so. However, becoming increasingly aware of important factors that might
on tasks that required WBN to access meaning from spoken limit decomposition. hese factors are regularity, formal and
words, such as picture-to-word matching tasks, he performed semantic transparency, and productivity.
with only 40%60% accuracy (at chance in many cases). Regularity refers to the reliability of a particular word-for-
mation process. For example, the plural noun kids expresses
LEXICAL STRUCTURE: COMPLEX WORDS. A particularly impor- noun plurality in a regular, reliable way, while the plural noun
tant issue in lexical representation and processing concerns children does not.
the cognitive structure of complex words, that is, words com- Formal transparency refers to the degree to which the mor-
posed of more than one morpheme. One of the biggest debates pheme constituents of a complex structure are obvious from
surrounding this issue stems from the fact that in virtually all its surface form. For example, morpheme boundaries are fairly
languages with complex word structures, lexical information obvious in the transparently inlected word wanted, com-
is encoded in consistent, rule-like structures, as well as in idio- pared to those of the opaquely (and irregularly) inlected word
syncratic, irregular structures. h is issue can be put more con- taught.
cretely in terms of the role of morphological decomposition in Although an irregular form like taught is formally opaque,
single-word comprehension theories within psycholinguistics. as dei ned here, it is nonetheless semantically transparent,
Consider the written word wanted, for example. A question for because its meaning is a straightforward combination of the
lexical recognition theories is whether the semantic/syntactic semantics of the verb teach and the feature [+Past]. In contrast,
properties of this word [WANT, Verb, +Past, ] are extracted an example of a complex word that is formally transparent yet
and computed in a combinatorial fashion each time wanted semantically opaque is the compound word dumbbell, which
is encountered by accessing the content associated with is composed of two recognizable morphemes, but the content
the stem want- [WANT, Verb] and combining it with the con- associated with these two surface morphemes do not combine
tent extracted from the ai x -ed [+Past] or whether instead, semantically to form the meaning of the whole word.
a single whole-word form wanted is stored at the lexeme level Productivity describes the extent to which a word-formation
and associated directly with all of its semantic/syntactic con- process can be used to form new words freely. For example, the
tent. To understand the plausibility that a lexical system could sui x -ness is easily used to derive novel nouns from adjectives
in principle store whole-word representations such as wanted, (e.g., nerdiness, awesomeness, catchiness), while the ability to
16
form novel nouns using the analogous sui x -ity is awkward at claimed to be derived directly, without an intervening level of
best (?nerdity) if not impossible. syntactic structure.
Another phenomenon associated with these lexical proper- he initial evidence of separable syntactic and semantic pro-
ties is that they tend to cluster together in classes of morpholog- cessing streams came from studies of brain-damaged patients
ically complex word types across a given language, such that sufering from aphasia , in particular the syndromes known as
there will often exist a set of highly familiar, frequently used Brocas and Wernickes aphasia. Brocas aphasics typically pro-
forms that are irregular, formally opaque, and nonproductive, duce slow, labored speech; their speech is generally coherent
and also a large body of forms that are morphologically regular, in meaning but very disordered in terms of sentence structure.
formally transparent, and productive. Given the large variety of Many syntactically important words are omitted (e.g., the, is),
complex word types found in human languages with respect to as are the inlectional morphemes involved in morphosyntax
these dimensions of combinability, as well as the idiosyncratic (e.g., -ing, -ed, -s). Wernickes aphasics, by contrast, typically
nature of the tendency for these dimensions to cluster together produce luent, grammatical sentences that tend to be inco-
from language to language, it would appear that empirical evi- herent. Initially, these disorders were assumed to relect dei-
dence for morphological decomposition must be established cits in sensorimotor function; Brocas aphasia was claimed to
on a case-by-case basis for each word-formation type within result from a motoric deicit, whereas Wernickes aphasia was
each language. h is indeed appears to be the direction that claimed to relect a sensory deicit. he standard assumptions
most researchers have taken. about aphasia changed in the 1970s, when theorists began to
stress the ungrammatical aspects of Brocas aphasics speech;
the term agrammatism became synonymous with Brocas
SENTENCES
aphasia. Particularly important in motivating this shift was evi-
On the surface, a sentence is a linear sequence of words. But in dence that some Brocas aphasics have a language-comprehen-
order to extract the intended meaning, the listener or reader sion problem that mirrors their speech-production problems.
must combine the words in just the right way. hat much is Speciically, some Brocas aphasics have trouble understanding
obvious. What is not obvious is how we do that in real time, syntactically complex sentences (e.g., John was inally kissed by
as we read or listen to a sentence. Of particular relevance to Louise) in which the intended meaning is crucially dependent
this essay are the following questions: Is there a representa- on syntactic cues in this case, the grammatical words was
tional level of syntactic form that is distinct from the meaning and by (Caramazza and Zurif 1976). h is evidence seemed to
of a sentence? And if so, exactly how do we extract the implicit rule out a purely motor explanation for the disorder; instead,
structure in a spoken or written sentence as we process it? One Brocas aphasia was viewed as fundamentally a problem con-
can ask similar questions about the process of sentence pro- structing syntactic representations, both for production and
duction: When planning a sentence, is there a planning stage comprehension. By contrast, Wernickes aphasia was assumed
that encodes a speciically syntactic form? And if so, how do to relect a problem in accessing the meanings of words.
these representations relate to the sound and meaning of the hese claims about the nature of the aphasic disorders are
intended utterance? still quite inluential. Closer consideration, however, raises
For purely practical reasons, there is far more research on many questions. Pure functional deicits afecting a single lin-
extracting the syntactic form during sentence comprehen- guistically dei ned function are rare; most patients have a mix-
sion (a process known as parsing ; see parsing , human) than ture of problems, some of which seem linguistic but others of
on planning the syntactic form of to-be-spoken sentences. which seem to involve motor or sensory processing (Alexander
Nonetheless, research in both areas has led to substantive 2006). Many of the Brocas patients who produce agrammatic
advances in our understanding of the psychology of sentence speech are relatively good at making explicit grammaticality
form. judgments (Linebarger, Schwartz, and Saf ran 1983), suggest-
ing that their knowledge of syntax is largely intact. Similarly, it
SYNTAX AND SEMANTICS. A fundamental claim of a genera- is not uncommon for Brocas aphasics to speak agrammatically
tive grammar is that syntax and semantics are clearly dis- but to have relatively normal comprehension, bringing into
tinct. A fundamental claim of a cognitive grammar is that question the claim that Brocas aphasia relects damage to an
syntax and semantics are so entwined that they cannot be easily abstract syntax area used in production and comprehension
separated. h is debate among linguists is mirrored by a similar (Miceli et al. 1983). Taken together, then, the available evidence
debate among researchers who study language processing. A from the aphasia literature does not provide compelling evi-
standard assumption underlying much psycholinguistic work dence for distinct syntactic and semantic processing streams.
is that a relatively direct mapping exists between the levels of Another source of evidence comes from neuroimaging
knowledge posited within generative linguistic theories and studies of neurologically normal subjects. One useful method
the cognitive and neural processes underlying comprehension involves the recording of event-related brain potentials (ERPs)
(Bock and Kroch 1989). Distinct language-speciic processes from a persons scalp as he or she reads or listens to sentences.
are thought to interpret a sentence at each level of analysis, ERPs relect the summed, simultaneously occurring postsyn-
and distinct representations are thought to result from these aptic activity in groups of cortical pyramidal neurons. A par-
computations. But other theorists, most notably those working ticularly fruitful approach has involved the presentation of
in the connectionist framework, deny that this mapping exists sentences containing linguistic anomalies. If syntactic and
(Elman et al. 1996). Instead, the meaning of the sentence is semantic aspects of sentence comprehension are segregated
17
into distinct streams of processing, then syntactic and seman- (mostly involving the measurement of subjects eye movements
tic anomalies might afect the comprehension system in dis- as they read sentences) indicated that readers tended to read
tinct ways. A large body of evidence suggests that syntactic straight through such syntactically simple sentences as he cat
and semantic anomalies do in fact elicit qualitatively distinct scratched the ratty old sofa but experienced longer eye i xations
ERP efects, and that these efects are characterized by distinct and more eye regressions when they encountered by the rac-
and consistent temporal properties. Semantic anomalies (e.g., coon in the more complex sentences. When confronted with
he cat will bake the food ) elicit a negative wave that peaks syntactic uncertainty, readers seemed to immediately choose
at about 400 milliseconds after the anomalous word appears the simplest syntactic representation available (Frazier 1987).
(the N400 efect) (Kutas and Hillyard 1980). By contrast, syntac- When this analysis turned out to be an erroneous choice (that
tic anomalies (e.g., he cat will eating the food ) elicit a large is, when the disambiguating material in the sentence required
positive wave that onsets at about 500 milliseconds after pre- a more complex structure), longer eye i xations and more
sentation of the anomalous word and persists for at least half a regressions occurred as the readers attempted to reanalyze
second (the P600 efect [Osterhout and Holcomb 1992]). In some the sentence.
studies, syntactic anomalies have also elicited a negativity over A stronger test of the garden-path model, however, requires
anterior regions of the scalp, with onsets ranging from 100 to the examination of situations in which the semantic cues in
300 milliseconds. hese results generalize well across types the sentence are clearly consistent with a syntactically com-
of anomaly, languages, and various methodological factors. plex parsing alternative. A truly modular, syntax-driven
he robustness of the efects seems to indicate that the human parser would be unafected by the semantic cues in the sen-
brain does in fact honor the distinction between the form and tence. Consider, for example, the sentence fragment he
the meaning of a sentence. sofa scratched. Sofas are soft and inanimate and therefore
unlikely to scratch anything. Consequently, the semantic cues
SENTENCE COMPREHENSION. Assuming that sentence pro- in the fragment favor the more complex relative clause anal-
cessing involves distinct syntactic and semantic processing ysis, in which the sofa is the entity being scratched (as in he
streams, the question arises as to how these streams interact sofa scratched by the cat was given to Goodwill ). Initial results
during comprehension. A great deal of evidence indicates that seemed to suggest that the semantic cues had no efect on the
sentence processing is incremental, that is, that each succes- initial parse of the sentence; readers seemed to build the syn-
sive word in a sentence is integrated into the preceding sen- tactically simplest analysis possible, even when it was incon-
tence material almost immediately. Such a strategy, however, sistent with the available semantic information. Such evidence
introduces a tremendous amount of ambiguity that is, led to the hypothesis that the language processor is comprised
uncertainty about the intended syntactic and semantic role of a of a number of autonomously functioning components, each of
particular word or phrase. Consider, for example, the sentence which corresponds to a level of linguistic analysis (Ferreira and
fragment he cat scratched. here are actually two ways to Clifton 1986). he syntactic component was presumed to func-
parse this fragment. One could parse it as a simple active sen- tion independently of the other components.
tence, in which the cat is playing the syntactic role of subject he modular syntax-i rst model has been increasingly chal-
of the verb scratched and the semantic role of the entity doing lenged, most notably by advocates of constraint-satisfaction
the scratching (as in he cat scratched the ratty old sofa). Or one models (Trueswell and Tanenhaus 1994). hese models pro-
could parse it as a more complex relative clause structure, in pose that all sources of relevant information (including statisti-
which the verb scratched is the start of a second, embedded cal, semantic, and real-world information) simultaneously and
clause, and the cat is the entity being scratched, rather than rapidly inluence the actions of the parser. Hence, the implau-
the one doing the scratching (as in he cat scratched by the rac- sibility of a sofa scratching something is predicted to cause
coon was taken to the pet hospital ). he ambiguity is resolved the parser to initially attempt the syntactically more complex
once the disambiguating information (the ratty sofa or by the relative-clause analysis. Consistent with this claim, numer-
raccoon) is encountered downstream, but that provides little ous studies have subsequently demonstrated compelling
help for a parser that assigns roles to words as soon as they are inluences of semantics and world knowledge on the parsers
encountered. response to syntactic ambiguity (ibid.).
How does an incremental sentence-processing system here is, however, a fundamental assumption underlying
handle such ambiguities? An early answer to this question most of the syntactic ambiguity research (regardless of theo-
was provided by the garden-path (or modular) parsing models retical perspective): that syntax always controls combinatory
developed in the1980s. he primary claim was that the initial processing when the syntactic cues are unambiguous. Recently,
parse of the sentence is controlled entirely by the syntactic cues this assumption has also been challenged. he challenge cen-
in the sentence (Ferreira and Clifton 1986). As words arrive in ters on the nature of thematic roles, which help to dei ne
the linguistic input, they are rapidly organized into a structural the types of arguments licensed by a particular verb (McRae,
analysis by a process that is not inluenced by semantic knowl- Ferretti, and Amyote 1997; Trueswell and Tanenhaus 1994).
edge. he output of this syntactic process then guides semantic Exactly what is meant by thematic role varies widely, especially
interpretation. h is model can be contrasted with interactive with respect to the amount of semantic and conceptual content
models, in which a wide variety of information (e.g., seman- it is assumed to hold (McRae, Ferretti, and Amyote 1997). For
tics and conceptual/world knowledge) inluences the earliest most syntax-i rst proponents, a thematic role is limited to a few
stages of sentence parsing. Initial results of numerous studies syntactically relevant selectional restrictions, such as animacy
18
(Chomsky 1965); thematic roles are treated as (largely mean- by changing the inlectional morpheme at the end of the verb
ingless) slots to be i lled by syntactically appropriate i llers. A to a passive form (he mysterious crime was solved ).
second view is that there is a limited number of thematic roles herefore, if meaning drives sentence processing in this situa-
(agent, theme, benefactor, and so on), and that a verb selects a tion, then the verb solving would be perceived to be in the wrong
subset of these (Fillmore 1968). Although this approach attri- syntactic form (-ing instead of ed ), and should therefore elicit a
butes a richer semantics to thematic roles, the required gener- P600 efect. Kim and Osterhout observed that verbs like solving
alizations across large classes of verbs obscure many subtleties elicited a P600 efect, showing that a strong semantic attrac-
in the meaning and usage of these verbs. tion between a predicate and an argument can determine
Both of these conceptions of thematic roles exclude knowl- how words are combined, even when the semantic attraction
edge that people possess concerning who tends to do what to contradicts unambiguous syntactic cues. Conversely, in anom-
whom in particular situations. Ken McRae and others have pro- alous sentences with an identical structure but with no seman-
posed a third view of thematic roles that dramatically expands tic attraction between the subject noun and the verb (e.g., he
their semantic scope: hematic roles are claimed to be rich, envelope was devouring ), the critical verb elicited an N400
verb-speciic concepts that relect a persons collective expe- efect rather than a P600 efect. hese results demonstrate that
rience with particular actions and objects (McRae, Ferretti, semantics, rather than syntax, can drive word combinations
and Amyote 1997). hese rich representations are claimed to during sentence comprehension.
be stored as a set of features that dei ne gradients of typical-
ity (situation schemas), and to comprise a large part of each SENTENCE PRODUCTION. Generating a sentence requires the
verbs meaning. One implication is that this rich knowledge rapid construction of novel combinations of linguistic units,
will become immediately available once a verbs meaning has involves multiple levels of analysis, and is constrained by a vari-
been retrieved from memory. As a consequence, the plausibil- ety of rules (about word order, the formation of complex words,
ity of a particular word combination need not be evaluated by word pronunciation, etc). Errors are a natural consequence of
means of a potentially complex inferential process, but rather these complexities (Dell 1995). Because they tend to be highly
can be evaluated immediately in the context of the verbs mean- systematic, speech errors have provided much of the data upon
ing. One might therefore predict that semantic and conceptual which current models of sentence production are based. For
knowledge of events will have profound and immediate efects example, word exchanges tend to obey a syntactic category rule,
on the way in which words are combined during sentence pro- in that the exchanged words are from the same syntactic category
cessing. McRae and others have provided evidence consistent (for example, two nouns have been exchanged in the utterance
with these claims, including semantic inluences on syntactic Stop hitting your brick against a head wall). he systematicity of
ambiguity resolution. speech errors suggests that regularities described in theories of
he most compelling argument against the absolute pri- linguistic form also play a role in the speech-planning process.
macy of syntax, however, would be evidence that semantic and he dominant model of sentence production is based
conceptual knowledge can take control of sentence process- on speech error data (Dell 1995; Garrett 1975; Levelt 1989).
ing even when opposed by contradicting and unambiguous According to this model, the process of preparing to speak a
syntactic cues. Recent work by Ferranda Ferreira (2003) sug- sentence involves three stages of planning: conceptualization,
gests that this might happen on some occasions. She reported formulation, and articulation, in that order. During the concep-
that when plausible sentences (e.g., he mouse ate the cheese) tualization stage, the speaker decides what thought to express
were passivized to form implausible sentences (e.g., he mouse and how to order the relevant concepts sequentially. he for-
was eaten by the cheese), participants tended to name the wrong mulation stage begins with the selection of a syntactic frame to
entity as do-er or acted-on, as if coercing the sentences to encode the thought; the frame contains slots that act as place
be plausible. However, the processing implications of these holders for concepts and, eventually, speciic words. he phono-
results are uncertain, due to the use of postsentence rumina- logical string is translated into a string of phonological features,
tive responses, which do not indicate whether semantic inlu- which then drive the motor plan manifested in articulation.
ences relect the listeners initial responses to the input or some h is model, therefore, posits the existence of representa-
later aspect of processing. tions of syntactic structure that are distinct from the represen-
Researchers have also begun to explore the inluence of tations of meaning and sound. Other evidence in support of this
semantic and conceptual knowledge on the on-line process- view comes from the phenomenon of syntactic priming : Having
ing of syntactically unambiguous sentences. An illustrative heard or produced a particular syntactic structure, a person
example is a recent ERP study by Albert Kim and Lee Osterhout is more likely to produce sentences using the same syntactic
(2005). he stimuli in this study were anomalous sentences that structure (Bock 1986). Syntactic priming occurs independently
began with a active structure, for example, he mysterious crime of sentence meaning, suggesting that the syntactic frames are
was solving. he syntactic cues in the sentence require that independent forms of representation that are quite distinct
the noun crime be the Agent of the verb solving. If syntax drives from meaning.
sentence processing, then the verb solving would be perceived
to be semantically anomalous, as crime is a poor Agent for the
CONCLUSIONS
verb solve, and therefore should elicit an N400 efect. However,
although crime is a poor Agent, it is an excellent heme (as in Collectively, the evidence reviewed in this essay indicates that
solved the crime). he heme role can be accommodated simply psychologically relevant representations of linguistic form
19
exist at all levels of language, from sounds to sentences. At each Byrd, Dani, and Elliot Saltzman. 2003. he elastic phrase: Modeling
level, units of linguistic form are combined in systematic ways the dynamics of boundary-adjacent lengthening. Journal of
to form larger units of representation. For the most part, these Phonetics 31: 149 80.
representations seem to be abstract; that is, they are distinct Caplan, David. 1995. Issues arising in contemporary studies of disor-
ders of syntactic processing in sentence comprehension in agram-
from the motor movements, sensory experiences, and episodic
matic patients. Brain and Language 50: 325 38.
memories associated with particular utterances. However, it
Caramazza, Alfonzo, and Edgar Zurif. 1976. Dissociations of algorith-
is also clear that more holistic (that is, nondecompositional) mic and heuristic processes in language comprehension: Evidence
representations of linguistic form, some of which are rooted from aphasia. Brain and Language 3: 57282.
in speciic episodic memories, also play a role in language Chistovich, Ludmilla A 1960. Classiication of rapidly repeated speech
processing. sounds. Akustichneskii Zhurnal 6: 39298.
It also seems to be true that linguistic forms (e.g., the mor- Chomsky, Noam. 1957. Syntactic Structures. he Hague: Mouton.
phological structure of a word or the syntactic structure of a . 1965. Aspects of the heory of Syntax . Cambridge, MA : MIT
sentence) are dissociable from the meanings they convey. Press.
At the same time, semantic and conceptual knowledge can Chomsky, Noam, and Morris Halle. 1968. he Sound Pattern of English.
strongly inluence the processing of linguistic forms, as exem- New York : Harper and Row.
Delattre, Pierre, and Donald Freeman. 1968. A dialect study of
pliied by semantic transparency efects on word decomposi-
American Rs by x-ray motion picture. Linguistic s 44: 29 68.
tion and thematic role efects on sentence parsing.
Delattre, Pierre C., Avin M. Liberman, and Franklin S. Cooper. 1955.
hese conclusions represent substantive progress in our Acoustic loci and transitional cues for consonants. Journal of the
understanding of linguistic form and the role it plays in lan- Acoustical Society of America 27: 769 73.
guage processing. Nonetheless, answers to some of the most Dell, Gary S. 1995. Speaking and misspeaking. In An Invitation to
basic questions remain contentiously debated, such as the pre- Cognitive Science: Language. Cambridge, MA : MIT Press.
cise nature of the rules of combination, the relative roles of Draper, M., P. Ladefoged, and D. Whitteridge. 1959. Respiratory mus-
compositional and holistic representations, and the pervasive- cles in speech . Journal of Speech and Hearing Research 2: 16 27.
ness of interactions between meaning and form. Dudley, Homer. 1940. he carrier nature of speech. Bell System
Technical Journal 14: 495 515.
Elman, Jef rey L. 1990. Representation and structure in connection-
ist models. In Cognitive Models of Speech Processing, ed. G. T. M.
Alexander, M. P. 2006 . Aphasia I: Clinical and anatomical issues. Altmann, 22760. Cambridge, MA : MIT Press.
In Patient-Based Approaches to Cognitive Neuroscience (2d ed.), Elman, Jef rey L., Elizabeth A. Bates, A. Karmilof-Smith, D. Parisi, and
ed. M. J. Farah and T. E. Feinberg , 16582. Cambridge, MA : MIT K. Plunkett. 1996. Rethinking innateness. Cambridge, MA: MIT
Press . Press.
Allen, Mark D. 2005. he preservation of verb subcategory knowledge Emmory, Karen. 2002. Language, Cognition, and the Brain: Insights
in a spoken language comprehension deicit. Brain and Language from Sign Language Research. Mahwah, NJ: Lawrence Erlbaum
95: 255 64. Associates.
Allen, Mark , and William Badecker. 1999. Stem homograph inhibition Ferreira, Fernanda. 2003. he misinterpretation of noncanonical sen-
and stem allomorphy: Representing and processing inlected forms tences. Cognitive Psychology 47: 164 203.
in a multilevel lexical system. Journal of Memory and Language Ferreira, Fernanda, and Charles Clifton, Jr. 1986. he independence of
41: 105 23. syntactic processing. Journal of Memory and Language 25: 348 68.
Allport, D. A., and E. Funnell. 1981. Components of the mental lexi- Fillmore, Charles. 1968. he case for case. In Universals of Linguistic
con. Philosophical Transactions of the Royal Society of London B heory, ed. E. Bach, 180. New York : Holt, Rinehart, & Winston.
295: 397410. Fowler, Carol A. 1986. An event approach to the study of speech per-
Anderson, Stephen R . 1985. Phonology in the Twentieth Century: heories ception from a direct-realist perspective. Journal of Phonetics
of rules and heories of Representations. Chicago: he University of 14: 328.
Chicago Press. . 1995. Speech production. In Speech, Language, and Communi-
Bell, Alexander M. 1867. Visible Speech: he Science of Universal cation, ed. J. L. Miller and P. D. Eimas, 2961. New York : Academic
Alphabetics. London: Simpkin, Marshal. Press.
Bloomield, Leonard. 1933. Language. New York : H. Holt & Co. . 1996. Listeners do hear sounds not tongues. Journal of the
Bock , J. Katherine. 1986. Syntactic persistence in language produc- Acoustical Society of America 99: 1730 41.
tion. Cognitive Psychology 18: 355 87. Fowler, Carol, Susan Napps, and Laurie Feldman. 1985. Relations
Bock, J. K., and Anthony S. Kroch. 1989. he isolability of syntac- among regular and irregular morphologically related words in the
tic processing. In Linguistic Structure in Language Processing, ed.. lexicon as revealed by repetition priming. Memory and Cognition
G. N. Carlson and M. K. Tanenhaus, 15796. Boston: Kluwer 13: 24155.
Academic. Franklin, S., J. Lambon Ralph, J. Morris, and P. Bailey. 1996. A distinc-
Brentari, Dianne. 1998. A Prosody Model of Sign Language Phonology. tive case of word meaning deafness? Cognitive Neuropsychology
Cambridge, MA : MIT Press. 13: 1139 62.
Browman, Catherine P., and Louis Goldstein. 1990. Gestural specii- Frazier, L. 1987. Sentence processing: A tutorial review. In Attention
cation using dynamically-dei ned articulatory structures. Journal and Performance XII: he Psychology of Reading, ed. M. Coltheart,
of Phonetics 18: 299 320. 330. Hillsdale, NJ: Lawrence Erlbaum Associates.
Bybee, Joan . 1988. Morphology as lexical organization. In h eoretical . 1995. Representation in psycholinguistics. In Speech,
Morphology: Approaches in Modern Linguistics, ed. M. Hammond Language, and Communication, ed. J. L. Miller and P. D. Eimas, 127.
and M. Noonan, 11941. San Diego, CA : Academic Press. New York : Academic Press.
20
Garrett , Merrill F. 1975. he analysis of sentence production. In h e McClelland, Jamesv, and David Rumelhart. 1986. Parallel Distributed
Psychology of Learning and Motivation , ed. G. Bower, 13377. New Processing: Explorations in the Microstructure of Cognition . Vol. 1.
York : Academic Press . Cambridge, MA : MIT Press.
Goldinger, Stephen D., and Tamiko Azuma . 2003. Puzzle-solving McGurk , Harry, and John Macdonald. 1976. Hearing lips and seeing
science: he quixotic quest for units of speech perception . Journal voices. Nature 264: 746 8.
of Phonetics 31: 305 20. McRae, Ken, Todd R. Ferretti, and Liane Amyote. 1997. hematic roles
Goldinger, Stephen D., David B. Pisoni, and John S. Logan . 1991. On as verb-speciic concepts. Language and Cognitive Processes 12.2:
the nature of talker variability efects on recall of spoken word 13776.
lists . Journal of Experimental Psychology: Learning, Memory and Meier, Richard P., Kearsy Cormier, and David Quinto-Pozos. 2002.
Cognition 17: 152 62. Modularity in Signed and Spoken Languages. Cambridge: Cambridge
Hall, D. A., and M. J. Riddoch . 1997. Word meaning deafness: Spelling University Press.
words that are not understood . Cognitive Neuropsychology Meyer, Antje, and Kathryn Bock . 1992. Tip-of-the-tongue phe-
14: 113164. nomenon: Blocking or partial activation? Memory and Cognition
Hillis , Argye E. 2000. he organization of the lexical system. In 20: 715 26.
What Dei cits Reveal about the Human Mind/Brain: Handbook Miceli, G., L. Mazzuchi, L. Menn, and H. Goodglass. 1983. Contrasting
of Cognitive Neuropsychology, ed. B. Rapp, 185210. New cases of Italian agrammatic aphasia without comprehension disor-
York : Psychology Press . der. Brain and Language 19: 65 97.
House, Arthur S., and Grant Fairbanks. 1953. he inluence of con- Miller, George A., and Patricia E. Nicely. 1955. An analysis of percep-
sonant enviroment upon the secondary acoustical characteristics tual confusions among some English consonants. Journal of the
of vowels . Journal of the Acoustical Society of America 25: 105 13. Acoustical Society of America 27: 329 35.
Jackendof, Ray. 1997. h e Architecture of the Language Faculty. Miniie, Fred D. 1999. he history of physiological phonetics in the
Cambridge, MA : MIT Press . United States. In A Guide to the History of the Phonetic Sciences in
Jakobson, Roman . 1939. Observations sur le classment pho- the United States: Issued on the Occasion of the 14th International
nologique des consonnes . Proceedings of the 3rd International Congress of Phonetic Sciences, San Francisco, 17 August 1999, ed.
Conference of Phonetic Sciences, 34 41. Ghent. J. Ohala, A. Bronstein, M. Bus, L. Grazio, J. Lewis, and W. Weigel.
Jakobson, Roman, Gunnar Fant , and Morris Halle. 1952 . Preliminaries Berkeley : University of California Press..
to Speech Analysis . Cambridge, MA : MIT Press . Moore, Brian C. J. 1989. An Introduction to the Psychology of Hearing. 3d
Joos , Martin . 1948 . Acoustic Phonetics . Language Monograph 23, ed. London: Academic Press.
Supplement to Language 24: 136. Nygaard, Lynn C., and David B. Pisoni. 1995. Speech
Kempen, Gerard, and Pieter Huijbers . 1983. he lexicalization Perception: New directions in research and theory. In Speech,
process in sentence production and naming: Indirect election of Language, and Communication, ed. J. Miller and P. Eimas, 6396.
words . Cognition 14: 185 209. New York : Academic Press.
Kim , Albert, and Lee Osterhout . 2005. he independence of combi- Osterhout, Lee, and Philip J. Holcomb. 1992. Event-related brain
natory semantic processing: Evidence from event-related poten- potentials elicited by syntactic anomaly. Journal of Memory and
tials . Journal of Memory and Language 52: 205 25. Language 31: 785 806.
Kiparsky, Paul. 1979. Panini as a Variationist . Cambridge, MA : MIT Passy, Paul. 1888. Our revised alphabet. he Phonetic Teacher, 5760.
Press . Pinker, Steven. 1999. Words and Rules: he Ingredients of Language.
Khnert , Barbara, and Francis Nolan . 1999. he origin of coartic- New York : Basic Books.
ulation. In Coarticulation: h eory, Data and Techniques , ed. B. Potter, Ralph K. 1945. Visible patterns of sound. Science 102: 46370.
Rapp, 730. Cambridge: Cambridge University Press . Rousselot, P.-J. 18971901. Principes de Phontique Experimentale.
Kutas , Marta, and Steven A. Hillyard . 1980. Reading senseless Paris: H. Welter.
sentences: Brain potentials relect semantic anomaly. Science Rueckl, Jay, Michelle Mikolinski, Michal Raveh, Caroline Miner, and F.
207: 203 5. Mars. 1997. Morphological priming, fragment completion, and con-
Ladefoged, P., and N. McKinney. 1963. Loudness, sound pressure, nectionist networks. Journal of Memory and Language 36: 382405.
and subglottal pressure in speech . Journal of the Acoustical Saltzman, Elliot L., and Kevin G. Munhall. 1989. A dynamical
Society of America 35: 454 60. approach to gestural patterning in speech production. Ecological
Lambon, Ralph M., K. Sage, and A. Ellis . 1996 . Word meaning blind- Psychology 1: 33382.
ness: A new form of acquired dyslexia . Cognitive Neuropsychology Saussure, Ferdinand de. [2002] 2006. crits de linguistique gnrale, ed.
13: 61739. Simon Bouquet and Rudolf Engler. Paris: Gallimard. English trans-
Levelt , Willem . 1989. Speaking: From intention to articulation . lation: Writings in General Linguistics, Oxford: Oxford University
Cambridge, MA : MIT Press . Press.
Liberman, Alvin M., Pierre C. Delattre, Franklin S. Cooper, and Scharenborg , O., D. Norris, L. ten Bosch, and J. M. McQueen. 2005.
Lou J. Gerstman . 1954 . he role of consonant vowel transitions How should a speech recognizer work? Cognitive Science
in the perception of the stop and nasal consonants . Journal of 29: 867918.
Experimental Psychology 52: 12737. Schriefers, Herbert, Antje Meyer and Willem Levelt. 1990. Exploring
Liberman, Alvin M., and Ignatius G. Mattingly. 1985. h e motor the time course of lexical access in language production: Picture-
theory of speech perception revised . Cognition 21: 136. word interference studies. Journal of Memory and Language
Linebarger, Marcia, Myrna Schwartz , and Eleanor Saf ran . 1983. 29: 86 102.
Sensitivity to grammatical structure in so-called agrammatic Scripture, Edward Wheeler. 1902. he Elements of Experimental
aphasics . Cognition 13: 36193. Phonetics. New York : Charles Scribners Sons.
Massaro, Dominic W. 1987. Speech Perception by Ear and Eye: A Sievers, Eduard. 1876. Grundzuge der Lautphysiologie zur Einfuhrung
Paradigm for Psychological Inquiry. Hillsdale, NJ: Lawrence in das Studium der Lautlehere der Indogermanischen Sprachen.
Erlbaum Associates . Leipzig: Breitkopf and Hartel.
21
Soli, Sig D. 1982. Structure and duration of vowels together specify frica- Sweet, Henry. 1881. Sound notation. Transactions of the Philological
tive voicing. Journal of the Acoustical Society of America 72: 366 78. Society: 177235.
Stetson, Raymond H. 1905. A motor theory of rhythm and discrete Trueswell, John C., and Michael K. Tanenhaus. 1994. Toward a lexi-
succession II. Psychological Review 12: 293350. calist framework of constraint-based syntactic ambiguity reso-
. 1951. Motor Phonetics: A Study of Speech Movement in Action. 2d lution. In Perspectives on Sentence Processing , ed. C. Clifton, L.
ed. Amsterdam: North-Holland. Frazier, and K. Rayner, 15580. Hillsdale, NJ: Lawrence Erlbaum
Studdert-Kennedy, Michael. 1980. Speech perception. Language and Associates.
Speech 23: 45 65.
22
(2) a. he girl laughed and sang.
b. he girl laughed.
h is is an example of structural entailment, because it is the

3 structure itself that allows the inference (i.e., if someone does
both A and B, then someone does A). h is particular rule is
essentially the classical inference rule of conjunction elimina-
tion from propositional logic; that is,
THE STRUCTURE OF MEANING AB
(3)
A
James Pustejovsky
While this relies on a largely syntactic notion of entailment,
semantics should also explain how (4b) is a legitimate infer-
ence from (4a).
(4) a. he drought killed the crops.

b. he crops died.
Such lexical entailments involve an inference that is tied directly

to the meaning of a word, namely, the verb kill; that is, when
1 INTRODUCTION
something is killed, it dies. Hence, the role of lexical informa-
Semantics is the systematic study of meaning in language. As a tion in the construction of logical forms and the inferences we
discipline, it is directed toward the determination of how humans can compute from our utterances is an important area of lin-
reason with language, and more speciically, discovering the pat- guistics, and one we return to in Section 3.5 below.
terns of inference we employ through linguistic expressions.he here is an important distinction in semantics among prop-
study of semantics has diverse traditions, and the current litera- ositions, sentences, and utterances. We can think of an
ture is quite heterogeneous and divided on approaches to some utterance as a speech-act, situated in time and space, that is,
of the basic issues facing the ield (cf. semantics). While most which happens at a particular time and location. A sentence, on
things in the world have meaning to us, they do not carry mean- the other hand, is a expression that is inherently linguistic, and
ing in the same way as linguistic expressions do. For example, can be expressed on multiple occasions by multiple utterances.
they do not have the properties of being true or false, or ambigu- he notion of a proposition is more complex and contentious, but
ous or contradictory. (See Davis and Gillon [2004] for discussion it is that object that is traditionally taken as being true or false,
and development of this argument.) For this and other reasons, expressed by the sentence when uttered in a speciic context.
this overview essay addresses the question of how linguistic
expressions carry meaning and what they denote in the world.
1.1 Historical Remarks
Where syntax determines the constituent structure of a sen-
tence along with the assignment of grammatical and thematic he study of meaning has occupied philosophers for centuries,
relations, it is the role of semantics to compute the deeper beginning at least with Platos theory of forms and Aristotles
meaning of the resulting expression. For example, the two sen- theory of meaning. Locke, Hume, and Reid all pay particular
tences in (1) difer in their syntactic structures (through their attention to the meanings of words in composition, but not until
voice), but they mean essentially the same thing; that is, their the late nineteenth century do we see a systematic approach to
propositional content is identical. the study of logical syntax emerge, with the work of Bertrand
Russell and Gottlob Frege. Russell and Frege were not interested
(1) a. he child ate a cookie.
in language as a linguistic phenomenon, but rather as a medium
b. A cookie was eaten by the child.
through which judgments can be formed and expressed. Freges
Early on, such observations led philosophers and linguists to focus lay in formulating the rules that create meaningful expres-
distinguish meaning from the pure structural form of a sen- sions in a compositional manner, while also introducing an
tence (Saussure [1916] 1983; Russell 1905). Semantic theories in important distinction between an expressions sense and its ref-
linguistics assume that some sort of logical form is computed erence (cf. sense and reference, reference and exten-
from the constituent structure associated with a sentence, and sion). Russells work on the way in which linguistic expressions
it is this meaning representation that allows us to make cate- denote introduced the problem of definite descriptions
gorical and truth-conditional judgments, such as the equiva- and referential failure, and what later came to be recognized as
lence in meaning of the two sentences in (1). the problem of presupposition (cf. pragmatics).
Another role played by semantics is in the computation of Ferdinand de Saussure ([1916] 1983), working within an
inferences from our utterances, such as entailments, implica- emerging structuralist tradition, developed relational tech-
tures, and presuppositions. For example, consider the var- niques for linguistic analysis, which were elaborated into a
ious notions of entailment. From the logical form (LF) of the framework of componential analysis for language meaning.
sentence in (2a), semantics enables us to infer (2b) as a legiti- he idea behind componential analysis is the reduction of a
mate inference. words meaning into its ultimate contrastive elements. hese
23
contrastive elements are structured in a matrix, allowing for is the notion of compositionality (cf. compositionality). As
dimensional analysis and generalizations to be made about speakers of language, we understand a sentence by under-
lexical sets occupying the cells in the matrix. standing its parts and how they are put together. he principle
h is technique developed into a general framework for lin- of compositionality characterizes how smaller units of mean-
guistic description called distinctive FEATURE ANALYSIS. h is is ing are put together to form larger, more meaningful expres-
essentially the inspiration for J. Katz and J. Fodors 1963 the- sions in language. he most famous formulation of this notion
ory of lexical semantics within transformational grammar. On comes from Frege, paraphrased as follows:
this theory, usually referred to as markerese, a lexical entry in
he meaning of an expression is a function of the meanings of
the language consists of grammatical and semantic markers,
its parts and the way they are syntactically combined. (Partee
and a special feature called a semantic distinguisher. In the
1984)
subsequent discussion by U. Weinreich (1972) and many oth-
ers, this model was demonstrated to be far too impoverished to h is view has been extremely inluential in semantics
characterize the compositional mechanisms inherent in lan- research over the past 40 years. If one assumes a compositional
guage. In the late 1960s and early 1970s, alternative models of approach to the study of meaning, then two things immediately
word meaning emerged (Fillmore 1968 [frame semantics]; follow: 1) One must specify the speciic meaning of the basic
Lakof [1965] 1970 [generative semantics]; Gruber 1976; elements of the language, and 2) one must formulate the rules
Jackendof 1972), which respected the relational structure of of combination for how these elements go together to make
sentence meaning while encoding the named semantic func- more complex expressions. he i rst aspect includes determin-
tions in lexical entries. In D. R. Dowty (1979), a model theoretic ing what words and morphemes mean, that is, lexical seman-
interpretation of the decompositional techniques of G. Lakof, tics, which we address in the next section. he second aspect
J. D. McCawley, and J. R. Ross was developed. entails dei ning a calculus for how these elements compose to
In the later twentieth century, montague grammar form larger expressions, that is, argument selection and modi-
(Montague 1973, 1974) was perhaps the most signiicant develop- ication. Needless to say, in both of these areas, there is much
ment in the formal analysis of linguistic semantics, as it brought divergence of opinion, but semanticists generally agree on the
together a systematic, logically grounded theory of composition- basic assumptions inherent in compositionality.
ality, with a model theoretic interpretation. Subsequent work
enriched this approach with insights from D. Davidson (1967),
2 LEXICAL MEANING
H. P. Grice (1969), Saul Kripke ([1972] 1980), David Lewis (1976),
and other philosophers of language (cf. Partee 1976; Davidson Semantic interpretation requires access to knowledge about
and Harman 1972). words. he lexicon of a grammar must provide a systematic and
Recently, the role of lexical-syntactic mapping has become eicient way of encoding the information associated with words
more evident, particularly with the growing concern over projec- in a language. lexical semantics is the study of what words
tion from lexical semantic form, the problem of verbal alternations mean and how these meanings are structured. he lexicon is
and polyvalency, and the phenomenon of polysemy. he work of not merely a collection of words with their semantic forms, but
R. Jackendof (1983, 1997) on conceptual semantics has come to rather a set of structured objects that participate in larger opera-
the fore, as the ield of lexical semantics has developed into a more tions and compositions, both enabling syntactic environments
systematic and formal area of study (Pustejovsky and Boguraev and acting as signatures to semantic entailments and implica-
1993; Copestake and Briscoe 1995, 1567). tures in the context of larger discourse.
Finally, one of the most signiicant developments in the here are four basic questions in modeling the semantic
study of meaning has been the dynamic turn in how sentences content and structure of the lexicon: 1) What semantic infor-
are interpreted in discourse. Inspired by the work of Irene Heim mation goes into a lexical entry? 2) How do lexical entries
(1982) and H. Kamp (1981), the formal analysis of discourse has relate semantically to one another? 3) How is this informa-
become an active and growing area of research, as seen in the tion exploited compositionally by the grammar? 4) How is this
works of Jeroen Groenendijk and Martin Stokhof (1991), Kamp and information available to semantic interpretation generally?
U. eyle (1993), and Nicholas Asher and Alex Lascarides (2003). he lexicon and lexical semantics have traditionally been
In the remainder of this essay, we examine the basic prin- viewed as the most passive modules of language, acting in the ser-
ciple of how meanings are constructed. First, we introduce vice of the more dynamic components of the grammar. his view
the notion of compositionality in language. Since words are has its origins in the generative tradition (Chomsky [1955] 1975)
the building blocks of larger meanings, we explore various and has been an integral part of the notion of the lexicon ever
approaches to lexical semantics. hen, we focus on how units since. While the Aspects-model of selectional features (Chomsky
of meaning are put together compositionally to create proposi- 1965) restricted the relation of selection to that between lexical
tions. Finally, we examine the meaning of expressions above items, work by McCawley (1968) and Jackendof (1972) showed
the level of the sentence, within a discourse. that selectional restrictions must be available to computations
at the level of derived semantic representation rather than at
deep structure. Subsequent work by Joan Bresnan (1982), Gerald
1.2 Compositionality
Gazdar et al. (1985), and C. Pollard and I. Sag (1994) extend the
Because semantics focuses on how linguistic expressions come range of phenomena that can be handled by the projection and
to have meaning, one of the most crucial concepts in the ield exploitation of lexically derived information in the grammar.
24
The Structure of Meaning
Natural Entity and alternations, among other relations (cf. Pustejovsky and
Boguraev 1993).
Physical Abstract
2.2 Argument Structure
Once the basic semantic types for the lexical items in the
Mass Individuated Mental Experiential
language have been speciied, their subcategorization and
selectional information must be encoded in some form. he
inanimate animate argument structure for a word can be seen as the simplest spec-
iication of its semantics, indicating the number and type of
rock human parameters associated with the lexical item as a predicate. For
Figure 1. example, the verb die can be represented as a predicate taking
one argument, and kill as taking two arguments, while the verb
give takes three arguments.
Recently, with the convergence of several areas in linguis-
(5) a. die(x)
tics (lexical semantics, computational lexicons, type theories),
b. kill(x,y)
several models for the determination of selection have emerged
c. give(x,y,z)
that put even more compostional power in the lexicon, making
explicit reference to the paradigmatic systems that allow for What originally began as the simple listing of the parameters
grammatical constructions to be partially determined by selec- or arguments associated with a predicate has developed into
tion. Examples of this approach are generative lexicon theory a sophisticated view of the way arguments are mapped onto
(Pustejovsky 1995; Bouillon and Busa 2001), and construc- syntactic expressions. E. Williamss (1981) distinction between
tion grammar (Goldberg, 1995; Jackendof 1997, 2002). hese external (the underlined arguments for kill and give) and
developments have helped to characterize the approaches to internal arguments and J. Grimshaws proposal for a hierarchi-
lexical design in terms of a hierarchy of semantic expressive- cally structured representation (cf. Grimshaw 1990) provide
ness. here are at least three such classes of lexical description, us with the basic syntax for one aspect of a words meaning.
dei ned as follows: sense enumerative lexicons, where lexical Similar remarks hold for the argument list structure in HPSG
items have a single type and meaning, and ambiguity is treated (head-driven phrase structure grammar) and LFG
by multiple listings of words; polymorphic lexicons, where lexi- (lexical-functional grammar).
cal items are active objects, contributing to the determination of One inluential way of encoding selectional behavior has
meaning in context, under well-dei ned constraints; and unre- been the theory of thematic relations (cf. thematic roles;
stricted sense lexicons, where the meanings of lexical items are Gruber 1976; Jackendof 1972). hematic relations are now gen-
determined mostly by context and conventional use. It seems erally dei ned as partial semantic functions of the event being
clear that the most promising direction seems to be a careful denoted by the verb or noun, and behave according to a pre-
and formal elucidation of the polymorphic lexicons, and this will dei ned calculus of role relations (e.g., Carlson 1984; Dowty
form the basis of our subsequent discussion. 1991; Chierchia 1989). For example, semantic roles, such as
Lexical items can be systematically grouped according to agent, theme, and goal, can be used to partially determine
their syntactic and semantic behavior in the language. For this the meaning of a predicate when they are associated with the
reason, there have been two major traditions of word cluster- grammatical arguments to a verb.
ing, corresponding to this distinction. Broadly speaking, for
(6) a. put< AGENT,THEME,LOCATION>
those concerned mainly with grammatical behavior, the most
b. borrow<RECIPIENT,THEME,SOURCE>
salient aspect of a lexical item is its argument structure; for those
focusing on a words entailment properties, the most important hematic roles can be ordered relative to each other in terms
aspect is its semantic class. In this section, we examine these of an implicational hierarchy. For example, there is consider-
two approaches and see how their concerns can be integrated able use of a universal subject hierarchy such as shown in the
into a common lexical representation. following (cf. Fillmore 1968; Comrie 1981).
(7) AGENT > RECIPIENT/BENEFACTIVE > THEME/PATIENT >

2.1 Semantic Classes INSTRUMENT > LOCATION
One of the most common ways to organize lexical knowledge Many linguists have questioned the general explanatory
is by means of type or feature inheritance mechanisms (Evans coverage of thematic roles, however, and have chosen alter-
and Gazdar 1990; Carpenter 1992; Copestake and Briscoe native methods for capturing the generalizations they prom-
1992; Pollard and Sag 1994). Furthermore, T. Briscoe, V. de ised. Dowty (1991) suggests that theta-role generalizations
Paiva, and A. Copestake (1993) describe a rich system of types are best captured by entailments associated with the predi-
for allowing default mechanisms into lexical type descrip- cate itself. A theta-role can then be seen as the set of predicate
tions. Similarly, type structures, such as that shown in Figure entailments that are properties of a particular argument to the
1, can express the inheritance of syntactic and semantic fea- verb. Characteristic entailments might be thought of as proto-
tures, as well as the relationship between syntactic classes type roles, or proto-roles; this allows for degrees or shades of
25
meaning associated with the arguments to a predicate. Others 12 CAUSE x, BECOME NOT ALIVE y
have opted for a more semantically neutral set of labels to
Here, the predicate CAUSE is represented as a relation between
assign to the parameters of a relation, whether it is realized as
an individual causer x and an expression involving a change
a verb, noun, or adjective. For example, the theory of argument
of state in the argument y. R. Carter ([1976] 1988) proposes a
structure as developed by Williams (1981), Grimshaw (1990),
representation quite similar, shown here for the causative verb
and others can be seen as a move toward a more minimal-
darken:
ist description of semantic diferentiation in the verbs list of
parameters. 13 x CAUSE y BE.DARK CHANGE
he interaction of a structured argument list and a rich
Although there is an intuition that the cause relation
system of types, such as that presented previously, provides
involves a causer and an event, neither Lakof nor Carter makes
a mechanism for semantic selection through inheritance.
this commitment explicitly. In fact, it has taken several decades
Consider, for instance the sentence pairs in (8).
for Davidsons (1967) observations regarding the role of events
(8) a. he man / the rock fell. in the determination of verb meaning to i nd their way con-
b. he man / *the rock died. vincingly into the major linguistic frameworks. Recently, a new
synthesis has emerged that attempts to model verb meanings
Now consider how the selectional distinction for a feature
as complex predicative structures with rich event structures
such as animacy is modeled so as to explain the selectional
(cf. Parsons 1990; Pustejovsky 1991b; Tenny 1992; Krifka 1992).
constraints of predicates. For the purpose of illustration, the
h is research has developed the idea that the meaning of a verb
arguments of a verb will be identiied as being typed from the
can be analyzed into a structured representation of the event
system shown previously.
that the verb designates, and has furthermore contributed to
(9) a. x :physical[fall(x)] the realization that verbs may have complex, internal event
b. x :animate[die(x)] structures. Recent work has converged on the view that com-
In the sentences in (8), it is clear how rocks cant die and men plex events are structured into an inner and an outer event,
can, but it is still not obvious how this judgment is computed, where the outer event is associated with causation and agency
given what we would assume are the types associated with and the inner event is associated with telicity (completion) and
the nouns rock and man, respectively. What accomplishes this change of state (cf. Tenny and Pustejovsky 2000; Levin and
computation is a rule of subtyping, , that allows the type asso- Rappaport Hovav 2005).
ciated with the noun man (i.e., human) to also be accepted as Jackendof (1990) develops an extensive system of what he
the type animate, which is what the predicate die requires of its calls Conceptual Representations, which parallel the syntactic
argument as stated in (9b) (cf. Gunter 1992; Carpenter 1992): representations of sentences of natural language. hese employ
(10) [human animate]: human animate

a set of canonical predicates, including CAUSE, GO, TO, and ON,
and canonical elements, including hing, Path, and Event. hese
he rule applies since the concept human is subtyped under approaches represent verb meaning by decomposing the predi-
animate in the type hierarchy. Parallel considerations rule out cate into more basic predicates. h is work owes obvious debt to
the noun rock as a legitimate argument to die since it is not sub- the innovative work within generative semantics, as illustrated
typed under animate. Hence, one of the concerns given for the by McCawleys (1968) analysis of the verb kill. Recent versions
way that syntactic processes can systematically keep track of of lexical representations inspired by generative semantics can
which selectional features are entailed and which are not is par- be seen in the Lexical Relational Structures of K. Hale and S.
tially addressed by such lattice traversal rules as the one pre- J. Keyser (1993), where syntactic tree structures are employed
sented here. to capture the same elements of causation and change of state
as in the representations of Carter, Levin and T. Rapoport,
Jackendof, and Dowty. he work of Levin and Rappaport, build-
2.3 Decomposition ing on Jackendof s Lexical Conceptual Structures, has been
he second approach to the aforementioned lexical speciica- inluential in further articulating the internal structure of verb
tion is to dei ne constraints internally to the predicate itself. meanings (see Levin and Rappaport 1995).
Traditionally, this has been known as lexical decomposition. J. Pustejovsky (1991b) extends the decompositional approach
Since the 1960s, lexical semanticists have attempted to for- presented in Dowty (1979) by explicitly reifying the events and
mally model the semantic relations between such lexical items subevents in the predicative expressions. Unlike Dowtys treat-
as the adjective dead and the verbs die and kill (cf. Lakof [1965] ment of lexical semantics, where the decompositional calcu-
1970; McCawley 1968) in the sentences that follow. lus builds on propositional or predicative units (as discussed
earlier), a syntax of event structure makes explicit reference
(11) a. John killed Bill.
to quantiied events as part of the word meaning. Pustejovsky
b. Bill died.
further introduces a tree structure to represent the temporal
c. Bill is dead.
ordering and dominance constraints on an event and its subev-
Assuming that the underlying form for a verb like kill directly ents. For example, a predicate such as build is associated with
encodes the stative predicate in (11c) and the relation of causation, a complex event such as that shown in the following (cf. also
generative semanticists posited representations such as (12). Moens and Steedman 1988).
26
(14) [transition [e1:PROCESS ] [e2:STATE ] ] Although chair and rock are both physical objects, they difer
in their mode of coming into being (i.e., agentive): Chairs are
he process consists of the building activity itself, while the State
man-made; rocks develop in nature. Similarly, a concept such
represents the result of there being the object built. Grimshaw
as food or cookie has a physical manifestation or denotation,
(1990) adopts this theory in her work on argument structure,
but also a functional grounding pertaining to the relation of
where complex events such as break are given a similar represen-
eating. hese apparently contradictory aspects of a category
tation. In such structures, the process consists of what x does to
are orthogonally represented by the qualia structure for that
cause the breaking, and the state is the resultant state of the bro-
concept, which provides a coherent structuring for diferent
ken item. he process corresponds to the outer causing event as
dimensions of meaning.
discussed earlier, and the state corresponds in part to the inner
change of state event. Both Pustejovsky and Grimshaw difer
from earlier authors in assuming a speciic level of representa- 2.5 The Problem of Polysemy
tion for event structure, distinct from the representation of other
lexical properties. Furthermore, they follow J. Higginbotham Given the compactness of a lexicon relative to the number of
(1989) in adopting an explicit reference to the event place in the objects and relations in the world, and the concepts we have
verbal semantics. Recently, Levin and Rappaport (2001, 2005) for them, lexical ambiguity is inevitable. Add to this the cul-
have adopted a large component of the event structure model for tural, historical, and linguistic blending that contributes to the
their analysis of verb meaning composition. meanings of our lexical items, and ambiguity can appear arbi-
trary as well. Hence, homonymy where one lexical form has
2.4 Noun Meaning many meanings is to be expected in a language. Examples of
homonyms are illustrated in the following sentences:
hus far, we have focused on the lexical semantics of verb
entries. All of the major categories, however, are encoded with (15) a. Mary walked along the bank of the river.
syntactic and semantic feature structures that determine their b. She works for the largest bank in the city.
constructional behavior and subsequent meaning at logical (16) a. he judge asked the defendant to approach the bar.
form. In Generative Lexicon heory (Pustejovsky, 1995), it is b. he defendant was in the pub at the bar.
assumed that word meaning is structured on the basis of four
Weinreich (1964) calls such lexical distinctions contrastive
generative factors (qualia roles) that capture how humans
ambiguity, where it is clear that the senses associated with
understand objects and relations in the world and provide the
the lexical item are unrelated. For this reason, it is generally
minimal explanation for the linguistic behavior of lexical items
assumed that homonyms are represented as separate lexical
(these are inspired in large part by Moravcsiks (1975, 1990)
entries within the organization of the lexicon. h is accords with
interpretation of Aristotelian aitia). hese are: the formal
a view of lexical organization that has been termed a sense enu-
role: the basic category that distinguishes the object within
meration lexicon (cf. Pustejovsky 1995). Words with multiple
a larger domain; constitutive role: the relation between an
senses are simply listed separately in the lexicon, but this does
object and its constituent parts; the telic role: its purpose and
not seem to compromise or complicate the compositional pro-
function; and the agentive role: factors involved in the objects
cess of how words combine in the interpretation of a sentence.
origin or coming into being.
h is model becomes diicult to maintain, however, when
Qualia structure is at the core of the generative properties of
we consider the phenomenon known as polysemy. Polysemy is
the lexicon, since it provides a general strategy for creating new
the relationship that exists between diferent senses of a word
types. For example, consider the properties of nouns such as
that are related in some logical manner, rather than arbitrarily,
rock and chair. hese nouns can be distinguished on the basis
as in the previous examples. It is illustrated in the following
of semantic criteria that classify them in terms of general cat-
sentences (cf. Apresjan 1973; Pustejovsky 1991a, 1998).
egories, such as natural kind or artifact object. Although very
useful, this is not suicient to discriminate semantic types in a (17) a. Mary carried the book home.
way that also accounts for their grammatical behavior. A cru- b. Mary doesnt agree with the book.
cial distinction between rock and chair concerns the properties
(18) a. Mary has her lunch in her backpack.
that diferentiate natural kinds from artifacts : Functionality
b. Lunch was longer today than it was yesterday.
plays a crucial role in the process of individuation of arti-
(19) a. he l ight lasted three hours.
facts, but not of natural kinds. h is is relected in grammatical
b. he l ight landed on time in Los Angeles.
behavior, whereby a good chair or enjoy the chair are well-
formed expressions relecting the speciic purpose for which Notice that in each of these pairs, the same nominal form
an artifact is designed, but good rock or enjoy a rock are is assuming diferent semantic interpretations relative to its
semantically ill-formed since for rock the functionality (i.e., selective context. For example, in (17a), the noun book refers
telic) is undei ned. Exceptions exist when new concepts are to a physical object, while in (17b), it refers to the informa-
referred to, such as when the object is construed relative to a tional content. In (18a), lunch refers to the physical manifes-
speciic activity, for example, as in he climber enjoyed that tation of the food, while in (18b), it refers to the eating event.
rock; rock itself takes on a new meaning, by virtue of having Finally, in (19a), light refers to the l ying event, while in
telicity associated with it, and this is accomplished by inte- (19b), it refers to the plane. h is phenomenon of regular (or
gration with the semantics of the subject noun phrase (NP). logical) polysemy is one of the most challenging in semantics
27
and has stimulated much research recently (Bouillon 1997;

Bouillon and Busa 2001; Cooper 2006). he determination
of what such lexical items denote will of course have conse-
quences for ones theory of compositionality, as we will see in
a later section.
3 BUILDING SENTENCE MEANINGS

Figure 2.
3.1 Function Application
language of types, we can express the rule of APPLY as a prop-
he principle of compositionality follows the view that syntax
erty associated with predicates (or functions), and application
is an initial guide to the interpretation process. Hence, there
as a relationship between expressions of speciic types in the
would appear to be a strong relationship between the meaning
language.
of a phrase and where it appears in a sentence, as is apparent
from grammatical function in the following sentences. (24) Function Application:
If is of type a, and is of type a b, then () is of type b.
(20) a. he woman loves the child.
b. he child loves the woman. Viewed as typed expressions, the separate linguistic units
in (23a) combine as function application, as illustrated in
However, this is not always a reliable association, as seen
Figure 2.
in languages that have freer word order restrictions, such as
As one can see, the -calculus is an expressive mechanism
German.
for modeling the relation between verbs and their arguments
(21) a. Die Frau liebt das Kind. interpreted as function application.
he woman loves the child. One important extension to the type language used here
b. Das Kind liebt die Frau. provides a compositional analysis of the semantics of proposi-
he child loves the woman. tional attitude verbs, such as believe and think (Montague 1973).
he sentential complements of such verbs, as is well known,
In German, both word orders are ambiguous, since infor-
create opaque contexts for substitutions under identity. For
mation about the grammatical case and gender of the two NPs
example, if Lois is unaware of Supermans true identity, then
is neutralized.
the belief statement in (25b) is false, even though (25a) is true.
Although there is often a correlation between the gram-
matical relation associated with a phrase and the meaning (25) a. Lois believes Superman rescued the people.
assigned to it, this is not always a reliable association. Subjects b. Lois believes Clark Kent rescued the people.
are not always doers and objects are not always undergoers
On this view, verbs such as believe introduce an inten-
in a sentence. For example, notice how in both (22a) and (22b),
sional context for the propositional argument, instead of an
the NP the watch is playing the same role; that is, it is undergo-
extensional one. In such a context, substitution under identity is
ing a change, even though it is the subject in one sentence and
not permitted without possibly afecting the truth value (truth
the object in the other.
conditional semantics). h is is an important contribution
(22) a. he boy broke the watch. to the theory of meaning, in that a property of opacity is associ-
b. he watch broke. ated with speciic types within a compositional framework.
One potential challenge to a theory of function application
To handle such verbal alternations compositionally requires
is the problem of ambiguity in language. Syntactic ambiguities
either positing separate lexical entries for each syntactic con-
arise because of the ways in which phrases are bracketed in a
struction associated with a given verb, or expressing a deeper
sentence, while lexical ambiguity arises when a word has mul-
relation between diferent verb forms.
tiple interpretations in a given context. For example, in the fol-
For most semantic theories, the basic mechanism of com-
lowing sentence, the verb treat can mean one of two things:
positionality is assumed to be function application of some
sort. A rule of application, apply, acts as the glue to assign (or (26) he doctor treated the patient well.
discharge) the argument role or position to the appropriate
candidate phrase in the syntax. hus, for a simple transitive Either 1) the patient is undergoing medical care, or 2) the doc-
sentence such as (23a), two applications derive the proposi- tor was kind to the patient. More often than not, however, the
tional interpretation of the sentence in (23d). context of a sentence will eliminate such ambiguities, as shown
in (27).
(23) a. John loves Mary.
b. love(Arg1,Arg2) (27) a. he doctor treated the patient with antibiotics. (Sense 1)
c. APPLY love(Arg1,Arg2) to Mary = love(Arg1,Mary) b. he doctor treated the patient with care. (Sense 2)
d. APPLY love(Arg1,Mary) to John = love(John,Mary)
In this case, the interpretation is constructed from the appro-
One model used to dei ne the calculus of compositional priate meaning of the verb and how it combines with its
combinations is the -calculus (Barendregt 1984). Using the arguments.
28
3.2 Quantiiers and Scope (32) a. Every woman sang a song.

b. xy[woman(x) [song ( y) & sang (x, y)]]
c. yx[[song ( y) & woman(x)] sang (x, y)]]
Another type of ambiguity, one that is not associated with
the constituent structure of the sentence or lexical senses in
any obvious way, involves quantiied noun phrases (e.g., every An alternative treatment for handling such cases is to posit a
cookie, some cake, and most pies). It is interesting that when a rule of quantiier raising, where the scope ambiguity is reduced
sentence has more than one of these phrases, one often sees to a diference in syntactic structures associated with each
more than one interpretation possible because of the ways the interpretation (May 1985).
quantiied NPs relate to each other. h is is not the case in the
following sentence, however, where there is only one interpre- 3.3 Semantic Modiication
tation as to what happened with the cookie.
In constructing the meaning of expressions, a semantic the-
(28) Some student ate a cookie. ory must also account for how the attribution of properties
Now consider the sentences in (29), where there is a combi- to an entity is computed, what is known as the problem of
nation of a some-NP and an every-NP. modiication. he simplest type of modiication one can imag-
ine is intersective attribution. Notice that in the phrases in (33),
(29) a. Every student saw a movie. the object denoted correctly has both properties expressed in
b. Every cookie was eaten by a student. the NP:
he sentence in (29a) can mean one of two things: 1) that (33) a. black cofee x[black (x) & cofee (x)]
there was one movie, for example, Star Wars, that every stu- b. Italian singer x[Italian(x) & singer (x)]
dent saw; or 2) that everyone saw a movie, but it didnt have to c. metal cup x[metal(x) & cup (x)]
be the same one. Similarly, for (29b), there could be one stu-
dent who ate all the cookies, or each cookie that was eaten here are two general solutions to computing the meaning
by a diferent student. h is kind of quantiier scope ambiguity of such expressions: a) Let adjectives be functions over com-
has to be resolved in order to determine what kind of infer- mon noun denotations, or b) let adjectives be normal predi-
ences one can make from a sentence. Syntax and semantics cates, and have a semantic rule associated with the syntax of
must interact to resolve this kind of ambiguity, and it is the modiication.
theory of sentence meaning that dei nes this interaction (cf. Computing the proper inferences for relative clauses will
quantification). involve a similar strategy, since they are a sort of intersective
One of the roles of semantic theory is to correctly derive the modiication. hat is, for the relative clause in (34), the desired
entailment relations associated with a sentences logical form, logical form will include an intersection of the head noun and
since this has an obvious impact on the valid reasoning pat- the relation predicated in the subordinated clause.
terns in the language. How these interpretations are computed (34) a. writer who John knows
has been an area of intense research, and one of the most inlu- b. x[writer (x) & know ( j, x)]
ential approaches has been the theory of generalized quan- Unfortunately, however, most instances of adjectival modi-
tiiers (cf. Barwise and Cooper 1981). On this approach, the ication do not work so straightforwardly, as illustrated in (35).
denotation of an NP is treated as a set of sets of individuals, and Adjectives such as good, dangerous, and fast modify polyse-
a sentence structure such as [NP VP] is true if and only if the mously in the following sentences.
denotation of the VP is a member of the family of sets denoted
by the NP. hat is, the sentence in (30) is true if and only if sing- (35) a. John is a a good teacher.
ing (the denotation of the VP) is a member of the set of proper- b. A good meal is what we need now.
ties denoted by every woman. c. Mary took a good umbrella with her into the rain.
(30) Every woman sang. In each of these sentences, good is a manner modiier whose
interpretation is dependent on the noun it modiies; in (35a), it
On this view, quantiiers such as most, every, some, and so means to teach well; in (35b), it means a tasty meal; and in
on are actually second-order relations between predicates, (35c), it means something keeping you dry. Similar remarks
and it is partly this property that allows for the compositional hold for the adjective dangerous.
interpretation of quantiier scope variation seen previously.
he intended interpretation of (30) is (31b), where the subject (36) a. h is is a dangerous road at night.
NP every woman is interpreted as a function, taking the VP as b. She used a dangerous knife for the turkey.
its argument. hat is, the road is dangerous in (36a) when one drives on it,
(31) a. P x[woman(x) P (x)](sang) and the knife is dangerous in (36b) when one cuts with it.
b. x[woman(x) sang (x)] Finally, the adjective fast in the following sentences acts as
though it is an adverb, modifying an activity implicit in the
When combined with another quantiied expression, as in noun, that is, programming in (37a) and driving in (37b).
(32a), the relational interpretation of the generalized quantii-
ers is crucial for being able to determine both scope interpreta- (37) a. Mary is the fastest programmer we have on staf.
tions shown in (32). b. he turnpike is a faster road than Main Street.
29
To account for such cases, it is necessary to enrich the mode of Each of the italicized phrases is an argument of something,
composition beyond simple property intersection, to accom- but is it selected by the matrix predicate? Jackendof has pro-
modate the context dependency of the interpretation. Analyses posed a solution that relies on the notion of construction,
taking this approach include Borschev and Partee (2001), as introduced by A. E. Goldberg (1995) (cf. construction
Bouillon (1997), and Pustejovsky (1995). grammars).
Another problem in compositionality emerges from the
interpretation of adjuncts. he question posed by the exam-
3.4 Arguments versus Adjuncts
ples in (41) is this: Which NPs are arguments semantically and
In our discussion thus far of how predicates select arguments to which are merely adjuncts?
create compositionally complex expressions, we have assumed
(41) a. Mary ate the soup.
that the matrix predicate (the main verb of the sentence) acts
b. Mary ate the soup with a spoon.
as the only function over other phrases. In fact, what an argu-
c. Mary ate the soup with a spoon in the kitchen.
ment of the verb is and what an adjunct is are questions just as
d. Mary ate the soup with a spoon in the kitchen at 3:00 p.m.
much of meaning as of syntax. In this section, we examine the
semantic issues involved. For Davidson (1967), there is no semantic distinction between
In this overview, we have adopted the position that lan- arguments and adjuncts in the logical form. Under his pro-
guage relects the workings of our deeper conceptual sys- posal, a two-place predicate such as eat contains an additional
tems in some direct and nonidiosyncratic manner. Lexical argument, the event variable, e, which allows each event par-
choice as well as speciic grammatical phenomena can be ticipant a speciic role in the interpretation (cf. Parsons 1990;
constrained by underlying conceptual bias. Well-known event structure and grammar).
examples of this transparency include count/mass noun
(42) yxe[eat(e, x, y)]
distinctions in the lexicon, and case marking and valence
distinctions in the syntax. For example, concepts entail- hen, any additional adjunct information (such as locations,
ing unindividuated stuf or material will systematically be instruments, etc.) is added by conjunction to the meaning
semantically typed as mass nouns in the grammar, whereas of the main predicate, in a fashion similar to the interpreta-
naturally individuating (countable) substances will assume tion of intersective modiication over a noun. In this man-
the status of count nouns, with their respective grammatical ner, Davidson is able to capture the appropriate entailments
consequences, as illustrated in (38). (Some mass terms are between propositions involving action and event expressions
not shared by all languages, such as the concept of paper through conventional mechanisms of logical entailment. For
or furniture.) example, to capture the entailments between (41bd) and (41a)
in the following, each more speciically described event entails
(38) a. {not much/all/lots of } gold/water/dirt/sand
the one above it by virtue of conjunction elimination (already
b. {every/two/several} chairs/girls/beaches
encountered) on the expression.
(43) a. e[eat(e, m, the-soup)]

Similarly, as presented in previous sections, the classii-
b. e[eat(e, m, the-soup) & with(e, a spoon)]

cation of verbs appears to relect their underlying relational
c. e[eat(e, m, the-soup) & with(e, a spoon) & in(e, the

structure in fairly obvious ways.
(39) a. Mary arrived. kitchen)]

b. John greeted Mary. d. e[eat(e, m, the-soup) & with(e, a spoon) & in(e, the
c. Mary gave a book to John. kitchen) & at (e, 3:00 p.m.)]
hat is, the argument structure of each verb encodes the h is approach has the advantage that no special inference
semantics of the underlying concept, which in turn is relected mechanisms are needed to derive the entailment relations
in the projection to the speciic syntactic constructions, that between the core propositional content in (43a) and forms
is, as intransitive, transitive, and ditransitive constructions, modiied through adjunction. h is solution, however, does not
respectively. For unary, binary, and ternary predicates, there extend to cases of verbs with argument alternations that result
is a visible or transparent projection to syntax from the under- in diferent meanings. For example, how do we determine what
lying conceptual structure, as well as a predictable composi- the core arguments are for a verb like sweep?
tional derivation as function application.
(44) a. John swept.
So, the question arises as to what we do with nonselected
b. John swept the loor.
arguments and adjuncts within the sentence. It is well known,
c. John swept the dirt.
for example, that arguments not selected by the predicate
d. John swept the dirt of the sidewalk.
appear in certain contexts (cf. Jackendof 1992; Levin and
e. John swept the loor clean.
Rappaport Hovav 2005).
f. John swept the dirt into a pile.
(40) a. he man laughed himself sick.
he semantics of such a verb should determine what its argu-
b. he girl danced her way to fame.
ments are, and how the diferent possible syntactic realizations
c. Mary nailed the window shut.
relate to each other semantically. hese cases pose an interest-
30
ing challenge for the theory of compositionality (cf. Jackendof be predicted from the meaning of its parts. We have already
2002). encountered modiication constructions that do not conform to
simple intersective interpretations, for example, good teacher.
here are two other constructions that pose a problem for the
3.5 Presupposition
principle of compostionality in semantics:
In computing the meaning of a sentence, we have focused
(51) a. Idioms: hear it through the grapevine, kick the bucket ;
on that semantic content that is asserted by the proposition.
b. Coercions: begin the book , enjoy a cofee.
h is is in contrast to what is presupposed. A presupposition
is that propositional meaning that must be true for the sen- he meaning of an idiom such as leave well enough alone is
tence containing it to have a proper semantic value (Stalnaker in no transparent way composed of the meanings of its parts.
1970; Karttunen 1974; Potts 2005). (Stalnaker makes the dis- Although there are many interesting syntactic properties and
tinction between what a speaker says and what a speaker constraints on the use of idiomatic expressions in languages,
presupposes.) from a semantic point of view its meaning is clearly associated
Such knowledge can be associated with a word, a grammat- with the entire phrase. Hence, the logical form for (52),
ical feature, or a syntactic construction (so-called presuppo-
(52) Every person kicked the bucket.
sition triggers). For example, in (45) and (46), the complement
proposition to each verb is assumed to be true, regardless of the will make reference to quantiication over persons, but not
polarity assigned to the matrix predicate. over buckets (cf. [53]).
(45) a. Mary realized that she was lost. (53) x[ person(x) & kick.the.bucket (x)]
b. Mary didnt realize that she was lost.
We confront another kind of noncompositionality in
(46) a. John knows that Mary is sick. semantics when predicates seem to appear with arguments of
b. John doesnt know that Mary is sick. the wrong type. For example, in (54a), a countable individ-
ual entity is being coerced into the food associated with that
here are similar presuppositions associated with aspectual
animal, namely, bits of chicken, while in (54b), the mass terms
predicates, such as stop and inish, as seen in (47).
water and beer are being packaged into unit measures (Pelletier,
(47) a. Fred stopped smoking. 1975). In (55), the aspectual verbs normally select for an event,
b. John i nished painting his house. but here are coercing entities into event denotations. Similarly,
in (56), both object NPs are being coerced into propositional
In these constructions, the complement proposition is assumed
interpretations. (Cf. Pustejovsky 1995 and Jackendof 2002 for
to have been true before the assertion of the sentence.
discussions of coercion phenomena and their treatment.)
Such conventional presuppositions are also triggered by
interrogative contexts, such as seen in (48). (54) a. heres chicken in the salad.
b. Well have a water and two beers.
(48) a. Why did you go the store?
b. When did you see Mary? (55) a. Roser i nished her thesis.
b. Mary began the novel.
As with all presuppositions, however, they are defeasible, as
the answer to (48b) in (49) illustrates. (56) a. Mary believes Johns story.
b. Mary believes John.
(49) But I didnt see Mary.
hese examples illustrate that semantics must accommodate
Conversational presuppositions, on the other hand, are impli-
speciic type-shifting and coercing operations in the language
cated propositions by virtue of a context and discourse situation.
in order to remain compositional. In order to explain just such
he response in (50b) conversationally implicates that I am not
cases, Pustejovsky (2007) presents a general theory of compo-
hungry (Recanati 2002); conversational implicature).
sition that distinguishes between four distinct modes of argu-
(50) a. Are you hungry? ment selection: a) function application, b) accommodation,
b. Ive had a very large breakfast. c) coercion by introduction, and d) coercion by exploitation.
he meaning of such implicatures is not part of the asserted

content of the proposition, but computed within a conversa-
4 DISCOURSE STRUCTURE
tional context in a discourse. We will return to this topic in a
later section. hus far we have been concentrating on the meaning of single
sentences. But no sentence is really ever uttered outside of a
context. Language is used as a means of communication and is
3.6 Noncompositionality
as much a way of acting as a means of representing (Austin 1975;
While semantic theory seems to conform to the principles of Searle 1969). In this section, we briely survey the major areas
compositionality in most cases, there are many constructions of research in discourse semantics. We begin by examin-
that do not it into the conventional function application paring the semantic models that have emerged to account for
adigm. A phrase is noncompositional if its meaning cannot dynamic phenomena in discourse, such as intersentential
31
anaphora. We then look at how discourse relations can be used is added to the listeners interpretation state so that the lis-
to model larger units of meaning. tener can use the quantiier to help understand future utter-
From our previous discussion, we have assumed the sen- ances. In this way, the meaning of a sentence is interpreted
tence as the unit for semantic interpretation, including the dynamically.
level for the interpretation of quantiier scope and anaphoric he dynamics of discourse, of course, involve more than the
binding, as in (57). binding of anaphors to antecedents across adjacent sentences.
Every utterance is made in the context of a common ground
(57) a. Every actress said she was happy.
of shared knowledge (presuppositions), with a communica-
b. Every actress came in and said hello.
tive intent, and in a particular time and place (cf. discourse
Notice that the anaphoric link between the quantiier and the analysis, communicative intention). Just as sentences
pronoun in (57a) is acceptable, while such a binding is not pos- have internal structure, with both syntactic and semantic
sible within a larger discourse setting, as in (58) and (59). dependencies, discourse can also be viewed as a sequence of
structured segments, with named dependencies between them.
(58) a. Every actress came in.
For example, the sentences in (62) form a discourse structured
b. *She said she was happy.
by a relation of narration, implying temporal sequence (Dowty,
(59) a. Every actress came in. 1986).
b. *She said hello.
(62) a. John entered the room.
So, in a larger unit of semantic analysis, a bound variable inter- b. He sat down.
pretation of the pronoun does not seem permitted.
In (63), on the other hand, the two sentences are related by the
Now notice that indei nites do in fact allow binding across
dependency of explanation, where (63b) temporally precedes
the level of the sentence.
and explains (63a).
(60) a. An actress came in.
(63) a. Max fell.
b. She said hello.
b. John pushed him.
he desired interpretation, however, is one that the semantic
heories of discourse relations, such as rhetorical struc-
model we have sketched out is unable to provide.
ture theory (Mann and hompson 1986), segmented dis-
(61) a. x[actress (x) & come.in(x)] course representation theory (SDRT) (Asher and Lascarides 3),
b. [& say.hello (x)] and that of Hobbs (1985) attempt to model the rhetorical func-
tions of the utterances in the discourse (hence, they are more
What this example points out is that the view of meaning we
expressive of discourse structure and speaker intent than dis-
have been working with so far is too static to account for phe-
course representation theory [DRT], which does not model
nomena that are inherently dynamic in nature (Chierchia 1995;
such parameters). For the simple discourses above, SDRT, for
Groenendijk and Stokhof 1991; Karttunen 1976). In this exam-
example extends the approach from dynamic semantics with
ple, the indei nite NP an actress is being used as a discourse
rhetorical relations and their semantic values, while provid-
referent, and is available for subsequent reference as the story
ing a more complex process of discourse updates. Rhetorical
unfolds in the discourse.
relations, as used in SDRT, carry speciic types of illocution-
Following Kamp and Reyles (1993) view, an indei nite NP
ary force (cf. Austin 1975; Searle 1969, 9), namely, explanation,
introduces a novel discourse referent, while a pronoun or def-
elaboration, giving backgrounds, and describing results.
inite description says something about an existing discourse
referent. Using the two notions of novelty and familiarity, we
can explain why she in (60b) is able to bind to the indei nite; 5 CONCLUSION
namely, she looks for an accessible discourse referent, the
indei nite. he reason that (58) and (59) are not good discourses In this essay, I have attempted to outline the basic components
is due to the universally quantiied NP every actress, which is for a theory of linguistic meaning. Many areas of semantics were
inaccessible as an antecedent to the pronoun. not touched on in this overview, such as issues relating to the
One inluential formalization of this approach is Dynamic philosophy of language and mind and the psychological conse-
Predicate Logic (Groenendijk and Stokhof, 1991), which com- quences of various semantic positions. Many of the accompa-
bines conventional interpretations of indei nites as existentials nying entries herein, however, address these issues directly.
with the insight from incremental interpretations, mentioned
previously. On this view, the interpretation of a sentence is a WORKS CITED AND SUGGESTIONS FOR FURTHER READING
function of an ordered pair of assignments, rather than a static
Apresjan, J. D. 1973. Synonymy and synonyms. In Trends in Soviet
single assignment. he output condition for a sentence with
heoretical Linguistics, ed. F. Kiefer, 17399. Dordrecht, the
an indei nite NP, such as (60a), speciies that a subsequent sen-
Netherlands: Reidel.
tence with a pronoun can share that variable assignment: he Asher, Nicholas, and Alex Lascarides. 2003. Logics of Conversation.
meaning of a sentence lies in the way it changes the representa- Cambridge: Cambridge University Press.
tion of the information of the interpreter (ibid.). hat is, when Austin, J. L. 1975. How to Do hings with Words. Cambridge: Harvard
a quantiied expression is used in discourse, something new University Press.
32
Barendregt, Henk. 1984. he Lambda Calculus, Its Syntax and Dowty, D. R . 1979. Word Meaning and Montague Grammar. Dordrecht,
Semantics. Amsterdam: North-Holland. the Netherlands: D. Reidel.
Barwise, Jon, and Robin Cooper. 1981. Generalized quantiiers and . 1986. he efects of aspectual class on the temporal structure
natural language. Linguistics and Philosophy 4.1: 159 219. of discourse: Semantics or pragmatics. Linguistics and Philosophy
Borschev, Vladimir, and Barbara H. Partee. 2001. Genitive modiiers, 9.1: 3761.
sorts, and metonymy. Nordic Journal of Linguistics 24.2: 140 60. . 1991. hematic proto-roles and argument selection. Language
Bouillon, P. 1997. Polymorphie et semantique lexicale: Le cas des adjec- 67: 547619.
tifs. Lille: Presses Universitaires du Spetentrion. Evans, R., and G. Gazdar. 1990. he DATR papers: February 1990.
Bouillon, P., and F. Busa, eds. 2001. he Language of Word Meaning. Cognitive Science Research Paper CSRP 139, School of Cognitive
Cambridge: Cambridge University Press. and Computing Science, University of Sussex, Brighton, England.
Bresnan, J., ed. 1982. he Mental Representation of Grammatical Fillmore, C. 1968. he case for case. In Universals in Linguistic
Relations. Cambridge, MA : MIT Press. heory, ed. E. Bach and R. Harms, 188. New York : Holt, Rinehart,
Briscoe, T., V. de Paiva, and A. Copestake, eds. 1993. Inheritance, and Winston.
Defaults, and the Lexicon. Cambridge: Cambridge University Press. Gazdar, G., E. Klein, G. Pullum, and I. Sag. 1985. Generalized Phrase
Busa, F. 1996. Compositionality and the semantics of nominals. Ph.D. Structure Grammar. Cambridge: Harvard University Press.
diss., Brandeis University. Goldberg , A. E. 1995. Constructions: A Construction Grammar Approach
Carlson, G. 1984. hematic roles and their role in semantic interpreta- to Argument Structure. Chicago: University of Chicago Press.
tion. Linguistics 22: 259 79. Grice, H. P. 1969. Utterers meaning and intentions. Philosophical
Carpenter, B. 1992. he Logic of Typed Feature Structures. Review 78: 14777.
Cambridge: Cambridge University Press. .1989. Studies in the Way of Words. Cambridge: Harvard University
Carter, R. [1976] 1988. On Linking: Papers by Richard Carter. Ed. B. Press.
Levin and C. Tenny. MIT Lexicon Project Working Papers 25, Center Grimshaw, J. 1979. Complement selection and the lexicon Linguistic
for Cognitive Science. Cambridge, MA : MIT Press. Inquiry 10: 279 326.
Chierchia, G. 1989. Structured meanings, thematic roles, and con- . 1990. Argument Structure. Cambridge, MA : MIT Press.
trol. In Properties, Types, and Meaning, ed. G. Chierchia, B. Partee, Groenendijk , Jeroen, and Martin Stokhof. 1991. Dynamic predicate
and R. Turner, II: 13166. Dordrecht, the Netherlands: Kluwer logic. Linguistics and Philosophy 14.1: 39100.
Academic Publishers. Gruber, J. S. 1976. Lexical Structures in Syntax and Semantics.
. 1995. he Dynamics of Meaning. Chicago: University of Chicago Amsterdam: North-Holland.
Press. Gunter, C. 1992. Semantics of Programming Languages. Cambridge,
Chomsky, N. [1955] 1975. he Logical Structure of Linguistic heory. MA : MIT Press.
Chicago: University of Chicago Press. Hale, K. and S. J. Keyser. 1993. On argument structure and the lex-
. 1965. Aspects of the heory of Syntax . Cambridge, MA : MIT ical expression of syntactic relations. In he View from Building
Press. 20: Essays in Honor of Sylvain Bromberger, ed. K. Hale and S. J. Keyser,
Comrie, Bernard. 1981. Language Universals and Linguistic 53109. Cambridge, MA : MIT Press.
Typology: Syntax and Morphology. Chicago: University of Chicago Heim, Irene. 1982. he semantics of deinite and indeinite noun phrases.
Press. Ph.D. thesis, University of Massachussets, Amherst.
Cooper, Robin . 2006 . A record type theoretic account of copredica- Higginbotham, J. 1989. Elucidations of meaning. Linguistics and
tion and dynamic generalized quantiication. In Kvantiikator Philosophy 12: 465 517.
for en Dag, Essays Dedicated to Dag Westerstahl on His Sixtieth Hobbs, Jerry R. 1985. On the coherence and structure of discourse.
Birthday. Avilable online at: http://www.phil.gu.se/posters/ Report No. CSLI-8537, Center for the Study of Language and
festskrift3/. Information, Stanford University.
Copestake, A. 1992. he Representation of Lexical Semantic Information . Jackendof, R. 1972. Semantic Interpretation in Generative Grammar.
Cognitive Research Paper CSRP 280, School of Cognitive and Cambridge, MA : MIT Press.
Computing Science, University of Sussex , Brighton, England. . 1983. Semantics and Cognition. Cambridge, MA : MIT Press.
. 1993. Defaults in Lexical Representation. In Inheritance, . 1990. Semantic Structures. Cambridge, MA: MIT Press.
Defaults, and the Lexicon, ed. Ted Briscoe, Valeria de Paiva, and Ann . 1992. Babe Ruth homered his way into the hearts of America.
Copestake, 22345. Cambridge: Cambridge University Press. In Syntax and the Lexicon, ed. T. Stowell and E. Wehrli, 15578. San
Copestake, A., and E. Briscoe. 1992. Lexical operations in a unii- Diego: Academic Press.
cation-based framework. In Lexical Semantics and Knowledge . 1997. he Architecture of the Language Faculty. Cambridge, MA:
Representation, ed. J. Pustejovsky and S. Bergler, 10119. New MIT Press.
York : Springer Verlag. . 2002. Foundations of Language. Oxford: Oxford University
Copestake, A., and T. Briscoe. 1995. Semi-productive polysemy and Press.
sense extension. Journal of Semantics 12: 15 67. Kamp, H. 1981. A theory of truth and semantic representation. In
Davidson, D. 1967. he logical form of action sentences. In he Logic Formal Methods in the Study of Language, ed. J. A. G. Groenendijk ,
of Decision and Action, ed. N. Rescher, 8195. Pittsburgh: Pittsburgh T. M. V. Janssen, and M. B. J. Stokhof. Mathematical Centre Tracts
University Press. 135, 277322. Amsterdam: Mathematical Centre.
Davidson, D., and G. Harman, eds. 1972. Semantics of Natural Kamp, H., and U. Reyle. 1993. From Discourse to Logic. Dordrecht, the
Language. Dordrecht, the Netherlands: Reidel. Netherlands: Kluwer Academic Publishers.
Davis, Steven, and Brendan Gillon. 2004. Semantics: A Reader. Karttunen, L. 1974. Presupposition and linguistic context. heoretical
Oxford: Oxford University Press. Linguistics 1: 18193.
Dolling , J. 1992. Flexible Interpretationen durch Sortenverschiebung. . 1976. Discourse referents. In Syntax and Semantics, ed. James
In Fgungspotenzen, ed. Ilse Zimmermann and Anatoli Strigen, D. McCawley. Vol. 7: Notes from the Linguistic Underground , 36385.
2362. Berlin: Akademie Verlag. New York : Academic Press.
33
Katz, J., and J. Fodor. 1963. he structure of a semantic theory. Partee, Barbara, ed. 1976 . Montague Grammar. New York : Academic
Language 39: 170 210. Press .
Krifka, M. 1992. hematic relations as links between nominal reference Partee, B., and M. Rooth. 1983. Generalized conjunction and type
and temporal constitution. In Lexical Matters, CSLI Lecture Notes, ed. ambiguity. In Meaning, Use, and Interpretation of Language, ed.
I. Sag and A. Szabolcsi, 2953. Chicago: University of Chicago Press. Rainer Buerle, Christoph Schwarze, and Arnim von Stechow,
Kripke, Saul. [1972] 1980. Naming and necessity. In Semantics 36183. Berlin: Walter de Gruyter.
of Natural Language, ed. D. Davidson and G. Harman, 253355. Pelletier, F. J. 1975. Non-singular reference: Some Preliminaries. In
Dordrecht and Boston: Reidel. Mass Terms: Some Philosophical Problems, ed. F. J. Pelletier, 114.
Lakof, G. [1965] 1970. Irregularity in Syntax . New York : Holt, Rinehart, Dordrecht, the Netherlands: Reidel.
and Winston. Pollard, C., and I. Sag. 1994. Head-Driven Phrase Structure Grammar.
Levin, B., and M. Rappaport Hovav. 1995. Unaccusativity: At the Syntax- Chicago: University of Chicago Press and Stanford CSLI.
Semantics Interface. Cambridge, MA : MIT Press. Potts, Christopher. 2005. he Logic of Conventional Implicatures.
. (2005. Argument Realization. Cambridge: Cambridge University Oxford: Oxford University Press.
Press. Pustejovsky, J. 1991a. he generative lexicon. Computational
Lewis, David. 1976. General semantics. In Montague Grammar, ed. Linguistics 17: 409 41.
Barbara H. Partee, 150. New York : Academic Press. . 1991b. he syntax of event structure. Cognition 41: 4781.
Lyons, John. 1968. Introduction to heoretical Linguistics. . 1995. he Generative Lexicon. Cambridge, MA: MIT Press.
Cambridge: Cambridge University Press. . 1998. he semantics of lexical underspeciication. Folia
Mann, William C., and Sandra A. hompson. 1986. Rhetorical Linguistica 32: 32347.
Structure heory: Description and Construction of Text Structures. . 2007. he Multiplicity of Meaning. Cambridge, MA : MIT Press.
ISI/RS-86174, 115. Nijmegen, the Netherlands: Information Pustejovsky, J., and B. Boguraev. 1993. Lexical knowledge repre-
Sciences Institute. sentation and natural language processing. Artiicial Intelligence
May, Robert. 1985. Logical Form: Its Structure and Derivation. 63: 193223.
Cambridge, MA : MIT Press. Recanati, Francois. 2002. Unarticulated constituents. Linguistics
McCawley, J. D. 1968. he role of semantics in a grammar. In and Philosophy 25: 299 345.
Universals in Linguistic heory, ed. E. Bach and R. T. Harms, 12469. Russell, Bertrand, 1905. On Denoting. Mind 14: 479 93.
New York : Holt, Rinehart, and Winston. Saussure, Ferdinand de. [1916] 1983. Course in General Linguistics.
Moens, M., and M. Steedman. 1988. Temporal ontology and temporal Trans. R. Harris. London: Duckworth.
reference. Computational Linguistics 14: 15 28. Searle, John. 1969. Speech Acts: An Essay in the Philosophy of Language.
Montague, Richard. 1973. he proper treatment of quantiication in ordi- Cambridge: Cambridge University Press.
nary English. In Approaches to Natural Language, ed. Jaakko Hintikka, . 1979. Expression and Meaning. Cambridge: Cambridge
Julius Matthew, Emil Moravcisk, and Patrick Suppes, 22142. Dordrecht, University Press.
the Netherlands: D. Reidel. Repr. in Montague 1974, 24770. Stalnaker, Robert. 1970. Pragmatics. Synthese 22.1/2: 27278.
. 1974. Formal Philosophy: Selected Papers of Richard Montague. Strawson, P. F. 1971. Logico-Linguistic Papers. London: Methuen.
New Haven, CT: Yale University Press. Tenny, C. 1992. he Aspectual Interface Hypothesis. In Lexical
Moravcsik , J. M. 1975. Aitia as generative factor in Aristotles philoso- Matters, CSLI Lecture Notes, ed. I. Sag and A. Szabolcsi, 127.
phy. Dialogue 14: 62236. Chicago: University of Chicago Press.
. 1990. hought and Language. London: Routledge. Tenny, C., and J. Pustejovsky. 2000. Events as Grammatical Objects.
Nunberg , G. 1979. he non-uniqueness of semantic solu- Stanford, CA: CSLI Publications. Chicago: University of Chicago
tions: Polysemy. Linguistics and Philosophy 3: 14384. Press.
Parsons, T. 1990. Events in the Semantics of English. Cambridge, MA: Weinreich, U. 1964. Websters h ird: A Critique of its Semantics.
MIT Press. International Journal of American Linguistics 30: 405 9.
Partee, Barbara. 1984. Compositionality. In Varieties of Formal . 1972. Explorations in Semantic heory. he Hague: Mouton.
Semantics, ed. Fred Landman and Frank Veltman, 281311. Williams, E. 1981. Argument structure and morphology. Linguistic
Dordrecht, the Netherlands: Foris. Review 1: 81114.
34
diglossia, culture and language, digital media, lit-
eracy). his essay is focused on the issue of the media of com-
munication and the social conditions and consequences of their
evolution. he reason is that the social practices of speech and
4 writing both depend on the available technology and lead to tech-
nological and social innovation. As has been argued by Marshall
McLuhan (1964), Elizabeth L. Eisenstein (1979), Jan Assmann
(1991), David R. Olson (1994), and Nicholas Negroponte (1995)
SOCIAL PRACTICES OF SPEECH AND WRITING
among others, civilizations are characterized by the media they
predominantly use and which shape the way they exchange,
Florian Coulmas
store, and administer information, thus exercising a profound
inluence on social practice.
he nexus between speech and writing is variable and more
obvious in some cases than in others. For instance, when the
lyrics of a song are read on the monitor and sung in a karaoke
bar, speech and writing are joined together in one activity. On
the other hand, the songs of bards handed down by word of
mouth from one generation to another are part of the culture of
INTRODUCTION spoken, as opposed to written, language (Ong 1982, Olson 1991;
oral culture, oral composition). However, the very idea
Language is constitutive for human society. As a social fact of orality is predicated on literacy and would not have become
it cannot be thought of in the abstract, for the medium of an object of study without it. Just as there is no silence without
communication is what allows it to serve social functions. he sound, illiteracy exists but in a literate society. On the face of it,
nature of the social relationship that exists by virtue of language many kinds of verbal behavior, such as speech contests, bidding
partially depends on the externalization of language, that is, on at an auction, and election campaign addresses, do not involve
how it is transmitted from one actor to another as speech, writ- writing. he institutional frameworks in which they take place in
ing, sign, or Braille. he anatomy of speech organs (cf. Liberman modern society, school, trade and government, though, rely to a
and Blumstein 1988) provides the biological foundation of very large extent on written texts. To analyze social practices of
human society in the most general sense, which is why oral speech and writing, then, it is necessary to consider technologi-
speech is considered fundamental for socialization both in the cal aspects of writing and institutional aspects of literacy.
phylogenetic and ontogenetic sense. But unless we study human
society like that of other primates from the point of view of physi-
cal anthropology, other forms of language externalization must TECHNOLOGICAL ASPECTS OF WRITING
also be taken into account as communication potential from Many social practices and ligatures of contemporary society
the beginning. here are two reasons for this. One is that the would be impossible without writing. his does not imply that the
invention of writing (sign language, Braille) cannot be undone. externalization of language by technological means is the only
he other, which follows therefrom, is that writing has brought force that shaped modern society. he assumption of an unme-
about basic changes in the nature of human communication. It diated cause-and-efect relationship between writing and social
brought in its wake a literate mindset that cannot be reversed. organization, of a watershed between primitive oral life and com-
Research about language in literate societies is carried out by plex literate civilization, is a simpliication that fails to do justice
researchers who, growing up, were socialized into a literate to the complexity of the interaction. It is surely tempting to argue
world organized by and large on the basis of literate principles. that what all great early civilizations had in common was writing
It is not fortuitous, therefore, that social practices of speech and and that it was hence writing that caused complex societies to
writing are dealt with here under one heading. come into existence. However, if we look at the uses of writing
he scientiic enterprise in general, linguistics in particular, is in early civilizations, many diferences are apparent. For exam-
a social practice involving speech and writing. Even the investi- ple, economic administration was preeminent in Mesopotamia
gation of unwritten languages happens against the background (Nissen, Damerow, and Englund 1990), whereas cult stood out
of literate society and by means of the tools developed for what in Egypt (Assmann 1991). In both cases, it is untenable to argue
Goody (1977, 151) felicitously called the technology of the intel- that accounting and the cult of the dead, respectively, were an
lect. For the language sciences, it is important to keep in mind outlow of the invention of writing. Yet the opposite proposition,
that it is not just the technicians who use a tool to do what they claiming that the demands of bookkeepers and priests led to the
need to do and want to do, but that the tool restricts what can creation of writing, is no less simplistic.
be done. his holds true for the hardware, that is, the writing Similarly, the invention of the printing press and mov-
implements, as well as for the software, the code or the writing able type has often been seen as a technological breakthrough
systems. with vast social consequences (Febvre and Martin [1958] 1999;
he social aspects of speech and writing encompass a wide Eisenstein 1979). In our day, the digital turn (Fischer 2006),
range of topics many of which are dealt with in other entries of described as the third media revolution after chirographic
this encyclopedia (sociolinguistics, discourse analysis, culture (Schmandt-Besserat 1992) and print culture
35
(Olson 1994), is regarded as a driving force of globalization can read the latest stock quotes of a scrolling light-emitting
(Kressel 2007). Both of these propositions are defensible, but diode (LED) display. Brand-new buildings are adorned with the
not in a unidirectional, monocausal sense. Equally true are the old technique of cutting messages in stone. Stelae with com-
opposite propositions, that socioeconomic developments led to memorative inscriptions, gravestones, and buildings bearing
the emergence of a larger reading public, thus paving the way for the names of their owners or occupants are still being put up,
a simpler and more eicient reproduction technology than the much as in ancient times. here are hardly any material objects
copying of manuscripts, and that modern industrial society with to which writing cannot be aixed. Since the earliest times of
mass participation generated pressure for the development of a literate culture, few have been discarded and many added. he
technology of mass dissemination of information. he invention choice continues to expand. Hard surfaces made for endurance
of writing facilitated complex social organization, and the print- are stone, marble, metal, ceramics, wood, and, today, plastics.
ing press was conducive to the spread of education. However, Inscriptions are incised, engraved, etched, carved, and chiseled
writing has been a common possession of humanity for more into them as they were in the past, and malleable surfaces such
than 5,000 years and the printing press for half a millennium, if as moist clay and molten metal are impressed or molded into
we disregard the use of cast-metal movable type in Korea in the shape.
early thirteenth century. In addition to monumental inscriptions, writing is found on
Yet we are living in a world with hundreds of millions of adult various other surfaces, such as whitewashed walls, street signs,
illiterates, even, or rather particularly, where writing irst emerged, posters, billboards, handbills, notice boards, memorial plaques,
that is, in Mesopotamia, in Egypt, in China, and in Mesoamerica. cloth, clothing, commercials carried around by sandwichmen
According to unesco (2006), there were 781 million adult illiter- and mounted on trucks, advertising pillars, buses and other
ates worldwide and 100 million school-age children not attend- vehicles covered with commercials, shop windows, and digital
ing school in 2005. In spite of the uneven distribution of illiterates display panels. hese and some other surfaces, such as palm
in the world, these igures suice to discredit the notion that a leaves, papyrus, parchment, and wax tablets that have gone
new technology of communication of and by itself brings about out of fashion, are variously suitable for realizing the functional
social change. Economic development, social structure, ethnic potential of writing.
and linguistic composition, fecundity, ideology, and tradition Two fundamental functions of writing are memory support
are intervening variables that determine how a society makes use and communication. hey are not mutually exclusive, but dif-
of and adjusts to a new technology. It is necessary, therefore, to ferent surfaces lend themselves better to one than to the other.
reckon with the contemporaneity of diferent levels of develop- Hard surfaces answer the requirement of durability. hey are
ment, diferent technologies, and diferent literacies. Assuming inscribed only once, but with a lasting trace that can be recov-
a dialectic relationship of mutual inluence between writing and ered after years, decades, even millennia. Baked clay tablets,
social change is a more promising approach for understanding the hallmark of cuneiform civilization, and mural inscriptions
the transition from oral to literate society. on Egyptian monuments embody this type. Memory is in time
New technologies both respond to practical needs and create turned into history, the recording of the past and the collection
new practices. Any technology is an artifact, but to conclude that of knowledge, which are the cardinal functional character-
its creators rule over it is a fallacy, for the applications of techno- istics of this technology. Inscriptions on hard surfaces are, of
logical innovations are often recognized not in advance but after course, also communicative but stationary. Clay tablets can be
the fact, when they have been used for some time. Like the genie transported in limited numbers only, and monumental inscrip-
let out of the bottle, they may have unplanned and sometimes tions have to be visited to be read. In order to allow written
unwelcome consequences. he material and functional proper- signs to travel and thus to realize a potential that fundamen-
ties of writing technologies determine their potential uses, which, tally distinguishes writing from speech, freeing the message
however, are not necessarily evident at the outset. from the copresence of sender and receiver, lighter materials
he locus of writing is the city. Even a supericial look at are needed. In antiquity, three main writing surfaces met this
present-day urban environments reveals that city dwellers are requirement: papyrus, parchment, and paper.
surrounded by written messages wherever they go. Of late, this For millennia, Egypt was practically the only producer of
has given rise to a new branch of scholarship known as linguis- papyrus because the reed of which it is made grows in abun-
tic landscape research (Landry and Bourhis 1997; Backhaus dance along the banks of the Nile. he papyrus scroll hieroglyph
2007), as it were, a social epigraphy for posterity. he variety is attested in the most ancient known Egyptian inscriptions, and
of writing surfaces on which the literate culture of modern the oldest papyrus fragments covered with writing date from the
cityscapes manifests itself is striking. It testiies to the traces of third millennium b.c.e. Papyrus came to be commonly used for
history in the present and to the contemporaneity of diferent documentary and literary purposes throughout Greece, Asia
stages of development, for it includes some of the oldest mate- Minor, and the Roman Empire. As of the fourth century c.e.,
rials used for writing side by side with the most recent devices. parchment (processed animal hide), a more durable writing
his contemporaneity is one of the foremost characteristics material than the brittle papyrus, began to be more widely used in
of writing. For writing arrests change and enables accumula- Europe, where the scroll was gradually edged out by the book in
tion of information. Some genuine monuments from antiquity codex form (Roberts and Skeat 1983). he word paper is derived
speak to us today, such as the Egyptian obelisk of Ramses II of from papyrus, but paper making is quite diferent from papyrus
the 19th Dynasty, 13041237 b.c.e., re-erected on the Place de la making. It was invented by the Chinese some 1,900 years ago
Concorde in the center of Paris. Around the corner, the passerby (Twitchett 1983). he earliest Chinese documents on paper date
36
Social Practices of Speech and Writing
from the second century c.e. In the wake of the Islamic expan- digitalized contents would require some 750 gigabyte (GB) stor-
sion to Central Asia, the Arabs acquired the paper-making tech- age space, which easily its on an external hard disk the size of
nology in the eighth century c.e., which they in turn introduced a small book. As compared to print, digital information storage
to Europe in the eleventh century. Relatively cheap, lexible, and thus reduces the necessary physical space by a factor of 50,000.
convenient to carry, paper replaced parchment as the principal Again, this is a coarse measure only. here is considerable
writing surface in Europe and in other parts of the world. variation in the bytes per book page, both in actual fact and
Since its invention in China, paper, which Pierre-Marc De in model calculations, but the correlation between informa-
Biasi (1999) called the greatest invention of all time, gave a tion amount and storage space of print media and digital media
boost to the production of written text wherever it was intro- transpires from it.
duced. In China, it was used for block printing as of the seventh In sum, as clay was followed by paper and paper by digi-
century. In the tenth century, the entire Buddhist scripture was tal storage media, information density per square centimeter
printed using 130,000 printing blocks (Taylor and Taylor 1995, increased exponentially, while the weight and size of written
156). Paper was the irst writing material that spread around the records decreased. It became, accordingly, ever easier to store
world. In the West, Johannes Gutenbergs invention of print- and to transmit written text with many consequences for read-
ing with movable type would hardly have had the same impact ing and writing behavior, for reproducing text, and for the role
without it. Of the 180 copies of the Bible he printed, 150 were on texts play in everyday life. What the history of writing shows is
paper and only 30 on parchment, one indication of the impor- that new technologies do not always replace old ones. Rather,
tance of paper for the dissemination of written text. Its position the new supplements the old and often transforms its use. For
in this regard is undiminished. he paperless oice is far from a instance, parchment was marginalized by paper but for centu-
reality even in the most advanced countries; rather, many ancil- ries never completely driven out; print never replaced handwrit-
lary devices that presuppose writing on paper form the mainstay ing and has not become obsolete by texts typed on a cell phone
of the thriving oice machines industry: printers, scanners, copi- keypad. Advances in writing technology have greatly expanded
ers, and fax machines. Although nowadays paper holds only a the repertoire of tools that humanity has acquired for handling
tiny fraction of all new information released, it is still the unchal- information in the form of written language. he fact that old
lenged surface for the formal publication of information. World technologies continue to be used side by side with new ones
paper consumption for information storage and distribution is at testiies not just to inertia and path dependency but also to the
an all-time high. Notwithstanding the shift of many periodicals diferent properties of the materials used. For centuries after
and scholarly journals to online publication most continue to the introduction of paper, it was considered too feeble a mate-
be printed on paper for archival purposes, for paper has a much rial to hold contracts and other important documents, which
longer duration of life than can be guaranteed for any digital were preferably executed on parchment. Similarly, although it
storage medium. is technically possible to keep birth registers as electronic iles
his brings to light a more general trade-of of information only, birth certiicates on paper continue to be issued. One of the
processing. Weight and storage capacity are inversely related. A reasons is that digital information storage is subject to physical
tablet measuring about 10 10 cm is the ideal size for writing on decay and technical obsolence not less but much more than pre-
wet clay. It holds about 300 characters. Depending on the thick- decessor media.
ness of the tablet, this yields an information/weight ratio of .2 kg Writing has made record keeping for posterity and accumu-
to 1 kg per 1,000 characters. A text of 300,000 characters would lation of knowledge possible. However, with the introduction of
weigh between 200 kg and 1000 kg. In modern terms, that would every new writing material, the storage problem that it seemed to
be a short book of fewer than 190 pages, assuming an informa- solve became more acute. An archive of records on paper takes
tion density of 1,600 characters per page. Give it a solid cover up much less space than one for clay tablets, but it is beset by
and it comes to a total of 250 g. With respect to the information/ dangers that pose no threat to baked clay: dust, humidity, insects,
weight ratio, paper thus outperforms clay by a factor of 4,000. ire, and water. heft, too, is a greater threat to libraries than to
Such a rough-and-ready calculation may suice to illustrate clay tablet archives and a greater threat to computers than to
the point. Papyrus was similarly superior to clay with regard to libraries. Keeping books in a usable physical state requires more
storage capacity and transportability; however, many more clay work than keeping clay tablets. he same kind of relationship
tablets than papyrus documents have come down to us through holds between libraries and digital data archives. Much more
the millennia. How many papyrus rolls were lost in the legendary can be stored, but preservation for future use becomes ever more
blaze of the library of Alexandria is not known, but when in 2004 diicult as time intervals of technical innovation shrink. Only a
a ire broke out in the Anna Amalia Library in Weimar, 50,000 few specialists are able to handle data stored with software that
books were destroyed, many of them unique or rare. Another ceased to be produced 20 years ago, whereas a book would last
65,000 volumes were severely damaged by ire and water. Baked for centuries. he problem of preserving and organizing the ever-
clay tablets would have withstood the lames and the water used swelling lood of information remains unsolved, and at the same
to put them out. time many traditional libraries and collections of documents fall
his line of thought can be extended into the digital age into decay. Technology has hugely expanded human memory,
by another calculation. Computer technology has exponen- but it has not yet eliminated the risk that many parts of the heri-
tially increased information storage density. he 50,000 burnt tage committed to writing will disappear forever.
books of the Anna Amalia Library took up some 1,660 meters of To guard against collective memory loss, the United Nations
shelf space. Assuming an average of 300 pages per book, their Educational, Scientiic and Social Orgnization (UNESCO) has
37
launched the Memory of the World Programme to assist in the years memorizing the Confucian classics and commentaries.
preservation of archive holdings and library collections all over hey were then able to recite, understand, and interpret every
the world. For the time being, this is the endpoint of a develop- clause of the ive canonical works Book of Changes, Book of
ment begun with the advent of literacy in ancient civilizations: the Documents, Book of Poetry, Records of Rites, and Spring
institutionalization of writing and the bureaucratization of soci- and Autumn Annals said to have been redacted by Confucius
ety. he more serviceable writing became to human society, the himself, as well as a collection of commentaries by subsequent
more it penetrated social relations and the more attention it scholars.
came to require on the part of society. hat such an education was an adequate preparation for
bureaucrats charged with administering the country was rarely
called into question. It was irmly rooted in the past, and the clas-
INSTITUTIONAL ASPECTS OF LITERACY
sics were thought to hold a solution to any problem that might
From its inception, writing has been an instrument of power. In arise. he authority of writing and the conservatism of literature
ancient civilizations of restricted literacy, its mastery was jeal- were never more efective. he strength of the system lay in the
ously guarded by the elite. It was indispensable for the work- fact that it encouraged respect for learning and provided the
ings of the temple-centered economies of ancient Near Eastern emperor with a bureaucracy educated in one standard curricu-
city states (Nissen 1988), symbolized the rule of the pharaohs in lum. Its weakness was its emphasis on commentary that stiled
Egypt (Posener 1956), became the bedrock of Chinas Confucian inquisitiveness and deviation from the trodden path. he civil
bureaucratic state (Lewis 1999), and was a sine qua non of service exam system institutionalized the transmission of texts
Athenian democracy (homas 1992). Certainly, literacy levels down a lineage and was, thus, inherently averse to change. In its
varied widely as did the uses of literacy, but the general ten- early days, it helped to loosen the hereditary aristocracys grip
dency of the extension of the human mind by means of writing to on power by rewarding merit rather than birth for recruiting
engender and necessitate institutions is unmistakable. he most bureaucrats. In actual fact, however, learning remained largely
important institutions produced by literate culture have to do a prerogative of aristocratic families out of reach for most com-
with government, cult, schooling, and economic organization. moners. Women were not permitted to sit for the exams. In the
end, the civil service examinations served as a system to perpetu-
ate the power of the thin elite of literati bureaucrats.
Government
Controlling literacy has always been the other side of its rela-
Writing was used early on to extend the reach of authority and to tion to authority. he powerful have lived in fear of the power
mediate the relationship between ruler and ruled. Monuments of the pen and have had little interest in promoting the art of
that embody power, such as the Rosetta Stone inscribed with a writing among the masses. Illiterates are powerless, unable to
decree to commemorate the reception of rulership by Pharaoh challenge the letter of the law or to invoke the laws on the books
Ptolemy V on March 27, 197 b.c.e. (Parkinson 1999, 29), as well to their own advantage. hat which is written down acquires
as stelae appealing with regulations to the literate public, were authority in its own right as a reference source independent of
erected throughout the Ancient Near East. heir inscriptions the ruler. While helping to project power far beyond earshot, it
were drafted by scribes who created the irst bureaucratic states. gains a measure of objectivity, thereby reducing the arbitrari-
he Egyptian vizier was responsible for the collection of taxes, ness of rule. But only the literate can hold the ruler accountable
the maintenance of archives, and the appointment of oicials. to his own decrees. he institutionalization of writing to this end
he skills of the scribe aforded privilege in Egyptian society. As occurred in ifth-century Greece where government was estab-
one of them put it, the scribe is released from manual tasks; it lished in the polis through written laws. hese were not God-
is he who commands (Goody and Watt 1968, 37). A thousand given but man-made laws, aiding the development of a division
years later, Mencius (372289 b.c.e.) made the same point in between cosmic order and human society (Stratton 1980), so
China: Some labour with their hearts and minds; some labour characteristic of the Greek Weltanschauung and so diferent
with their strength. hose who labour with their hearts and from the Chinese.
minds govern others. hose who labour with their strength are From the objectiication of language in writing follows
governed by others (Book of Mencius, quoted from Lloyd and another function with important implications for the exercis-
Sivin 2002, 16). hese observations bring to light the connection ing and curbing of power. Writing detaches the author from
between literacy and social hierarchy that persists to this day. his message, which makes it easier and less risky to express the
Wherever societies obtained the art of writing, the literati unspeakable. Two examples suice to illustrate. In Athens, ostra-
were close to the powers that be, but the Chinese institutional- cism was institutionalized as a procedure to protect democratic
ized literacy like no other culture. In Confuciuss day, literacy was rule. In the event that a charismatic politician (a demagogue)
already the preeminent mark of the gentleman, and as of the sec- became too inluential or was otherwise disruptive, the demos
ond century b.c.e., the study of the Confucian classics gradually (people) were entitled to exile him from the polis. To this end,
became institutionalized as the key to attaining public oice and every citizen was given a potsherd (ostrakon) on which to write
inluence. he civil service examination system was employed the name of the man to be sent into exile. he degree of literacy in
with few adjustments from the Han Dynasty (206 b.c.e.220 c.e.) Athens that can be inferred from this practice is a question that
until the closing stages of the Qing Dynasty at the beginning of has given rise to much research and speculation (Burns 1981;
the twentiethth century. It was based entirely on the study of Harvey 1966; Havelock 1982; W. Harris 1989; homas 1992). It is
texts. To prepare for the highest degree, students spent up to 20 unlikely that it will be possible ever to quote even approximately
38
correct igures, but what we know of the literate culture of the sacred and the profane may take a long time to complete. he
Greek city-states is that, unlike in China, there was no scribal Ten Commandments, written by God and given to Moses, were
class. Minimal competence such as was necessary to scratch a a code of conduct regulating the relations between God and the
name on a potsherd was relatively widespread. Both written law people, as well as the people among themselves. he spheres of
as the basis of social order and ostracism as a check on power spiritual and worldly power were only beginning to be separated
exemplify the institutionalization of writing as a precondition in antiquity, a process to which writing and the ensuing inter-
for political participation and as a seed of the public sphere. It pretation of texts contributed a great deal. Of Moses legendary
enabled the people to express discontent with a leader without Ten Commandments stone tablets no archaeological evidence
raising their hand individually or speaking out. Anonymity was remains, but the Aoka edicts are a tangible example of the
a protection from reprisals. closeness of cult and social order. Engraved in rocks and stone
he other example is from Babylon, as reported in the pillars that have been discovered in dozens of sites throughout
Old Testament. Writing means empowerment and, there- northern India, Nepal, Pakistan, and Afghanistan, these edicts
fore, has to be controlled. he proverbial writing on the wall, served to inform the subjects of King Aokas (304232 b.c.e.)
mene-tekel-parsin, an Aramaic phrase that, according to Daniel reforms and make them lead a moral life according to the truth
5:25, mysteriously appeared on the wall of King Belshazzars pal- of Buddhism. Moral precepts hewn in stone on public display
ace, cautioning him that his days were numbered and his empire once again raise the question of the degree of literacy. It must
was doomed, exempliies the potential of criticism. A message have been high enough to disseminate the message through-
by an unknown author exposing abuse of power to the public is out the vast realm. In the rich spiritual world of the Indian sub-
a direct challenge to authority. While it would be problematic to continent, much emphasis was always given to the importance
voice disapproval in the presence of others, writing afords the of oral tradition. Competing with other religious movements,
originator the protection of anonymity. Buddhism established itself by rigidly regulating monastic life
he dialectics of technological innovation come to bear and assembling a canon of scriptures. At the time of Aoka,
here. While the mighty use writing to establish, exercise, and Buddhism was an institutionalized religion that spread from
consolidate their power, it also lends itself to embarrassing and northern India to other parts of the subcontinent, irst and sub-
undermining them. his was clearly understood in antiquity. sequently to South and Southeast Asia, as well as Central Asia
Power holders always tried to curtail the power of the written and China.
word. Confuciuss Analects were burned in 200 b.c.e. on order he history of Buddhism and its emergence as a major world
of Emperor Pinyin Qin Shi Huang Di. Plato was convinced that religion is a history of translation. Translation means writ-
the state had to control the content of what pupils read. Tacitus ing, turning one text into another and all it involves: exegesis,
refers to banned books (Anales 4,34), and his own books were doctrine, scholasticism, and schism. As Ivan Illich and Barry
later banned in return. Censorship is ubiquitous throughout Sanders (1988: 52) have argued, there is no translation in oral-
history (Jones 2001). Many of the inest literary works were ity, but only the collaborative endeavor to understand ones
at one time or another put on a list of forbidden books, such partner in discourse. Writing eternalizes spiritual enlighten-
as the papal Index Auctorum et Librorum Prohibitorum, irst ment (e.g., the word of God), which must be preserved with
published in 1559. It took another 500 years for censorship to the greatest care and does not allow alteration at will. Of all
be universally censured as an illegitimate means of exercising religions, Buddhism has produced by far the largest corpus
power. Article 19 of the Universal Declaration of Human Rights of sacred texts. Signiicantly, diferent canons resulted from
adopted by the United Nations in 1948 states that everyone has translation, the Pali canon, the Tibetan canon, and the Chinese
the right to freedom of expression and information, regardless canon. With translation came schism, and with that the delimi-
of frontiers. tation of religious sects and districts. In the evolution of other
he struggle against censorship worldwide is far from over. book religions, translation had similar consequences. hese
With every new writing technology it is rekindled, as the debate other religions share with Buddhism the vital importance they
about controlling the Internet illustrates. he power elites try to attach to scriptures. he holy book is what distinguishes world
control the new media, which they perceive as a threat, although, religions from other cults. heir legitimacy is derived from the
ironically, they are often involved in its development. he battle- revelation embodied in the scriptures, which are considered the
ield shifts with technological advance. What were scriptoria and true source of enlightenment. he major world religions vary in
publishing houses in the past are servers and the low of data the role they assign sacred texts, in how they make use of them
through cyberspace today. In the long run, attempts on the part for purposes of propaganda and the regulation of life, but the
of governments to defend their lead and keep new information reverence accorded to writing is a feature they share, as is the
technology out of reach of their adversaries have failed because question of translation.
it is the very nature of these technologies that they can be utilized Translation is a problem for two reasons that can be labeled
to uphold and to counter the claim to power. authenticity and correspondence. First, the idea of authentic-
ity of the word of God does not agree with its transposition into
another form. Some book religions are very strict in this regard.
Cult
According to believers, God himself chose the Arabic language
In nonliterate societies religious, order and social order are for his inal testament, the Quran, and created every one of the
merged without clear division. Writing tends to introduce a letters of the Arabic alphabet. Consequently, the only true ver-
fracture between spheres, although the diferentiation of the sion of the holy book of Islam is the Quran in Classical Arabic.
39
he Christian tradition until the Reformation movement knew languages or varieties of the same stock. In multilingual envi-
similar limitations, recognizing only the three sacred tongues of ronments like medieval Europe, where Latin was the only writ-
Hebrew, Greek, and Latin as legitimate languages of the Bible. ten language, or in present-day India, cultivated languages are
In many cases, other languages were considered unit for the often reserved for writing and formal communication, while ver-
expression of divine revelations, if only because they lacked the nacular varieties are used in informal settings. A similar division
ixity that comes with being reduced to writing. Indeed, trans- is found between linguistic varieties of the same stock, where
lations of religious scriptures, when they were eventually pro- one is deined by reference to a corpus of classical texts, such as
duced, were for many languages their irst literary texts, serving Classical Arabic, whereas the others luctuate without artiicial
as an anchor in the luidity of oral discourse and as a starting restriction.
point of literacy and textual transmission. Writing introduces an element of art and artiiciality into the
he other reason for the problematic nature of translation history of language. Every language is the collective product of
is that through it, a stable correspondence is to be established its speakers, but a written language is more clearly an artefact
between two codes that though obviously diferent in lexical, than a vernacular, and the script that it uses more clearly yet.
grammatical, and phonetic makeup, must express the same Historically, the difusion of scripts coincided in large measure
contents. In order to translate, the meaning of the text has to with that of religions, a connection that is still visible today.
be established unequivocally, the assumption being that this Chinese characters arrived in Japan together with Buddhism.
is possible. he text is thus elevated to the source of meaning. he spread of Roman traces the expansion of both the Roman
Authority that formerly accrued to the sage, the reciter, and the Empire and the Catholic Church, while Orthodox Christianity
soothsayer was relocated in text. his transition from utterance uses Cyrillic. Armenian Christians have their own alphabet
to text (Olson 1977) implies that in order to understand a mes- designed in the ifth century by St. Mesrob. Estrangela is the script
sage, at issue no longer is what the speaker means but what the of the Syrian Church. he Hebrew square script is the script of
text contains. his is what translation is all about. Language itself Judaism, the Arabic alphabet that of Islam. Many other examples
in objectiied form thus becomes a force in its own right with far- could be added; clerks were churchmen (Coulmas 1996, 435 f.).
reaching implications in the domains of knowledge production he historical interconnectedness of writing and religion is one
and social regulation. Preservation of the word of God in text of the reasons that scripts tend to function as symbols of iden-
has provided an objectiied reference plane incorporating the tity, but ethnic, national, and political identity are also readily
true meaning waiting to be extracted from it. Olsons notion expressed by means of a distinct script or even slightly diferent
of autonomous text that results from the transition from utter- orthographic conventions. As Peter Unseth (2005) has pointed
ance to text as he conceptualizes it has been criticized because out, there are clear sociolinguistic parallels between choos-
it ignores the readers involvement in constructing meaning ing scripts and languages. Because of the visibility and the artii-
(Nystrand 1986) and because it underestimates the oral ele- cial nature of writing, however, the choice of scripts is generally a
ments in literate culture (Wiley 1996). he soundness of these more deliberate departure from tradition in that it involves con-
criticisms in detail, however, does not invalidate the general idea scious planning.
that writing gives language a measure of stability that it does not
have in speech, and brings with it a shift from the intentional to
Schooling
the conventional aspects of linguistic meaning, a shift from I
mean to the word means. he high prestige that the written Language is a natural faculty, writing an artifact. hat is the rea-
word acquired through its association with and instrumentaliza- son why children acquire language, but not writing, without
tion by organized religion has greatly contributed to the coming guidance. he diicult art of writing requires skills that must be
into existence of autonomous text. taught, memorized, and laboriously practiced. he place to do it
he reverence for holy books had various consequences for is school. For writing to be useful to the community, conventions
language attitudes and practices, two of which can be men- have to be established, individual variation curtailed, norms set.
tioned here: code choice and script choice (Coulmas 2005). Collective instruction following a curriculum is a more eicient
Writing introduces functional domain diferentiation into a way to achieve this than is private tutoring. Already in antiquity,
communitys linguistic ecology. hat the language of cult difers school became, and still is, the institution that most explicitly
from that of everyday pursuits has always been the rule rather exercises authority over the written language by controlling its
than the exception, but with writing the distinction becomes transmission from one generation to the next.
more pronounced. he important position of religious texts in With schooling came the regimentation and the decontex-
combination with restricted literacy encouraged the perpetu- tualization of language. Because in writing the continuous low
ation of the split between spoken and written language. While of speech has to be broken down into discrete units, analytic
the codiication of the latter was aided by the desire to transmit relection about the units of language was fostered. As writing,
the sacred texts inviolately to later generations, the former was language became an object of investigation and normalization.
subject to perpetual change. he result was a situation of coexist- Both grammar and lexicon are products of writing. his is not to
ing codes, called diglossia in modern scholarship (Ferguson deny the grammaticality of oral speech or that oral people have
1959; Krishnamurti 1986; Schifman 1996). Although every case a mental lexicon. It just means that the notions of grammar and
of diglossia is diferent, the deining characteristic is a domain- lexicon as we know them are entirely dependent upon writing.
speciic usage of varieties that coincides by and large with the At school, units of writing had to be practiced with the stylus, the
spoken/written language divide. hese varieties can be diferent brush, or the pen, mechanically through repetition without any
40
communicative intent. hese units could be given a phonetic issue. In medieval Europe, the ad litteras reform during the reign
interpretation; they could be pronounced and thus acquired of Charlemagne aimed at unifying spoken and written language,
as words, an existence as units of language. In time, the image as the widening gap between both was perceived as a problem.
became the model. Since the correct form of the written sign was It was eventually reduced, not so much by enforcing a uniform
a matter to which the scribe and his pupils had to pay attention, standard for pronouncing Latin than by dethroning it as the only
standards of correctness irst developed with reference to writing written language and transforming lingua illiteratae (Blanche-
and written language. Benveniste 1994), that is, Romance, Germanic, and Slavonic
Only much later, and as an efect of schooling, did these vernaculars, into written languages in their own right. Literacy
notions come to be applied to speech. he twin questions of in these emerging national languages was bolstered by the
what the units of writing were and how they were to be conjoined Reformation movement that wrested the interpretation monop-
led to the more general and profound question What is a lan- oly of Christian scriptures from the Catholic clergy. Write as you
guage? Right up to the present, answers to this question exhibit speak, a maxim that can be traced back to antiquity, became
what Roy Harris (1980) has called a scriptist bias. Only trained an increasingly important principle for teaching writing
linguists readily recognize unwritten vernaculars as languages, (Mller 1990). Although it unrealistically denies the speech/
and even they have to admit that while it is easy to count written writing distinction, generations of teachers have repeated it to
languages, the question of how many speech forms on this planet their pupils. It never meant that their writing should be as ellip-
qualify as distinct languages is impossible to answer without lay- tical, situation-bound, and variable as their speech. he impli-
ing down analytic criteria that are no less arbitrary than decisions cation is that if you cannot write as you speak, something must
as to what dialects, varieties, idioms, and speech forms should be wrong with your speech. Universal education on the basis of
be reduced to writing. Languages, as well as the units into which this maxim resulted in a conceptual reduction of the distance
they are analyzed, are a product of writing, for only in writing can between speech and writing, with some notable consequences.
the low of speech be arrested and broken down into indepen- Mass literacy through schooling led to the disappearance of
dent stable components with a presumed inherent, rather than diglossia from most European speech communities, although
vague and contextually determined meaning. the dichotomy of speech and writing continued to be expressed
Among the irst results of school mastering the language in stylistic diferences.
in the Ancient Near East were word lists, the paradigm case of In other parts of the world, where universal education was
decontextualized language. hese lists were the foundation of realized later or is still only a desideratum, the split between
lexicography (Civil 1995), the science of words. In China, lexi- spoken and written language remained. In todays world, the
cography began with lists of characters, and characters are still 1953 UNESCO declaration recommending mother tongue lit-
the basic units of dictionaries. Dictionaries provide entries for eracy notwithstanding, literacy education in the language of
lexical items. A lexical item is a word contained in a dictionary. the nation-state often means learning to read and write in a
More reined and less circular deinitions of the orthographic second language. he extent to which progress in the promo-
word, as distinct from the phonological word and the abstract tion of literacy depends on the language of instruction is still
lexeme have been proposed in great number, but it remains a matter of controversy, as is whether the writing system is a
diicult if not impossible to deine word without reference to signiicant variable. To some extent, this can be explained by
writing. he word stands at the beginning of grammatical schol- the fact that deinitions of literacy are shifting with changing
arship, which was, as the word grammatical itself indicates, socioeconomic needs and technical innovations, and because
exclusively concerned with written language. Grammatike, the range of what are considered varieties of a given language
combining the Greek words grammata (letters) and techne is variable (as is evident, for example, in the context of decre-
(art), was the art of knowing letters. hese beginnings of the olization and discussions about elevating certain varieties,
systematic study of language left a lasting imprint which, as such as black English in the United States, to language status).
Per Linell (2005) has convincingly shown, still informs modern here is, however, wide agreement that the crucial variable
linguistics. he word, the sentence, and even the phoneme is the efectiveness of the educational system. Mastering the
are analytic concepts derived from the discrete segments of written language is a diicult task, which is best executed by
writing, not vice versa. he conceptualization of writing as the institution that at the same time administers the written
a representation of speech is therefore highly problematic language: school.
(R. Harris 1980, 2000). To sum up this section, the institution- Since the time of the French Revolution, schools have been
alization of writing in school resulted in a changed attitude to charged with establishing the national language and, by way of
language. It became an object of study and regulation. Both of spreading the national language ideology, a link between lan-
these concepts were not in the irst instance developed for, and guage and nationalism. As a result, the demands of multilin-
applied to, speech. gual education are often at variance with the state-sponsored
Under conditions of restricted literacy and diglossia, a wide educational system. Because of the nationalization of languages
divide between spoken and written language was taken for in the modern nation-state and their privileged position in the
granted. Speech and writing were two modes of communication school system, however, the language of literacy training became
involving varieties or languages that were both grammatically a political issue. Minority speech communities in many indus-
and stylistically quite separate from each other. It was only when trialized countries aspired to the prestige for their language that
literacy became accessible to wider sections of the population comes with a written standard and started to lobby for the inclu-
that the relationship between speech and writing became an sion of their language in the school curriculum. Fueled by the
41
growing awareness of the importance of minority protection, the relationship between writing and economy is not a simple
such movements have met with a measure of success, leading to one. For one thing, the degree of literacy necessary for economic
a highly complex situation of multiple and multilingual literacies growth is a matter of controversy and depends on how economic
in modern societies, which of late has attracted scholarly atten- growth is measured. Illiteracy is considered a strong indicator of
tion (Martin-Jones and Jones 2000; Daswani 2001; Cook and economic underdevelopment, correlating as it does with infant
Bassetti 2005). mortality, low life expectancy, and poverty; but the question of
he prevailing view sees the establishment of a single national whether high literacy drives economic growth or is a function
language with a uniied written standard as facilitating universal thereof remains unresolved. Functional illiteracy rates of up to
literacy. International comparisons of literacy rates are notori- 20 percent in Western countries, notably in the United States,
ously diicult (Gurin-Pace and Blum 1999), but there is little suggest that at least in the developed world, literacy rates are
doubt that Europe, where national language academies irst more indicative of economic disparity than of overall national
implemented the national language ideology, led the way. Today, economic development. Similarly, in developing countries,
however, the monolingual model of literacy is called into ques- illiterates are largely rural and without access to public services
tion by developments that, on the one hand, favor English as a (Varghese 2000). he distribution of wealth and the division of
supplementary universal written language in many non-English- labor in todays national economies are such that they allow for,
speaking countries, and on the other, allow minority languages or perhaps even sustain, substantial residues of illiteracy. Both in
to make inroads into the domains of writing. he question of the developed and the developing world, people who partake of
whether the diversiication of literacy will help achieve the goal the practice of writing live side by side with others who continue
of eradicating illiteracy or whether it will compromise the alleged to conduct their life in the oral mode.
economic advantage of having one written standard language It is fair to say that the evolution of writing in antiquity both
continues to be discussed by academics and politicians, while happened in response to and propelled economic develop-
the complementary developments of globalization of markets ment. Yet although writing technology has been available for
and (re)localization of cultures unfold. economic pursuits for more than ive millennia, fully liter-
ate societies remain an ideal. his shows that the relationship
between economy and institutionalized writing is subject to
Economic Organization
interference by other variables, notably those discussed in the
he critical importance of writing for economic processes previous sections, that is, government, religion, culture, and
in early civilizations is best documented for Mesopotamia. education. In antiquity, these spheres and the economy were
It is widely agreed now that in Sumer, number concepts and not separate. It was a long process that led to their diferentia-
numerical representation stood at the beginning of writing tion in modern times. As the chief instrument of knowledge
that evolved into cuneiform (Schmandt-Besserat 1992). he management and growth, writing was indispensable for ratio-
overwhelming majority of archaeological inds from Ur, Uruk, nalization and what Max Weber (1922) called the disenchant-
Nineveh, and Babylon are records of economic transactions ment of the world, which, as the religious associations and
kept in clay tablet archives by accountants of the palace admin- emotional values attached to various written languages testify,
istration (Nissen, Damerow, and Englund 1990). he Sumerians is still incomplete.
and their Akkadian successors were the irst to develop book- he interaction of writing with the economy has been studied
keeping into a sophisticated technique of balancing income and from various points of view. Writing can accordingly be under-
expenditure. Hammurabi, King of Babylon (17281686 b.c.e.), stood as
created the irst commercial code, 282 articles written on a large
- a tool,
stone monument, which was erected in a public place for all to
observe. Every business transaction was to be in writing and - social choice,
signed, usually with a seal, by the contracting parties. At the - a common good,
time, the public-sector economy was far too highly developed - human capital,
and too complex to function without writing. Tribute quota lists
- transaction cost.
had to be kept, rations for laborers involved in public works cal-
culated, inventories recorded. Deeds were issued in duplicate Writing is a tool that enables individuals to expand their com-
and receipts stored for future reference. Large-scale trading, munication range and communities to increase social integra-
often involving credit and futures, had to be regulated and tion and differentiation. It is useful and valuable. Since writing
overseen by the bureaucracy, consisting of a huge scribal class systems, as discussed previously, are artifacts, this raises
charged with creating and servicing these documents. the question of how this tool evolves so that it is functional.
Ancient Mesopotamia is the paradigm case of the close inter- Students of writing, notably I. J. Gelb (1963), have invoked
connectedness of economy and the institutionalization of writ- the principle of least effort, which predicts that in time, writ-
ing, but if economic behavior is understood in the wide sense of ing systems become simpler and easier to use. The relative
human adaptations to the needs and aspirations of society at a simplicity of the Greek alphabet, as compared, for example,
given moment in history, this interconnectedness can be identi- with Egyptian hieroglyphs and early cuneiform, lends support
ied in every literate society. Complex socioeconomic systems of to this hypothesis. However, intuitive though it is, there are
managed production, distribution and trade, taxation, and credit problems. Obvious counterexamples are found outside the
did not evolve in cultures that had no writing. Yet the nature of ancient Near Eastern civilizations mainly studied by Gelb. In
42
its long history, the Chinese writing system and its adaptation pertinent than ever (Levine 1994) and can explain processes such
in Japan increased rather than decreased in complexity. On as the accelerated spread of English. he globalization of mar-
the other hand, it took 500 years for the much simpler Korean kets and the information technology revolution ofer ever more
Hangul to be accepted as an alternative to Chinese. If sim- people the opportunity to enhance their human capital and, at
plicity and adaptability were the forces that drive the spread the same time, compel them to do so. However, the commodii-
of writing systems, Hangul should have supplanted Chinese cation of written language (Heller 2003) and the new forms and
characters, not just in Korea but in China and elsewhere long uses it takes on in the new media have consequences, some of
ago. Looking at the evolution of individual systems, certain which become apparent only as technology spreads.
letter forms in the Tibetan script were simplified to the extent Conceptualizing written language as transaction cost brings
of becoming indistinguishable, rendering the system almost market forces into view. Reducing transaction costs is consid-
dysfunctional. One has to conclude that if the principle of ered a quasi-natural precondition of economic growth, which
least effort is at work in the evolution of the technology of the partly explains the spread of trade languages. Once in place and
intellect, then it is often compromised by other principles, controlled by relevant agents, their use is less costly than transla-
such as path dependency, that is, the fact that change in soci- tion. In todays world, this principle seems strongly to favor the
ety depends on established ways, identity affirmation (see further spread of English. However, the efects of technological
ethnolinguistic identity ), and cultural exclusivism. innovation are, as always, hard to foresee and even to assess
Considering writing and written language from the point when it unfolds in front of our eyes. When the Internet ceased
of view of social choice leads to a similar conclusion. A writ- to be a communication domain reserved to the U.S. military,
ten language can be understood as an institution with certain partly due to the available software at the time, it seemed to be
properties that is shaped by the agents involved in its opera- poised to become an English-only medium. But it turned out
tion. However, a single criterion of optimization of reasonable that as the technology caught on, the share of English in cyber-
target functions cannot explain the diversity of systems that space communication rapidly declined. he new information
evolved, the multiplicity of uses, or the actual shape of indi- technology made it much easier for speech communities big and
vidual systems, such as, for example, English spelling or the small around the world to communicate in writing, a possibility
Japanese usage of Chinese characters (Coulmas 1994). Clearly, eagerly exploited wherever the hardware became available. For
established writing conventions are the result of public choice. some communities this meant using their language in writing for
No individual can introduce a better standard, even though its the irst time. In others it led to the suspension of norms, blur-
superior qualities may be indubitable, for conformity with the ring, in many ways yet to be explored, the traditional distinctions
existing norm is crucial for the functionality of the institution. between spoken and written language.
Penalties on norm violations are therefore high, and proposals David Crystal (2001) suggests that Netspeak or computer-
for changing established practice invariably meet with strong mediated language is a new medium, neither spoken language
resistance. Change is not impossible, but it comes at a cost and nor written language. he implications of Internet communica-
rarely uncompromisingly follows functional optimization. tion for literacy in the electronic age are only beginning to be
A written language, if used by a community, has properties of explored (Richards 2000). New multilayered literacies are evolv-
a common good. Like a public transport system, it is beneicial ing, responding as they do to the complementary and sometimes
for most members of that community and therefore deserving of conlicting demands of economic rationality, social reproduc-
the attention of the state. his is the rationale underlying the legal tion through education, ideology, and the technical properties of
protection enjoyed by national languages, explicitly or implicitly, the medium. hese developments open up a huge new area of
in modern nation-states. A written language not used by every- research into how, since the invention of writing, the range of lin-
one is not a common good and treated accordingly. Not serving guistic communication options has been constantly expanding.
anyones interest, dead languages are of interest to the historian
at best. For the same reason, it has always been diicult to ele-
CONCLUSION
vate a vernacular to the status of written language; not providing
access to information and not being used by many members of Writing is a technology that interacts with social practices in
the community in the beginning, it does not count as a common complex ways, exercising a profound inluence on the way we
good. Its claim to written language status is supported not by its think, communicate, and conceptualize language. Since it is an
instrumental value but only by the symbolic value for its commu- artifact, it can be adjusted deliberately to the evolving needs
nity. To reconcile the idea of a dominant language as a common of society, but it also follows its own inherent rules that derive
good with the recognition of minority rights is therefore prob- from the properties of the medium. his essay has analyzed the
lematic. With Pierre Bourdieu (1982), it can be conceptualized tension between the properties of the medium and the designs
as a struggle over legitimacy, competence, and access (market, of its users from two points of view, technological advance and
linguistic; habitus, linguistic). Only if social harmony is institutionalization. Harnessed by institutions, the technology
made part of the equation will accommodation be possible. of writing is made serviceable to the intellectual advance of
A written language, the cognitive abilities its mastery implies, society and modiied in the process, sometimes gradually and
and the information to which it provides access are a resource. sometimes in revolutionary steps. hree consequences of writ-
Partaking in it adds to an individuals human capital and hence ing remain constant: segmentation, linearization, and accu-
to his or her marketability. In the information economy of mulation. he linear and discrete-segmental structure that all
the knowledge society, this view on written language is more writing systems both derive from and superimpose on language
43
forever informs the perception of language in literate society. Harvey, F. D. 1966. Literacy in Athenian democracy. Revue des tudes
And by making language visible and permanent, it enables and Grecques 79: 585635.
compels its users to accumulate information far beyond the Havelock, Eric A. 1982. he Literate Revolution in Greece and Its Cultural
capacity of human memory, engendering ever new challenges Consequences. Princeton, NJ: Princeton University Press.
Heller, Monica. 2003. Globalization, the new economy, and the com-
for storage and organization.
modiication of language and identity. Journal of Sociolinguistics
7.4: 47392.
WORKS CITED AND SUGGESTIONS FOR FURTHER READING Illich, Ivan, and Barry Sanders. 1988. he Alphabetization of the Popular
Mind. San Francisco: North Point.
Assmann, Jan. 1991. Stein und Zeit: Mensch und Gesellschaft im alten
Jones, Derek, ed. 2001. Censorship: A World Encyclopedia. Vols.14.
gypten. Munich: Wilhelm Fink Verlag.
London: Fitzroy Dearborn.
Backhaus, Peter. 2007. Linguistic Landscapes: A Comparative Study of
Kapitzke, Cushla. 1995. Literacy and Religion: he Textual Politics and
Urban Multilingualism in Tokyo. Clevedon, UK: Multilingual Matters.
Practice of Seventh-Day Adventism. Amsterdam and Philadelphia: John
Blanche-Benveniste, Claire. 1994. he construct of oral and written
Benjamins.
language. In Functional Literacy: heoretical Issues and Educational
Kressel, Henry, with homas V. Lento. 2007. Competing for the Future
Applications, ed. Ludo Verhoeven, 6174. Amsterdam and
How Digital Innovations Are Changing the World. Cambridge:
Philadelphia: John Benjamins.
Bourdieu, Pierre. 1982. Ce que parler veut dire. Paris: Fayard.
Krishnamurti, Bh., ed. 1986. South Asian Languages: Structure,
Burns, Alfred. 1981. Athenian literacy in the ifth century B.C. Journal of
Convergence and Diglossia. Delhi: Motilal Banarsidass.
the History of Ideas 42.3: 37187.
Landry, Rodrigue, and Richard Y. Bourhis. 1997. Linguistic landscape
Civil, Miguel. 1995. Ancient Mesopotamian lexicography. In
and ethnolinguistic vitality. Journal of Language and Social Psychology
Civilizations of the Ancient Near East, ed. Jack M. Sasson, 230514. New
16.1: 2349.
York: Charles Scribners Sons.
Levine, Kenneth. 1994. Functional literacy in a changing world. In
Cook, Vivian, and Benedetta Bassetti, eds. 2005. Second Language Writing
Functional Literacy: heoretical Issues and Educational Applications,
Systems. Clevedon, UK: Multilingual Matters.
ed. Ludo Verhoeven, 11331. Amsterdam and Philadelphia: John
Coulmas, Florian. 1994. Writing systems and literacy: he alpha-
Benjamins.
betic myth revisited. In Functional Literacy: heoretical Issues and
Lewis, Mark Edward. 1999. Writing and Authority in Early China. SUNY
Educational Implications, ed. Ludo Verhoeven, 30520. Amsterdam
Series in Chinese Philosophy and Culture. Albany: State University of
and Philadelphia: John Benjamins.
New York Press.
. 1996. he Blackwell Encyclopaedia of Writing Systems.
Liberman, Philip, and Sheila Blumstein. 1988. Speech Physiology, Speech
Oxford: Blackwell.
Perception, and Acoustic Phonetics. New York: Cambridge University
. 2003. Writing Systems: An Introduction to heir Linguistic Analysis.
Press.
Cambridge: Cambridge University Press.
Linell, Per. 2005. he Written Language Bias in Linguistics: Its Nature,
. 2005. Sociolinguistics: he Study of Speakers Choices.
Origin and Transformation. London: Routledge.
Lloyd, Geofrey and Nathan Sivin. 2002. he Way and the Word: Science
Crystal, David. 2001. Language and the Internet. Cambridge: Cambridge
and Medicine in Early China and Greece. New Haven and London: Yale
University Press.
University Press.
Daswani, C. J., ed. 2001. Language Education in Multilingual India. New
Martin-Jones, Marilyn, and Kathryn Jones. 2000. Multilingual
Delhi: UNESCO.
Literacies: Reading and Writing Diferent Worlds. Amsterdam and
De Biasi, Pierre-Marc. 1999. Le papier: Une aventure au quotidien.
Paris: Gallimard.
McLuhan, Marshall. 1964. Understanding Media: he Extension of Man.
Eisenstein, Elizabeth L. 1979. he Printing Press as an Agent of Change.
New York: McGraw-Hill.
Mller, Karin. 1990. Schreibe, wie du sprichst!: Eine Maxime im
Febvre, Lucien, and Henri-Jean Martin. [1958] 1999. Lapparition du
Spannungsfeld von Mndlichkeit und Schriftlichkeit. Frankfurt am
livre. Paris: Albin Michel (Bibliothque Evolution Humanit).
Main and Bern: Peter Lang.
Ferguson, Charles A. 1959. Diglossia. Word 15: 32540.
Negroponte, Nicholas. 1995. Being Digital. New York: Alfred A. Knopf.
Fischer, Herv. 2006. Digital Shock: Confronting the New Reality.
A cyberspace extension is available online at: http://archives.obs-us.
Montreal: McGill-Queens University Press.
com/obs/english/books/nn/bdintro.htm.
Gelb, I. J. 1963. A Study of Writing. Chicago and London: University of
Nissen, Hans J. 1988. he Early History of the Ancient Near East. Chicago
Chicago Press.
and London: he University of Chicago Press.
Goody, Jack. 1977. he Domestication of the Savage Mind. Cambridge:
Nissen, Hans J., Peter Damerow, and Robert K. Englund. 1990. Frhe
Schrift und Techniken der Wirtschaftsverwaltung im alten Vorderen
Goody, Jack, and Ian Watt. 1968. he consequences of literacy. In
Orient: Informationsspeicherung und verarbeitung vor 5000 Jahren.
Literacy in Traditional Societies, ed. Jack Goody, 2768. Cambridge:
N.p.: Verlag Franzbecker.
Nystrand, Martin. 1986. he Structure of Written Communication.
Gurin-Pace, F., and A. Blum. 1999. Lillusion comparative: Les logiques
Orlando, FL: Academic Press.
dlaboration et dutilisation dune enqute internationale sur
Olson, David R. 1977. From utterance to text: he bias of language in
lillettrisme. Population 54: 271302.
speech and writing. Harvard Educational Review 47.3: 25786.
Harris, Roy. 1980. he Language Makers. Ithaca, NY: Cornell University
. 1991. Literacy and Orality. Cambridge: Cambridge University
Press.
Press.
. 2000. Rethinking Writing. London: Athlon Press.
. 1994. he World on Paper. Cambridge: Cambridge University
Harris, William V. 1989. Ancient Literacy. Cambridge: Harvard University
Press.
Press.
44
Ong, Walter J. 1982. Orality and Literacy: he Technologizing of the Word. Taylor, Insup, and M. Martin Taylor. 1995. Writing and Literacy in
London: Methuen. Chinese, Korean and Japanese. Amsterdam and Philadelphia: John
Parkinson, Richard. 1999. Cracking Codes: he Rosetta Stone and Benjamins.
Decipherment. London: British Museum Press. homas, Rosalind. 1992. Literacy and Ortality in Ancient Greece.
Posener, Georges. 1956. Littrature et politique dans lgypte de la XIIe Cambridge: Cambridge University Press.
dynastie. Paris: Honor Champion (Bibliothque lcole des hautes Twitchett, Denis C. 1983. Printing and Publishing in Medieval China.
tudes. 3007.) London: he Wynkyn de Worde Society.
Richards, Cameron. 2000. Hypermedia, Internet communication, and UNESCO Institute for Statistics. 2006. Available online at: http://portal.
the challenge of redeining literacy in the electronic age. Language unesco.org/education/en/ev.php-URL_ID=40338&URL_DO=DO_
Learning and Technology 4.2: 5977. TOPIC&URL_SECTION=201.html.
Roberts, Colin H., and T. C. Skeat. 1983. he Birth of the Codex. Unseth, Peter. 2005. Sociolinguistic parallels between choosing scripts
Oxford: Oxford University Press. and languages. Written Language and Literacy 8.1: 1942.
Sassoon, Rosemary. 1995. he Acquisition of a Second Writing System. Varghese, N. V. 2000. Costing of total literacy campaigns in India. In
Oxford: Intellect. Adult Education in India, ed. C. J. Daswani and S. Y. Shah, 22750. New
Schifman, Harold. 1996. Linguistic Culture and Language Policy. London Delhi: UNESCO.
and New York: Routledge. Weber, Max. 1922. Wirtschaft und Gesellschaft: Grundriss der verstehen-
Schmandt-Besserat, Denise. 1992. Before Writing. Austin: he University den Soziologie. Tbingen: J. C. B. Mohr.
of Texas Press. Wiley, T. G. 1996 . Literacy and Language Diversity in the United
Stratton, Jon. 1980. Writing and the concept of law in Ancient Greece. States. Washington, DC: Center of Applied Linguistics and Delta
Visible Language 14.2: 99121. Systems.
45
systems of communication (semaphores, bee language, etc.),
but the study of the biology of language might shed some light
on the case of human language communication. In this essay, I
consider some diverging ideas about the role of communication
5 in driving language evolution.
he study of the biology of language (see biolinguistics) is
traditionally divided into the investigation of three questions: 1)
What is knowledge of language? 2) How does language develop in
EXPLAINING LANGUAGE: NEUROSCIENCE, the child? 3) And how does language evolve in the species? (See
GENETICS, AND EVOLUTION Chomsky 1980, 2006; Jenkins 2000, 2004.) Note that the study of
the neuroscience of language cross-cuts with all three questions.
Lyle Jenkins hat is, we can ask: 1) What are the neurological underpinnings
of the faculty of language? 2) How does the language faculty
develop in the nervous system of the individual? 3) How did the
language faculty evolve in the species? (See Chomsky and Lasnik
1995.)
hese three questions are sometimes referred to as the what
and how questions of biolinguistics. here is another question,
Before undertaking a discussion of the explanation of language, the why question, which is more diicult to answer. his is the
we should point out that we are using this word in a special sense. question of why the principles of language are what they are
As Noam Chomsky has noted (see Jenkins 2000), while we can- (Chomsky 2004). Investigations into these why questions has in
not ask serious questions about general notions like vision or recent years been termed the minimalist program (or minimal-
language, we can ask them of speciic systems like insect vision ism), but interest in and discussion of these questions go back to
or human language. In what follows, our focus is on the biologi- the earliest days of biolinguistics (Chomsky 1995; Boeckx 2006).
cal foundations of language. As a result, certain areas commonly Properties of the attained language derive from three factors
referred to as languages are excluded from consideration, for (Chomsky 2005b): 1) genetic endowment for language, 2) expe-
example, some invented logical systems, computer languages rience, and 3) principles not speciic to the faculty of language.
such as Java, encrypted languages, the language of DNA, and Principles in 3) might even be non-domain-speciic or non-
so on. hese are all important and interesting areas of investi- organism-speciic principles. Examples of such principles are
gation. In fact, signiicant insights into human language may principles of eicient computation. Note that similar questions
be gained from the study of some of these ields. For example, can be posed about any biological system viruses, protein fold-
it has been argued that particular systems of logical form ing, bacterial cell division, sunlowers, bumblebees, falling cats,
may shed light on the semantics of human language. Both bio- nervous systems, and so on.
logical factors and nonbiological factors interact in such areas as Furthermore, similar kinds of questions arise in every linguis-
pragmatics and sociolinguistics. In addition, the study of tics research program (Chomsky 2005b). To make the discussion
formal languages (e.g., the Chomsky hierarchy) has also led manageable, in what follows I draw examples and discussion
to some important contributions. However, these areas cannot from minimalism. However, the issues and problems carry over
be completely accounted for by a consideration of the biology of to all other research programs concerned with providing expla-
human language. It is important to keep in mind that an account nations for properties of human language and accounting for
of cognitive systems like human language (as well as systems of them in terms of neurobiology, genetics, and evolution. For
animal communication) often exhibit a signiicant degree example, any theory of the language faculty will generate ini-
of modularity. In this view, biological factors interact with nitely many expressions that provide instructions to the senso-
other factors to provide a uniied explanation. Sometimes the rimotor and semantic interfaces. All such generative systems
term i-language is used to distinguish the biological object in will have an operation that combines structures (e.g., in mini-
the mind-brain that biolinguists study from other uses of word malism, merge), such as the phrase the boy with the phrase saw
language. the cat, and so on, formed from the lexical items of the language.
he same is true of other areas of study that are even more Applying this operation over and over (unbounded Merge), we get
closely related to human language, such as the role of language a discrete ininity of expressions, part of property 1), our genetic
in poems and novels, or the inluence of economic factors and endowment, in this particular case of the genetic component of
conquests and invasions on language change. historical lin- the language faculty.
guistics may involve factors both within and outside of the Many well-known accounts of the evolution of language pro-
scope of human language biology. Again, although analysis of pose communication as the primary selective force involved
the biology of human language may shed some light on these in the origin of language (see Works Cited). Here, for purposes
areas (e.g., the study of phonetics and phonology may be of comparison, we present a noncommunicative account of
useful in the analysis of poetry), in general it will not provide an the origins of language, suggested by Chomsky and based on
exhaustive account of these areas. Similarly, there has been great work in minimalism. However, we stress again that this kind of
interest in language as a system of communication. For the account is also compatible with approaches based on other lin-
reasons discussed here, there is not much to say about arbitrary guistic research programs. We then discuss another viewpoint
46
Explaining Language
based on the idea that language evolved from gestures, an idea directions here (but see the discussion of population dynamics
that is represented by a number of approaches to the evolution in a later secion; Nowak and Komarova 2001).
of language. After that, we present an example of the compara- A number of these accounts attempt to interconnect the evo-
tive approach to evolution in biolinguistics and discuss language lution of manual gestures, sign language, spoken language, and
and the neurosciences, focusing on leftright asymmetries of the motor control in various ways. Some of this work is based on the
language areas to illustrate this research. Finally, we discuss the discovery of a system of mirror neurons (mirror systems,
genetics of language, using the studies of the FOXP2 gene system imitation, and language) (Gallese, Fadiga, et al. 1996). his
to show how studies of language phenotype, neurobiology, and and later work demonstrated the existence of neurons in area F5
molecular biology, as well as comparative and evolutionary stud- of the premotor cortex of the monkey, which are activated when
ies with other animal systems, are being carried out. the monkey executes an action, for example, grasping an object,
Work on the principles of eicient computation governing the and also when the monkey observes and recognizes the action
application of the operation of Merge seem to suggest an asym- carried out by another monkey, or even the experimenter. In
metry, namely, that the computational principles are optimized addition, a subset of audiovisual mirror neurons were discov-
for the semantic interface, not the sensorimotor interface. his is ered that are activated when the sound of an action is perceived,
because conditions of computational eiciency and ease of com- for example, the tearing of paper (Kohler et al. 2002). In addi-
munication conlict, as is familiar from the theory of language tion to hand mirror neurons, communicative mouth mirror
parsing. his in turn has led Chomsky to suggest that external- neurons were discovered that were activated both for ingestive
ization of language, and hence communication, was a secondary actions and for mouth actions with communicative content,
adaptation of language. his would mean that language arose such as lip smacking in the monkey (Ferrari et al. 2003).
primarily as an internal language of thought. Supporting Since it had been suggested on the basis of cytoarchitectonic
this idea is the existence of sign languages, which develop in studies that there was a homology between area F5 of the mon-
a diferent modality but in other respects are very similar to spo- key and area 44 (in brocas area) of the human brain (Petrides
ken language (Kegl 2004; Pettito 2005). and Pandya 1994), researchers looked for and found mirror
hese design features of language have led Chomsky to pro- neuron systems in humans, using fMRI (Iacoboni et al. 1999)
pose the following scenario for the evolution of language. Several and event-related magnetoencephalography (MEG) (Nishitani
decades ago Chomsky suggested, on the basis of results from non- and Hari 2000) in place of single neuron studies. Mirror neu-
human primate studies, that the higher apes might well have a rons discharge whether the action is executed, observed, or
conceptual system with a system of object reference and notions heard. Moreover, they even discharge in the human system
such as agent, goal, instrument, and so on. What they lack, how- when subjects are exposed to syntactic structures that describe
ever, is the central design feature of human language, namely, goal-directed actions (Tettamanti et al. 2005). In an fMRI study,
the capacity to deal with discrete ininities through recursive 17 Italian speakers were asked to listen to sentences describing
rules (Chomsky 2004, 47). He proposed that when you link that actions performed with the mouth (I bite an apple), with the
capacity to the conceptual system of the other primates, you get hand (I grasp a knife), and with the leg (I kick the ball). In
human language, which provides the capacity for thought, plan- addition, they were presented a control sentence with abstract
ning, evaluation and so on, over an unbounded range, and then content (I appreciate sincerity). In the case of the action-
you have a totally new organism (Chomsky 2004, 48). related words, the left-hemispheric fronto-parietaltemporal
hus, let us assume that some individual in the lineage to mod- network containing the pars opercularis of the inferior frontal
ern humans underwent a genetic change such that some neural gyrus (Brocas area) was activated. Other areas were diferen-
circuit(s) were reorganized to support the capacity for recursion. tially activated, depending on the body part. hey conclude that
his in turn provided the capacity for thought and so on over an the experiment showed that the role of Brocas area was in the
unbounded range. his in itself provided that individual and any access to abstract action representations, rather than in syntactic
ofspring with a selective advantage that then spread through the processing per se (277).
group. hus, the earliest stage of language would have been just On the basis of these outlined indings, it has been sug-
that: a language of thought, used internally (Chomsky 2005a, 6). gested that speech may have evolved from gesture rather than
At some later stage, there was an advantage to externalization, so from vocal communication by utilizing the mirror neuron sys-
that the capacity would be linked as a secondary process to the tem (Rizzolatti and Arbib 1998; Gentilucci and Corballis 2006;
sensorimotor system for externalization and interaction, includ- Fogassi and Ferrari 2007). Leonardo Fogassi and Pier Francesco
ing communication (7). Ferrari note that the motor theory of speech perception its well
he evolutionary scenario just outlined is derived from design with an account in terms of mirror neurons in that the objects
principles suggested from work on human languages. Many other of speech perception are the intended phonetic gestures of the
evolutionary scenarios have been proposed that assume that speaker, represented in the brain as invariant motor commands
communication or other social factors played a more primary (Liberman and Mattingly 1985, 2). However, Fogassi and Ferrari
role. hese include accounts involving manual and facial ges- note that, even if the mirror neuron system is involved in speech,
tures (Corballis 2002), protolanguage (Bickerton 1996), groom- the currently available evidence does not appear to support a
ing (Dunbar 1998), and action and motor control (Rizzolatti dedicated mirror-neuron system for language in humans.
and Arbib 1998). However, the two kinds of accounts are not Additional accounts of the evolution of language may also
incompatible and may represent diferent aspects or stages in be found in Maynard Smith and Szathmary (1998), Lieberman
the evolution of language. We cannot review all of these research (2006), and Christiansen and Kirby (2003). See also adaptation;
47
grooming, gossip, and language; mirror systems, imi- a rattle and B to a warble. As for possible reasons why the
tation, and language; evolutionary psychology; starlings succeeded with the context-free task while the tamarins
morphology, evolution and; origins of language; failed, and for alternative explanations of the learning results, see
phonology, evolution of; pragmatics, evolution and; additional discussion in Fitch and Hauser (2004), M. Liberman
primate vocalizations; semantics, evolution and; (2004), Gentner et al. (2006), and Hauser et al. (2007).
speech anatomy, evolution of; syntax, evolution of;
verbal art, evolution and.
LANGUAGE AND THE NEUROSCIENCES
Turning to the neurosciences (see questions 13), with the
THE COMPARATIVE APPROACH IN BIOLINGUISTICS
advent of more tools both experimental, such as imaging and
hroughout the modern era of biolinguistics, a question that has the methods of comparative genomics, and theoretical, such
been much debated is to what degree the faculty of language as computational theories of linguistics we can look forward
is uniquely human. Marc D. Hauser and colleages (2002) have to more informative models of language and the brain. David
stressed the importance of the comparative approach in the Poeppel and Gregory Hickok (2004) note three problems with
study of this question. Early on, Chomsky, in making a case for classical models of language (involving such concepts as Brocas
the genetic determination of language, used arguments from area and wernickes area): 1) hey are inadequate for aphasic
animal behavior (ethology) to note the similarities in learning symptomatology, 2) they are based on an impoverished linguis-
between birdsong and animals, in particular, rapidity of learn- tic model, and 3) there are anatomical problems.
ing, underdetermination of data, and so on (Chomsky 1959). As for aphasic symptomatology, the classical models do not
Hauser, Chomsky, and their colleagues have emphasized a explain certain subtypes of aphasia, like anomic aphasia. Also,
number of methodological points concerning the comparative clusters of aphasic symptoms are highly variable and dissocia-
approach, using a distinction between the faculty of language ble, indicating that there is a more complex architecture under-
in the broad sense (FLB) and the faculty of language in the nar- lying the syndromes.
row sense (FLN). he basic idea is that before concluding that As for the second problem, there has been a tendency of some
some property of language is uniquely human, one should study implementations of classical models to incorporate a monolithic
a wide variety of species with a wide variety of methods. And picture of linguistic components, for example, production versus
before concluding that some property of language is unique to comprehension, or semantics versus syntax, without regard for
language, one should consider the possibility that the property iner computational subdivisions.
is present in some other (cognitive) domain, for example, music And inally, certain anatomical problems came into view. For
or mathematics. hey tentatively conclude that recursion may example, Brocas aphasia and Wernickes aphasia did not always
be a property of language that is unique to language and, if so, correspond to damage in the areas that bore their names. In addi-
belongs to the faculty of language in the narrow sense. tion, these classical areas were not always anatomically or func-
An example of the application of the comparative method is tionally homogenous. In addition, areas outside these regions
the investigation of the computational abilities of nonhuman pri- were found to be implicated in language processing, including,
mates by W. Tecumseh Fitch and Hauser (2004), who tested the for example, the anterior superior temporal lobe, the middle
ability of cotton-top tamarins, a New World primate species, as temporal gyrus (MTG), the temporo-parietal junction, the
well as human controls, to process diferent kinds of grammars. basal ganglia, and many right hemisphere homologs
Using a familiarization/discrimination paradigm, they found (Poeppel and Hickok 2004, 5). As noted, in the last few decades
that the tamarins were able to spontaneously process inite-state many new approaches and tools have been developed in neuro-
grammars, which generate strings of syllables of the form (AB)n, linguistics and neurology, genetics and molecular neurobiology
such as ABAB, ABABAB. However, they were unable to process (examples follow), and these have helped to overcome the kinds
context-free grammars, which generate strings of syllables of the of issues pointed out by Poeppel and Hickok.
form AnBn, such as AABB, AAABBB. It is known from formal lan- For a review of some of the attempts to develop a new func-
guage theory (the Chomsky hierarchy) that context-free gram- tional anatomy of language, see the essays published along
mars are more powerful than inite-state grammars. Moreover, with Poeppel and Hickock (2004) in Cognition 92 (2004). For
the humans tested were able to rapidly learn either grammar. new approaches to the study of Brocas area, see Grodzinsky and
he authors conclude that the acquisition of hierarchical pro- Amunts (2006).
cessing ability, that is, the ability to learn context-free gram-
mars, may have represented a critical juncture in the evolution
LEFTRIGHT ASYMMETRIES OF THE LANGUAGE AREAS
of the human language faculty (380).
In a later study, Timothy Q. Gentner and colleagues (2006) Ever since Paul Pierre Brocas and Carl Wernickes seminal dis-
showed that European starlings, in contrast to tamarins, can coveries of areas involved in language processing, questions
recognize acoustic patterns generated by context-free gram- about asymmetries in the brain has been a lively area of research.
mars. Using an operant conditioning paradigm, they found that As early as 1892, Daniel J. Cunningham reports that he found
9 of 11 starlings were able to learn both inite-state grammar and the left sylvian issure longer in the chimpanzee and macaque.
context-free grammar sequences accurately. he (AB)n and AnBn Cunningham, in turn, cites earlier work by Oskar Eberstaller on
sequences in this case were made up of acoustic units (motifs) the Sylvian issure in humans, who had concluded that it was
from the song of the starlings. In this case, A corresponded to longer in the left hemisphere than in the right (on average). He
48
Explaining Language
postulated that this region held the key to what he called the sen- in a region of the planum temporale in human, chimpanzee,
sible Sprachcentrum (sensible/cognizant language center). and rhesus monkey brains. It was found that only human brains
Claudio Cantalupo and William D. Hopkins (2001) report exhibited asymmetries in minicolumn morphology, in particu-
inding an anatomical asymmetry in Brocas area in three great lar, wider columns and more neuropil space (Buxhoeveden et al.
ape species. hey obtained magnetic resonance images (MRI) 2001).
(neuroimaging) from 20 chimpanzees (P. troglodytes), 5 It is possible that circuits could be reorganized within a lan-
bonobos (P. paniscus), and 2 gorillas (G. gorilla). In humans, guage region without a signiicant volumetric change so that a
Brodmanns area 44 corresponds to part of Brocas area within novel function in language could evolve. Sherwood and col-
the inferior frontal gyrus (IFG). his area is larger in the left leagues conclude: herefore, it is likely that Brodmanns area
hemisphere than in the right. Furthermore, it is known that 44 homolog in great apes, while similar in basic structure to that
Brocas area is vital for speech production (with the qualiica- in humans, difers in subtle aspects of connectivity and lacks
tions discussed earlier). homologous function (284).
Although the great apes were known to have a homolog of Allen Braun (2003) notes that MRI could still turn out to be
area 44 on the basis of cytoarchitectonic and electrical stimu- useful for the study of microstructure at higher ield strengths,
lation studies, no leftright anatomical asymmetry had been with the addition of MR contrast agents, and with the use of
shown. Cantalupo and Hopkins found a pattern of morpholog- difusion-weighted MR methods. He also notes that the pars
ical asymmetry similar to that found in the homologous area in orbitalis has often been arbitrarily excluded from the deinition
humans. his would place the origin of the asymmetry for the of Brocas area, and might be important in the search for ante-
anatomical substrate for speech production to at least ive mil- cedents of language in nonhuman primates. In particular, some
lion years ago. studies suggest that the pars orbitalis is selectively activated by
Since the great apes exhibit only primitive vocalizations, these semantic processes (as opposed to phonological or syntactic
authors speculate that this area might have subserved a gestural processes) (Bookheimer 2002).
system (see earlier discussion). hey note the presence in mon- It is known that nonhuman primates have structures homolo-
keys of mirror neurons in area 44 that subserve the imitation of gous to the perisylvian areas involved in human language, that
hand grasping and manipulation (Rizzolatti and Arbib 1998). is, support both expressive and receptive language (Galaburda
hey also observe that captive great apes have a greater right- and Pandya 1983; Deacon 1989). Ricardo Gil-da-Costa and col-
hand bias when gesturing is accompanied by vocalization. Hence leagues (2006) presented species-speciic vocalizations in rhesus
in the great apes, the asymmetry may have subserved the pro- macaques and found that the vocalizations produced distinct
duction of gestures accompanied by vocalizations, whereas, for patterns of brain activity in areas homologous to the perisyl-
humans, this ability was selected for the development of speech vian language areas in humans using H215O positron emission
systems, accompanied by the expansion of Brodmanns area 45 tomography (PET). Two classes of auditory stimuli were pre-
(which, along with Brodmanns area 44, makes up Brocas area) sented to the monkeys. One was species-speciic macaque
(Cantalupo and Hopkins 2001, 505). vocalizations (coos and screams). As a control, nonbiologi-
However, additional studies of Brodmanns area 44 in cal sounds were presented that matched the species-speciic
African great apes (P. troglodytes and G. gorilla) call into ques- vocalizations in frequency, rate, scale, and duration. hey found,
tion whether the techniques used in Cantalupo and Hopkins for example, a greater response to species-speciic calls than to
study were suicient to demonstrate the leftright asymmetry. nonbiological sounds in the perisylvian system with homologs
Chet C. Sherwood and colleagues (2003) found considerable in humans, for example, to the area Tpt of the temporal planum
variation in the distribution of the inferior frontal sulci among and to the anterior perisylvian cortex, roughly corresponding to
great ape brains. hey also constructed cytoarchitectural maps of the areas studied by Wernicke and Broca in humans. However,
Brodmanns area 44, examining myeloarchitecture and immu- they did not ind any clear lateralization efects in the macaque
nohistochemical staining patterns. When they studied the IFG brain comparable to the anatomical and functional asymmetries
of great ape brains, they found a poor correspondence between documented in humans and anatomical asymmetries in apes
the borders observed in the cytoarchitectural maps and the bor- (Gannon et al. 1998).
ders in the surface anatomy (e.g., sulcal landmarks). here were Gil-da-Costa and colleagues (2006) note that the perisylvian
similar indings for human brains in an earlier study (Amunts et regions are not performing linguistic computations in the
al. 1999). Sherwood and colleagues conclude that in the study macaque, but could be performing a prelinguistic function in
by Cantalupo and Hopkins, it is unlikely that the sulci used to associating the sound and meaning of species-speciic vocal-
deine the pars opercularis coincided with the borders of cyto- izations. Furthermore, this would position the perisylvian sys-
architectural area 44 (2003, 284). In general then, macrostruc- tem to be recruited for use during the evolution of language.
ture is a poor predictor of microstructure. More speciically, it may have been exapted during the emer-
Sherwood and colleagues also point out that even if human- gence of more complex neural mechanisms that couple sound
like asymmetries of the inferior frontal gyrus and of the planum and meaning in human language (2006, 1070). Although I have
temporale are conirmed, these gross asymmetries will not suf- focused here on the perisylvian system, it should be emphasized
ice to explain the unique neural wiring that supports human that areas outside this system have also been demonstrated to be
language (284). To that end, comparative studies of microstruc- involved in language.
ture in humans and great apes are needed. For example, a com- K. A. Shapiro and colleagues (2006) provide another example
puterized imaging program was used to examine minicolumns of the application of imaging studies to investigate how linguistic
49
categories like nouns, verbs, and adjectives are organized in the dyslexia, specific language impairment, and autism
brain. An event-related functional MRI imaging study has found and language). By understanding how genetic changes can
speciic brain sites that are activated by either nouns or verbs, cause the operation of language to break down, we get an idea of
but not both. In a series of experiments, subjects were asked to the genes that are important for language acquisition, process-
produce nouns and verbs in short phrases as real words (the ing, and, ultimately, language evolution.
ducks, he plays), as well as pseudowords (the wugs, he zibs), both An autosomal-dominant and monogenic disorder of speech
with regular inlections and irregular inlections (geese, wrote), and language was found in the KE family with a 3-generation
including both concrete and abstract words. Speciic brain pedigree. A monogenic disorder involves a mutation in a single
areas were selectively activated for either verb production (left gene, and here the individuals have one copy of the mutant gene
prefrontal cortex and left superior parietal lobule) or for noun and one normal gene on two autosomal (non-sex) chromosomes.
production (left anterior fusiform gyrus) across the entire bat- he disorder was mapped to the region 7q31 on chromosome 7,
tery of tests. Moreover, the areas were nonoverlapping, leading and it was shown that the gene (called FOXP2) had a mutation
the authors to conclude that these regions are involved in rep- in a forkhead-domain of the protein it encoded in the afected
resenting core conceptual properties of nouns and verbs (2006, family (Lai et al. 2001).
1644). he individuals were diagnosed as having developmental ver-
In recent years, it has become possible to study asymmetries bal dyspraxia. he phenotype was found to be quite complex,
on a molecular level as well (Sun and Walsh 2006). As discussed afecting orofacial sequencing, articulation, grammar, and cog-
earlier, there are functional, anatomical, and cytoarchitectonic nition and is still incompletely understood and under investiga-
diferences between the left and right cerebral hemispheres in tion (see also genes and language; Jenkins 2000).
humans. To determine what the molecular basis for these asym- he FOXP2 gene was found to code for a transcription factor,
metries might be, Tao Sun and colleagues (2005) compared left that is, a protein that regulates gene expression by turning a gene
and right embryonic cerebral hemispheres for leftright difer- on or of or otherwise modulating its activity. It is natural to ask
ences in gene expression, using serial analysis of gene expres- what other genes FOXP2 may regulate and how it regulates these
sion (SAGE). hey discovered 27 genes whose transcriptions genes (turning them on or of, for example), as well as to deter-
were diferentially expressed on the left and right sides. In par- mine whether any of these genes downstream are involved in
ticular, the gene LMO4, which asymmetrically expressed the Lim speech and language in a more direct way.
Domain Only 4 transcription factor, is more highly expressed in To ind the gene targets of FOXP2 in the brain and to deter-
the perisylvian regions of the right hemisphere than in the left mine the efects on those genes, methods are being developed to
at 12 weeks and 14 weeks. Further studies are needed to deter- identify these neural targets both in vitro and in vivo (Geschwind
mine how LMO4 expression is regulated by factors still earlier in and Miller 2001). he laboratory of D. H. Geschwind has devel-
development. oped a genomic screening approach that combines 1) chromatin
Mouse cortex was also examined, and it was found that immunoprecipitation and 2) microarray analysis (ChIP-Chip).
although Lmo4 expression was moderately asymmetrical in every In chromatin immunoprecipitation, an antibody that recognizes
individual mouse brain, the expression was not consistently lat- the protein of interest (e.g., foxp) is used to ish out a protein-
eralized to either the left or the right. his may be related to the DNA complex. he DNA is then hybridized to arrays with DNA
fact that asymmetries like paw preference are seen in individual from thousands of human genes. his allows the identiication of
mice but are not biased in the population as a whole, as hand binding sites for transcription factors (in this case, FOXP2). he
preference is in humans. goal is to discover potential gene candidates involved in the devel-
he results of this study are also consistent with the observa- opment of neural circuits supporting speech and language.
tion that the genes involved in visceral asymmetries (e.g., of the he homologue to the human FOXP2 gene has been discov-
heart) are not measurably implicated in cerebral asymmetries. It ered in a number of diferent species, including mice (Foxp2)
had been noted earlier that situs inversus mutations in humans and songbirds, such as the zebra inch (FoxP2). Whatever ones
do not appear to interfere with the left-hemisphere localization of views on the relationship between human language and other
language and handedness (Kennedy et al. 1999). In these earlier animal communication systems, it is important to study the evo-
studies, it had been found that the pattern of language laterali- lutionary origin of genes that afect language, such as FOXP2, for
zation in patients with situs inversus was identical to that found one can learn about the neural pathways constructed by these
in 95 percent of right-handed individuals with normal situs. It genes, which might not otherwise be possible in experiments
was concluded that the pathway afecting language dominance with humans.
and handedness was most likely distinct from that afecting the It has been found that the zebra inch and human protein
asymmetry of the visceral organs. sequence is 100 percent identical within the DNA-binding
domain, suggesting a possible shared function (White et al. 2006;
Haesler et al. 2004). In addition, Constance Scharf and Sebastian
GENETICS AND SPEECH DISORDERS
Haesler (2005) report that the FoxP2 pattern of expression in the
Biolinguists would like to understand the wiring of networks brain of birds that learn songs by imitation resembles that found
underlying language function at the level of genes. We have seen in rodents and humans. In particular, FoxP2 is expressed in the
that one way to study this question is to use such diferential gene same cell types, for example, striatal medium spiny neurons.
expression methods as SAGE. Another key way of investigating Moreover, FoxP2 is expressed both in the embryo and in the
the genetics of language is by studying language disorders (see adult. To ind out whether FoxP2 is required for song behavior in
50
Explaining Language
the zebra inch, the Scharf laboratory is using RNA to downregu- one of the researchers on the study, noted that, in contrast,
late FoxP2 in Area A, a striatal region important for song learning Patient 1 had normal spatial ability but could form next to no
(Scharf and Nottebohm 1991). It is known that young male zebra complete words. When asked what animal has long ears and eats
inches express more FoxP2 bilaterally in Area X when learning carrots, he could only pronounce the r in the word rabbit but was
to sing (Haesler et al. 2004). hey will then be able to determine able to draw the letter on the blackboard and add features such
whether the bird is still able to sing normal song as well as copy as whiskers (McMaster 2005).
the song of an adult male tutor. he duplicated region on chromosome 7 contains around 27
Weiguo Shu and colleagues (2005) found that disrupting the genes, but it is not yet known which of the duplicate gene copies
Foxp2 gene in mice resulted in impairing their ultrasonic vocal- are involved in the expressive language delay, although certain
ization. In addition to their more familiar sonic vocalizations, genes have been ruled out.
mice also make ultrasonic sounds, for example, when they are A gene (SRPX2) responsible for rolandic seizures that are
separated from their mothers. associated with oral and speech dyspraxia and mental retarda-
In order to study the efect of disrupting the Foxp2 gene on tion has been identiied (Roll et al. 2006). It is located on Xq22.
vocalization, these researchers constructed two versions of he SPRX2 protein is expressed and secreted from neurons in the
knockout mice. One version had two copies of the defective human adult brain, including the rolandic area. his study char-
Foxp2 gene (the homozygous mice) and the other version had acterizes two diferent mutations.
one copy of the defective Foxp2 gene, as well as one gene that he irst mutation was found in a patient with oro-facial dys-
functioned normally (the heterozygous mice). he homozygous praxia and severe speech delay. he second mutation was found
mice (double knockout) sufered severe motor impairment, in a male patient with rolandic seizures and bilateral perisylvian
premature death, and an absence of ultrasonic vocalizations polymicrogyria. he authors also note that Sprx2 is not expressed
that are normally produced when they are separated from their during murine embryogenesis, suggesting that SPRX2 might play
mother. On the other hand, the heterozygous mice, with a sin- a speciic role in human brain development, particularly of the
gle working copy of the gene, exhibited modest developmental rolandic and sylvian areas.
delay and produced fewer ultrasonic vocalizations than normal.
In addition, it was found that the Purkinje cells in the cerebel-
RECOVERING ANCIENT DNA
lum, responsible for ine motor control, were abnormal. It is con-
cluded that the indings support a role for Foxp2 in cerebellar In addition to classical studies of fossils, there is currently
development and in a developmental process that subsumes renewed interest in work on ancient DNA. he Neanderthals
social communication functions (Shu et al. 2001, 9643). (Homo neanderthalensis) were an extinct group of hominids that
Timothy E. Holy and Zhongsheng Guo (2005) studied the are most closely related to modern humans (Homo sapiens). Up
ultrasonic vocalizations that male mice emit, when they encoun- to now, information about Neanderthals has been limited to
ter female mice or their pheromones. hey discovered that the archaeological data and a few hominid remains. Comparing the
vocalizations, which have frequencies ranging from 30100 kHz, genetic sequences of Neanderthals and currently living humans
have some of the characteristics of song, for example, birdsong. would allow one to pinpoint genetic changes that have occurred
In particular, they were able to classify diferent syllable types during the last few hundred thousand years. In particular, one
and found a temporal sequencing structure in the vocalizations. would be able to examine and compare diferences in such
In addition, individual males, though genetically identical, pro- genes as FOXP2 in living humans, Neanderthals, and nonhuman
duced songs with characteristic syllabic and temporal structure. primates.
hese traits reliably distinguish them from other males. Holy Partial DNA sequences of the Neanderthal have now been
notes that these discoveries increase the attractiveness of mice published by two groups led by Svante Pbo and Edward Rubin
as model systems for study of vocalizations (White et al. 2006, (Green et al. 2006; Noonan et al. 2006). Pbos group identi-
10378). ied a 38,000-year-old Neanderthal fossil from the Vindija cave
We have focused on the FOXP2 gene here because there is no (the Neanderthals became extinct around 30,000 years ago). he
other gene afecting speech and language about which so much fossil was free enough from contamination to permit DNA to be
information is available that bears on questions of neurology and extracted and subjected to large-scale parallel 454 sequencing, a
evolution of language. However, the search is underway for other newer and faster system for sequencing DNA.
additional genes. Pbos group was able to sequence and analyze about one
For example, genetics researchers have also discovered a million base pairs of Neanderthal DNA. Note that the genomes of
DNA duplication in a nine-year-old boy with expressive language modern humans and Neanderthals each have about three billion
delay (Patient 1) (Fisher 2005; Somerville et al. 2005). Although base pairs (3 gigabases). Among the conclusions they reached
his comprehension of language was at the level of a seven-year- on the basis of the comparison to human DNA was that modern
old child, his expression of language was comparable to that of human and Neanderthal DNA sequences diverged on average
only a two-and-a-half-year-old. about 500,000 years ago. hey also expected to produce a draft of
he region of DNA duplication in Patient 1 was found to be the Neanderthal genome within two years. (For their preliminary
on chromosome 7, and interestingly, was found to be identical to results, see Pennisi 2009.)
the region that is deleted in Williams-Beuren syndrome (WBS). Rubins group obtained around 65,000 base pairs of
Patients with WBS have relatively good expressive language but Neanderthal sequence. hey used a combination of the Sanger
are impaired in the area of spatial construction. Lucy Osborne, method of DNA sequencing and the (faster) pyrosequencing
51
method. Although the Sanger method yields less amounts of diicult to discover the signiicance of nucleotide changes in
sequence than pyrosequencing, the error rate is lower. From the noncoding regions, since one cannot determine their functional
sequence data they estimated that Neanderthals and humans signiicance by visual inspection. Nonetheless, until this infor-
shared a most recent common ancestor around 706,000 years mation is available, there is no reason to favor the idea that the
ago. here has been interest in whether Neanderthals might two changes in the FOXP2 protein are functional. Carroll notes
have contributed to the European gene pool. For example, one that while it may be tempting to reach for the low-hanging fruit
study suggests that humans may have acquired the microceph- of coding sequence changes, the task of unravelling the regula-
alin gene, which regulates brain size during development, by tory puzzle is yet to come (1165).
interbreeding with another species, possibly the Neanderthals. In fact, some data favor the regulatory sequence hypothesis.
However, the data here do not support this possibility, although Carroll notes that in evaluating whether FOXP2 is involved in the
more sequence data will be needed to answer the admixture evolution of the neural circuits of language, one must ask several
question deinitively (Evans et al. 2006). However, they did questions. he irst is, is the gene product used in multiple tis-
establish the validity of their sequencing approach, which sues? (1164).
allows for the rapid recovery of Neanderthal sequences of It is known that FOXP2 appears to act as a repressor in lung
interest from multiple independent specimens, without the tissue (Shu et al. 2001). Moreover, studies in mice have revealed
need for whole-genome resequencing (Noonan et al. 2006, that in addition to Foxp2, two other transcription factors, Foxp1
1118). and Foxp4 are expressed in pulmonary and gut tissues (Lu et al.
2002). Foxp4 is also expressed in neural tissues during embry-
onic development. It is not at all surprising to ind that a gene
GENETICS AND EVOLUTION
that may be involved in language development is also active in
he work on comparative animal studies and comparative neu- nonlanguage areas. he reason for this is that transcription fac-
roanatomy discussed earlier are being increasingly informed by tors like FOXP2 often act in a combinatorial fashion in conjunc-
the rapidly emerging ield of comparative genomics. Work in tion with other transcription factors in diferent ways in diferent
the ield of evolutionary development (evo-devo) has provided tissues.
substantial support for the idea that gene regulation is key to As another example, suppose that a cell has three regula-
understanding evolution. Among the methods now available to tory proteins and that with each cell division a new regulatory
us to study the evolution of anatomy (including neural circuits) protein becomes expressed in one of the daughter cells, but
and behavior at the genome level are comparative genomics, not the other. Hence, we only need three regulatory proteins
gene-expression proiling, and population genetics analysis. We acting in combination to produce eight diferent cell types.
have already seen an example of the proiling of gene expression Combinatorial control is thought to be widespread as a means
in the left and right cerebral hemispheres using serial analysis of eukaryotic gene regulation. For those familiar with linguistic
of gene expression. Such methods have also been applied to models of language acquisition, it may help to think of parame-
FOXP2. ter settings, whereby three binary parameters can specify eight
As we noted earlier, the FOXP2 gene was discovered to code diferent language types with respect to some structural feature.
for a transcription factor and is therefore involved in regulating he idea, then, is that FOXP2 can work in conjunction with cer-
the expression of other genes. In general, transcription factors tain factors in the lung to repress gene activity in the epithelium,
are highly conserved. In this case, the FOXP2 protein is diferent whereas it might work together with other factors in the brain to
from the chimpanzee and gorilla sequence at two amino acids, regulate genes there directly (or indirectly) involved in speech
and from the orangutan sequence at three amino acids. he and language.
human and mouse protein sequence difer at three amino acids. he second question to ask is are mutations in the coding
he question arises whether these amino acid replacements are sequence known or likely to be pleiotropic [i.e., causing multiple
of functional signiicance, that is, whether they played a crucial efects]? (1164) It is well known that patients with the FOXP2
role in the evolution of language. mutation have multiple defects in speech, orofacial praxis, lan-
A population genetic study of the FOXP2 locus concluded guage, and cognition. he third question to ask is does the locus
that it had been subjected to a selective sweep during the past contain multiple cis-regulatory elements? (ibid.) (Cis-elements
200,000 years, correlating closely with estimated age of Homo are regulatory elements located on the same nucleic acid strand
sapiens. However, Sean B. Carroll (2005) notes that one cannot as the gene they regulate.) Again, since FOXP2 is expressed in
immediately conclude from the fact that the FOXP2 gene has multiple areas in the brain and in other organs, this is a clear
been a target of selection during human evolution that it is the indication that it does. Carroll concludes on this basis that reg-
amino acid replacements just discussed that were the function- ulatory sequence evolution is the more likely mode of evolution
ally important targets. Since the FOXP2 gene is 267 kilobases than coding sequence evolution (ibid.).
in size, we should ind more than 2,000 diferences in DNA Finally, Carroll notes that some of the data from experi-
sequence between chimpanzees and humans (assuming an ments with birdsong learners and nonlearners also support the
average base pair divergence of 1.2%). his means that there are idea of regulatory evolution. When FoxP2 mRNA and protein
many more possibilities for functionally signiicant mutations levels during development were studied, a signiicant increase
in noncoding regulatory areas than in coding regions (the of FoxP2 expression was found in Area X, a center required for
FOXP2 protein is made from coding regions, while the noncod- vocal learning. he increase occurred at a time when vocal learn-
ing regions contain the regulatory information). It is, of course, ing in zebra inches was underway. Moreover, FoxP2 expression
52
Explaining Language
levels in adult canaries varied with the season and correlated (1980, 216). Not only has that proven to be the case, but with the
with changes in birdsong. hese facts suggest regulatory control explosion of knowledge in many areas, including (comparative)
in development and in the adult brain. linguistics, comparative neuroanatomy, evolutionary devel-
opment, comparative genomics, to take just a few examples,
biolinguistics promises to be a fascinating ield for decades to
EVOLUTION AND DYNAMICAL SYSTEMS come.
Another tool used in the study of language evolution is the theory
of dynamical systems (see also self-organizing systems) WORKS CITED AND SUGGESTIONS FOR FURTHER READING
(Nowak 2006). However, applications of dynamical systems to Amunts, K., A. Schleicher, et al. 1999. Brocas region revis-
language include studies not only of evolution of language but ited: Cytoarchitecture and intersubject variability. Journal of
of language change and language acquisition as well (Niyogi Comparative Neurology 412: 31941.
2004, 2006). Martin A. Nowak and Natalia Komarova (2001, Bickerton, Derek. 1996. Language and Human Behavior. Seattle: University
292) assume for the case of language acquisition that Universal of Washington Press.
Grammar (UG) contains two parts: 1) a rule system that gener- Boeckx, Cedric. 2006. Linguistic Minimalism. Oxford: Oxford University
ates a search space of candidate grammars and 2) a [learning] Press.
mechanism to evaluate input sentences and to choose one of Bookheimer, Susan. 2002. Functional MRI of language: New approaches
to understanding the cortical organization of semantic processing.
the candidate grammars that are contained in his [the learners]
Annual Review of Neuroscience 25 (March): 15188.
search space. One of the main questions to be determined is
Braun, Allen. 2003. New indings on cortical anatomy and implications
what is the maximum size of the search space such that a spe- for investigating the evolution of language. he Anatomical Record
ciic learning mechanism will converge (after a number of input Part A: Discoveries in Molecular, Cellular, and Evolutionary Biology
sentences, with a certain probability) to the target grammar. 271A (March): 27375.
he question for language evolution then is what makes a Buxhoeveden, D. P., A. E. Switala, et al. 2001. Lateralization of minicol-
population of speakers converge to a coherent grammatical sys- umns in human planum temporale is absent in nonhuman primate
tem. A homogenous population (all individuals have the same cortex. Brain, Behavior and Evolution 57 (June): 34958.
UG) is assumed. Nowak and Komarova derive a set of equations, Cangelosi, Angelo, and Domenico Parisi, eds. 2001. Simulating the
which they call the language dynamics equations (293), which Evolution of Languge. New York: Springer.
Cantalupo, Claudio, and William D. Hopkins. 2001. Asymmetric Brocas
give the population dynamics of grammar evolution.
area in great apes. Nature 414 (November): 505.
he equations represent the average payof (for mutual under-
Carroll, Sean B. 2005. Evolution at two levels: On genes and form. PLoS
standing) for all those individuals who use a particular grammar
Biology 3 (July): 115966. Available online at: www.plosbiology.org.
and contribute to biological itness (the number of ofspring they Chomsky, Noam. 1959. A review of B. F. Skinners Verbal Behavior.
leave), and include a quantity to measure the accuracy of gram- Language 35.1: 2658.
mar acquisition of the ofspring from their parents. Another . 1980. On the biological basis of language capacities. In Rules
variable denotes the relative abundance of individuals who use and Representations, 185216. New York: Columbia University Press.
a particular grammar. Still another variable denotes the average . 1995. he Minimalist Program. Cambridge, MA: MIT Press.
itness or grammatical coherence of the population, the mea- . 2004. he Generative Enterprise Revisited: Discussions with Riny
sure for successful communication in a population (293). Huybregts, Henk Van Riemsdijk, Naoki Fukui and Mihoko Zushi.
Nowak and Komarova use the language dynamics equations Berlin: Mouton de Gruyter.
. 2005a. Some simple evo-devo theses: How true might they be for
to study the conditions under which UG will result in grammati-
language? Paper presented at the Morris Symposium, SUNY at Stony
cal coherence. A number of factors can be varied in order to run
Brook.
computer simulations: population size, assumptions about UGs
. 2005b. hree factors in language design. Linguistic Inquiry
search space, and assumptions about the learning mechanism 36.1: 122.
(e.g., memoryless or batch learning, etc.). . 2006. Language and Mind. Cambridge: Cambridge University
A similar kind of dynamical systems analysis has been pro- Press.
posed by Partha Niyogi for language change (2004, 58), for what Chomsky, N., and H. Lasnik. 1995. he theory of principles and param-
he calls the emerging ield of population linguistics. Such con- eters. In he Minimalist Program, N. Chomsky, 13127. Cambridge,
cepts as symmetry and stability (stable and unstable equilibria) MA: MIT Press.
are used in the study of the language dynamics equations. Niyogi Christiansen, Morten H., and Simon Kirby, eds. 2003. Language Evolution.
uses symmetric and asymmetric nonlinear dynamical models to New York: Oxford University Press.
Corballis, Michael C. 2002. From Hand to Mouth: he Origins of Language.
study lexical and syntactic change. Nowak and Komarova note
Princeton, NJ: Princeton University Press.
that dynamical systems analysis is compatible with a wide range
Cunningham, Daniel J. 1892. Contribution to the Surface Anatomy of the
of diferent kinds of linguistic analysis and learning theories.
Cerebral Hemispheres [Cunningham Memoirs]. Dublin: Royal Irish
here are a number of other approaches to the study of language Academy of Science.
evolution with dynamical systems and simulation, some of which Deacon, T. W. 1989. he neural circuitry underlying primate calls and
may be found in the suggestions for further reading (Christiansen human language. Human Evolution 4 (October): 367401.
and Kirby 2003; Cangelosi and Parisi 2001; Lyon et al. 2006). Dunbar, Robin. 1998. Grooming, Gossip and the Evolution of Language.
More than 25 years ago, Chomsky observed that the study of Cambridge: Harvard University Press.
the biological basis for human language capacities may prove to Evans, Patrick D., Nitzan Mekel-Bobrov, et al. 2006. Evidence that the
be one of the most exciting frontiers of science in coming years adaptive allele of the brain size gene microcephalin introgressed
53
into Homo sapiens from an archaic Homo lineage. PNAS 103 Kohler, Evelyne, Christian Keysers, et al. 2002. Hearing sounds, under-
(November): 1817883. standing actions: Action representation in mirror neurons. Science
Ferrari, Pier Francesco, Vittorio Gallese, et al. 2003. Mirror neurons 297: 8468.
responding to the observation of ingestive and communicative mouth Lai, Cecilia S. L., Simon E. Fisher, et al. 2001. A forkhead-domain gene
actions in the monkey ventral premotor cortex. European Journal of is mutated in a severe speech and language disorder. Nature 413
Neuroscience 17: 170314. (October): 51923.
Fisher, S. E. 2005. On genes, speech, and language. New England Liberman, Alvin M., and Ignatius G. Mattingly. 1985. he motor theory
Journal of Medicine 353.16: 16557. of speech perception revised. Cognition 21: 136.
Fitch, W. Tecumseh, and Marc D. Hauser. 2004. Computational con- Liberman, Mark. 2004. Hi lo hi lo, Its of to formal language theory we
straints on syntactic processing in a nonhuman primate. Science 303 go. Language Log (January 17). Available online at: http://itre.cis.
(January): 37780. upenn.edu/~myl/languagelog/archives/000355.html.
Fogassi, Leonardo, and Pier Francesco Ferrari. 2007. Mirror neurons Lieberman, Philip. 2006. Toward an Evolutionary Biology of Language.
and the evolution of embodied language. Current Directions in Cambridge: Belknap Press of Harvard University Press.
Psychological Science 16:13641. Lu, Min Min, Shanru Li, et al. 2002. Foxp4: A novel member of the Foxp
Galaburda, Albert M., and Deepak N. Pandya. 1983. he intrinsic archi- subfamily of winged-helix genes co-expressed with Foxp1 and Foxp2
tectonic and connectional organization of the superior temporal in pulmonary and gut tissues. Mechanisms of Development 119,
region of the rhesus monkey. Journal of Comparative Neurology 221 Supplement 1 (December): S197S202.
(December): 16984. Lyon, Caroline, Chrystopher L. Nehaniv, et al., eds. 2006. Emergence of
Gallese, Vittorio, Luciano Fadiga, et al. 1996. Action recognition in the Communication and Language. New York: Springer.
premotor cortex. Brain 119: 593609. Maynard Smith, John, and Ers Szathmry. 1998. he Major Transitions
Gannon, Patrick J., Ralph L. Holloway, et al. 1998. Asymmetry of chim- in Evolution. New York: Oxford University Press.
panzee planum temporale: Humanlike pattern of Wernickes brain McMaster, Geof. 2005. Researchers discover cause of speech defect.
language area homolog. Science 279 (January): 2202. University of Alberta ExpressNews (October 25). Available online
Gentilucci, Maurizio, and Michael C. Corballis. 2006. From manual ges- at: http://www.expressnews.ualberta.ca.
ture to speech: A gradual transition. Neuroscience and Biobehavioral Nishitani, N., and R. Hari. 2000. Temporal dynamics of cortical repre-
Reviews 30: 94960. sentation for action. Proceedings of the National Academy of Sciences
Gentner, Timothy Q., Kimberly M. Fenn, et al. 2006. Recursive syntactic USA 97: 91318.
pattern learning by songbirds. Nature 440 (April): 12047. Niyogi, Partha. 2004. Phase transitions in language evolution. In
Geschwind, D. H., and B. L. Miller. 2001. Molecular approaches to Variation and Universals in Biolinguistics, ed. L. Jenkins, 5774.
cerebral laterality: Development and neurodegeneration. American Amsterdam: Elsevier.
Journal of Medical Genetics 101: 37081. . 2006. he Computational Nature of Language Learning and
Gil-da-Costa, Ricardo, Alex Martin, et al. 2006. Species-speciic calls Evolution. Cambridge, MA: MIT Press.
activate homologs of Brocas and Wernickes areas in the macaque. Noonan, James P., Graham Coop, et al. 2006. Sequencing and analysis of
Nature Neuroscience 9 (July): 106470. Neanderthal genomic DNA. Science 314 (November): 111318.
Green, Richard E., Johannes Krause, et al. 2006. Analysis of one million Nowak, Martin A. 2006. Evolutionary Dynamics: Exploring the Equations
base pairs of Neanderthal DNA. Nature 444 (November): 3306. of Life. Cambridge: Harvard University Press.
Grodzinsky, Yosef, and Katrin Amunts, eds. 2006. Brocas Region. Nowak, Martin A., and Natalia L. Komarova. 2001. Towards an evo-
Oxford: Oxford University Press. lutionary theory of language. Trends in Cognitive Sciences 5 (July):
Haesler, Sebastian, Kazuhiro Wada, et al. 2004. FoxP2 expression in 28895.
avian vocal learners and non-learners. Journal of Neuroscience 24 Pennisi, Elizabeth. 2009. Neanderthal Genomics: Tales of a Prehistoric
(March): 316475. Human Genome. Science 323. 5916: 86671.
Hauser, Marc D., David Barner, et al. 2007. Evolutionary linguistics: A Pettito, Laura-Ann. 2005. How the brain begets language. In
new look at an old landscape. Language Learning and Development Cambridge Companion to Chomsky, ed. J. McGilvray, 84101.
3.2: 10132. Cambridge: Cambridge University Press.
Hauser, Marc D., Noam Chomsky, and W. Tecumseh Fitch. 2002. he Petrides, M., and D. N. Pandya. 1994. Comparative architectonic anal-
faculty of language: What is it, who has it, and how did it evolve? ysis of the human and the macaque frontal cortex. In Handbook of
Science 298 (November): 156979. Neuropsychology, ed. F. Boller and J. Grafman, 1758. Amsterdam:
Holy, Timothy E., and Zhongsheng Guo. 2005. Ultrasonic songs of male Elsevier.
mice. PLoS Biology 3 (December): 217786. Available online at: www. Poeppel, David, and Gregory Hickok. 2004. Towards a new functional
plosbiology.org. anatomy of language. Cognition 92 (May/June): 112.
Iacoboni, Marco, Roger Woods, et al. 1999. Cortical mechanisms of Rizzolatti, Giacomo, and Michael A. Arbib. 1998. Language within our
human imitation. Science 286: 25268. grasp. Trends in Neurosciences 21 (May): 18894.
Jenkins, Lyle. 2000. Biolinguistics: Exploring the Biology of Language. Roll, Patrice, Gabrielle Rudolf, et al. 2006. Srpx2 mutations in disor-
Cambridge: Cambridge University Press. ders of language cortex and cognition. Human Molecular Genetics 15
Jenkins, Lyle, ed. 2004. Variation and Universals in Biolinguistics. (April): 11951207.
Amsterdam: Elsevier. Scharf, Constance, and Sebastian Haesler. 2005. An evolutionary
Kegl, Judy. 2004. Language emergence in a language-ready perspective on FoxP2: Strictly for the birds? Current Opinion in
brain: Acquisition. In Variation and Universals in Biolinguistics, ed. Neurobiology 15 (December): 694703.
L. Jenkins, 195236. Amsterdam: Elsevier. Scharf, C., and F. Nottebohm. 1991. A comparative study of the behav-
Kennedy, D. N., K. M. OCraven, et al. 1999. Structural and functional ioral deicits following lesions of various parts of the zebra inch song
brain asymmetries in human situs inversus totalis. Neurology 53 system: Implications for vocal learning. Journal of Neuroscience 11
(October): 12605. (September): 2896913.
54
Explaining Language
Shapiro, K. A., L. R. Moo, et al. 2006. Cortical signatures of noun and verb Somerville, M. J., C. B. Mervis, et al. 2005. Severe expressive-language
production. Proceedings of the National Academy of Sciences USA 103 delay related to duplication of the Williams-Beuren locus. New
(January): 164449. England Journal of Medicine 353.16: 1694701.
Sherwood, Chet C., Douglas C. Broadfield, et al. 2003. Variability Sun, Tao, Christina Patoine, et al. 2005. Early asymmetry of gene tran-
of Brocas area homologue in African great apes: Implications for scription in embryonic human left and right cerebral cortex. Science
language evolution. he Anatomical Record Part A: Discoveries 308 (June ): 17948.
in Molecular, Cellular, and Evolutionary Biology 271A (March): Sun, Tao, and Christopher A. Walsh. 2006. Molecular approaches to
27685. brain asymmetry and handedness. Nature Reviews Neuroscience 7
Shu, Weiguo, Julie Y. Cho, et al. 2005. Altered ultrasonic vocalization in (August): 65562.
mice with a disruption in the Foxp2 gene. Proceedings of the National Tettamanti, Marco, Giovanni Buccino, et al. 2005. Listening to action-
Academy of Sciences USA 102 (July 5): 96438. related sentences activates fronto-parietal motor circuits. Journal of
Shu, W., H. Yang, et al. 2001. Characterization of a new subfamily of Cognitive Neuroscience 17: 27381.
winged-helix/forkhead (fox) genes that are expressed in the lung and White, Stephanie A., Simon E. Fisher, et al. 2006. Singing mice, songbirds,
act as transcriptional repressors. Journal of Biological Chemistry and more: Models for Foxp2 function and dysfunction in human speech
276.29: 2748897. and language. Journal of Neuroscience 26 (October): 103769.
55
Yet normally, by about three years of age, a child will have
attained the fundamental knowledge of an ininite combinato-
rial multileveled system as well as essential constraints on this
ininite system, no matter what the language, no matter what
6 the country or culture the child is born into, no matter what the
limitless contextual, environmental variations across children,
cultures, and countries. his mystery, and the paradox of con-
strained ininity, drives the ield of language acquisition. How
ACQUISITION OF LANGUAGE is this fundamental knowledge acquired? How is it represented
and efected in the mind and brain?
Barbara Lust
THE FIELD OF LANGUAGE ACQUISITION

In many ways, the study of language acquisition stands at the
center of the language sciences, subsumes all of its areas, and
thus, perhaps, supersedes all in its complexity. It entails the
study of linguistic theory in order that the systems and com-
plexities of the end state of language knowledge be understood,
so that the status of their acquisition can be evaluated. his
INTRODUCTION involves all areas of linguistics, in addition to phonetics: pho-
How does the child, within the irst few years of life, come to nology, morphology, syntax, semantics, and prag-
understand and to produce Hop on Pop or Cat in the Hat matics (cf. Chierchia 1999). It entails language typology
(Dr. Seuss)? because acquisition of any speciic language is possible, and
because this acquisition relects any possible typological varia-
(1) We all play ball tion. Barbara Grimes (2000) proposes that between 6,000 and
Up on a wall 7,000 languages now exist in the end state. he study of language
Fall of the Wall (Dr. Seuss [1963] 199l, 1013) acquisition also entails the study of language typology because
(2) We looked! language acquisition at any state of development represents a
And we saw him! form of manifestation of language knowledge, itself subject to
he Cat in the Hat! typological variation. It entails language change, because
And he said to us (Dr. Seuss [1957] 1985, 6) the course of acquisition of language over time reveals changing
manifestations of language; language change must be related to
How does the child come to know the precise sound varia- language acquisition, in ways still not understood (although see
tions that distinguish hop and pop or wall and ball, the Baron 1977, deGraf 1999, and Crain, Gloro, and hornton 2006
multiple meanings of words like play or in, the reference for initial attempts to relate these areas; see also age groups.).
of pronouns like we or him, the meaning of quantiiers (see It involves psycholinguistics because any particular mani-
quantiication) like all, the functional categories determining festation of language during language acquisition is revealed
noun phrases like the, the idioms like play ball, the system- through the cognitive and psychological infrastructure of the
atic order and structure necessary for even simple sentences, mind and brain during hearing and speaking in real time,
the ininite recursion of sentences allowed by the coordinate potentially within a visual modality as in sign languages,
connective and, the ininite possibilities for propositions based thus always involving language processing integrated with lan-
on the manipulation of even a few sound units and word units, guage knowledge. It involves neurolinguistics, not only because
and the ininite possibilities for meaning and truth, which the knowledge and use of language must in some way relate
Dr. Seuss continually demonstrated? to its biological and neural instantiation in the brain, but also
Even more stunning, how does the child both attain the ininite because explanations of the gradual course of language acqui-
set of possibilities and, at the same time, know the ininite set of sition over time must critically evaluate the role of biological
what is impossible, given this ininitely creative combinatorial sys- change. (See brain and language and biolinguistics.)
tem? Just as there is no limit to what is possible, there is no limit to At the same time, the study of language acquisition also stands
what is impossible in language, for example, (3), or an ininite set at the center of Cognitive Science. It cannot be divorced from other
of other impossible word and structure combinations, for example, closely related disciplines within Cognitive Science. he relation
(4). his provides an essential paradox for language learners: How between language and thought is never dissociated in natural
do they both acquire the ininite possibilities and, at the same time, language, thus crucially calling for an understanding of cognitive
the constraints on the ininite set of impossible expressions? psychology and philosophy; and the formal computation involved
(3) *Ylay in grammatical systems is not fruitfully divorced from computer
*lla science. Any use of language involves possibilities for vagueness
and ambiguities in interpretation, given the particular prag-
(4) *Play all we ball matic context of any language use. For example, how would one
*in cat the hat fully understand the meaning of an ostensibly simple sentence like
56
Acquisition of Language
Shes going to leave without understanding the particular con- varying positions in the ield on these focal issues, then exem-
text in which this sentence is used, and there is an ininite set of plify a range of research results, displaying crucial new discover-
these contexts possible. he challenge of developing an ininitely ies in the ield. Ill conclude by formulating leading questions for
productive but constrained language system embedded in the the future. For those interested in pursuing these topics further, I
ininite possibilities for pragmatic determination of interpretation close with selected references for future inquiry in the ield.
remains a central challenge distinguishing machine language and
the language of the human species. All areas of cognitive science
Focal Tensions
must be invoked to study this area. Finally, especially with regard to
language acquisition in the child, developmental psychology must Classically, approaches to the study of language acquisition have
be consulted critically. he child is a biologically and cognitively been categorized as nativist or empiricist (see innateness and
changing organism. Indeed, an understanding of commonalities innatism). hese approaches, which correspond generally to
and/or diferences between language acquisition in child and adult rationalist or empiricist approaches to epistemology (see Lust
requires expertise in the ield of developmental psychology. 2006 for a review), have been typically associated with claims
In addition, the fact that the human species, either child or that essential faculties responsible for language acquisition in
adult, is capable not only of single language acquisition but also the human species involve either innate capabilities for language
of multilanguage acquisition, either simultaneously or sequen- or not. However, now the debates have become more restricted,
tially, exponentially increases the complexity of the area (see allowing more precise scientiic inquiry. No current proposals, to
bilingualism and multilingualism). he fact that these our knowledge, suggest that nothing at all is innate. (See Elman
languages may vary in their parametric values (see parame- et al, 1996 for example). No one proposes that every aspect of
ters) and be either oral or visual further complicates the area. language knowledge is innate. For example, in the now-classic
Piaget-Chomsky Debate (see Piatelli-Palmarini 1980), Chomsky
did not deny that experience was necessary for a comprehensive
THEORETICAL FOUNDATIONS
theory of language acquisition. On the other hand, Jean Piaget
Not surprisingly, then, the ield of research involving the study of proposed an essentially biological model of cognitive develop-
language acquisition is characterized by all the complexity and ment and an antiempiricist theory, coherent with Chomskys
variation in theoretical positions that characterize the ield of lin- position in this way.
guistics and the language sciences in general. he area of language Rather, the current issue that is most central to the ield of
acquisition research relects varying approaches to the study of language acquisition now is whether what is innate regarding
grammar, that is, variations regarding viewpoints on what consti- language acquisition involves components speciic to linguistic
tutes the end state of language knowledge that must be acquired, knowledge, for example, speciic linguistic principles and
for example, generative or functionalist approaches. At parameters, or whether more general cognitive knowledge or
the same time, it is characterized by variation in approach to the learning principles themselves potentially innate can account
study of language acquisition, in particular, ranging from vari- for this knowledge. his issue corresponds to the question of
ous forms of logical to empirical analyses, and including disputes whether organization of the mental representation of language
regarding methodological foundations and varying attempts knowledge is modular or not. Proponents of the view that spe-
at explanatory theories (ranging from rationalist to empiricist ciically linguistic factors are critical to an explanation of human
types). (Lust 2006, Chapter 4, provides a review.) language acquisition generally work within a linguistic theory of a
he ield is led by a strong theory of what is necessary logi- modular language faculty, that is, a theory of universal gram-
cally for a strong explanatory theory of language acquisition in mar (UG), which is proposed to characterize the initial state
the form of Noam Chomskys early proposal for a language of the human organism, that is, the state of the human organ-
acquisition device (LAD). his model not only involved ism before experience. his current descendant of Chomskys
explication of what needed to be explained but also spawned LAD provides speciic hypotheses (working within a genera-
decades of research either for or against various proposals bear- tive grammar framework) regarding the identity of linguistic
ing on components of this model. Yet, today, no comprehensive principles and parameters that may be innately, or biologically,
theory of language acquisition exists, that is, no theory that would determined in the human species (e.g., Anderson and Lightfoot
fully account for all logically necessary aspects of a LAD. 2002). Proponents of the alternative view generally assume a
At the same time, decades of research have now produced an functionalist theory and a model of cultural learning, that is,
explosion of new discoveries regarding the nature of language a model where culture in some ways provides the structure of
acquisition. hese empirical discoveries, combined with theo- language, potentially without speciic linguistic printiples, with
retical advances in linguistic theory and with the development only general learning principles (e.g., Van Valin 1991; Tomasello
of the interdisciplinary ield of cognitive science, bring us today 2003; Tomasello, Kruger, and Ratner 1993).
to the verge of such a comprehensive theory, one with irm sci-
entiic foundations.
Positions in the Field
Debates in the ield of language acquisition continue today
STRUCTURE OF THIS OVERVIEW
around issues of innateness and modularity (e.g., Pinker, 1994;
In this overview, I irst briely characterize and distill focal ten- Tomasello 1995; Edelman 2008). Research in the ield is often
sions in the ield of language acquisition and briely survey polarized according to one paradigm or the other. Several
57
speciic issues focalize these debates. Two of the most pressing of a proposal for grammatical mapping in Lust 1999, 2006 and
currently concern the nature of development, and the nature of Santelmann et al. 2002.) he task of language acquisition goes
the childs use of language input. beyond the periphery of grammatical knowledge and beyond
Critically, all approaches must confront the empirical fact mere triggering of parameters; it lies in constructive synthesis of
that childrens language production and comprehension develop a speciic language grammatical system, which is constrained by
over time. Language acquisition is not instantaneous. he ques- UG but not fully determined by it.
tion is whether childrens language knowledge also develops (iv) Some propose that UG is irrelevant to language acquisi-
over time, and if it does, does it change qualitatively, and if it does tion; the process can be described by an alternative mechanism,
change qualitatively, what causes the change. his question of critically involving some form of usage-based learning (e.g.,
development is parallel to questions regarding cognitive devel- Elman et al. 1996; Tomasello 2003, 2005): In general, the only
opment in general, where stage theories are debated against fully adequate accounts of language acquisition are those that give
continuity theories. Positions in the ield today vary in several a prominent role to childrens comprehension of communicative
ways regarding their understanding of the nature of develop- function in everything from words to grammatical morphemes to
ment in language acquisition. Even within researchers working complex syntactic constructions (Tomasello 2005, 183).
within a rationalist paradigm that pursues universal grammar as Within the proposals of iiii which all work within a generative
a speciically linguistic theory of a human language faculty, there UG framework, (i) and (iii) propose a strong continuity hypothesis
is disagreement. he theory of UG does not a priori include an of language development, although they difer crucially in what
obvious developmental component. Positions in the ield can be is claimed to be continuous. In (iii), it is only the essential set of
summarized as in iiv. principles and parameters of UG constituting the initial state
(i) Some propose that essential language knowledge is fun- that is proposed to be biologically programmed and to remain
damentally innate, and do not centrally address the issue of constant over development, while in (i), it is comprehensive
what change in language knowledge may occur during language grammatical knowledge, with no distinction made between UG
development (e.g., Crain 1991). In this view, apparent cases of and SLG. While a maturational approach such as in (ii) would
language knowledge change in a young childs production or appear to maintain the premise of biological programming of
comprehension are often mainly attributed to methodological language knowledge, and thus be consistent with a theory of
failures, for example, involving the researchers choice of spe- UG, it raises several critical theoretical and empirical issues that
ciic tasks for testing childrens knowledge. he view of language are still unresolved. For example, theoretically, the question
development is characterized by the following: On the account arises: What explains the change from one UG state to another, if
we envision, childrens linguistic experience drives children this determination is not programmed in UG itself? Empirically,
through an innately speciied space of grammars, until they hit in each area where a maturational account has been proposed,
upon one that is suiciently like those of adult speakers around further advanced research has often revealed grammatical com-
them, with the result that further data no longer prompts further petence that was thought to be missing in early development,
language change (Crain and Pietroski 2002, 182). (Compare for example, early knowledge of functional categories. (See Lust
LAD and a recent proposal by Yang 2006.) On this view, it would 1999 and Wexler 1999 for debate.)
appear that grammars are predetermined and available for All proposals must now confront the fundamental develop-
choice. Presumably, speciic lexicons, as well as other periph- mental issues: what actually changes during language devel-
eral aspects of language knowledge, would be exempt from this opment and why. hose within the generative paradigm must
claim of predetermined knowledge. he range and nature of the sharpen their vision of what in fact constitutes UG, and those
innately speciied space of grammars would have to be expli- outside of it must sharpen their view of how ininitely creative but
cated, as this framework develops. ininitely constrained grammatical knowledge can be attained
(ii) Some propose that UG itself develops over time in a man- on the basis of communicative function alone. All proposals
ner that is biologically driven, that is, through maturation. Major must be accountable to the wide and cumulative array of empir-
aspects of language development, for example late acquisition of ical data now available. (See, for example, Discoveries in the
the passive construction (such as he ball was thrown by Mary next section.)
in English), are attributed to biologically determined changes in Researchers are also opposed in their view of how the child
UG (e.g., Wexler 1999; Radford 1990). his maturation is proposed uses input data. (See Valian 1999 for a review.) All approaches
to be the major explanation of language development. Speciic must confront this fundamental area, whether the UG frame-
language grammars (SLGs) are not, for the most part, treated as work is involved (e.g., in explaining how the environment trig-
essential independent components of the language acquisition gers parameter setting) or whether an empiricist framework is
challenge, but often as simply triggered by UG parameters. Any involved (where the mechanism of imitation of individual words
learning that would be involved for the child in language devel- and/or utterances may be viewed as an essential mechanism of
opment would involve merely peripheral aspects of language, data processing and grammar development for the child, e.g.,
with the nature of the periphery yet to be speciied. Tomasello 1995, 2002, 2005). Indeed, the very nature of imitation
(iii) Some propose that UG is continuous, but UG is inter- of language stimuli is under investigation (e.g., Lust, Flynn, and
preted as involving just those principles and parameters that con- Foley 1996). In each case, the mechanism proposed must reli-
strain and guide the language acquisition process. SLG develops ably account for the childs mapping from these input data to the
over time through experience with data from a speciic language speciic knowledge of the adult state, thus solving what has been
and through the childs active construction of it. (See examples called the logical problem of language acquisition and it must
58
also be empirically veriied, that is, be a veridical account of how languages, the proportion of labials in both late babbling and
the child actually does use surrounding data. hose approach- irst words will relect the input of labials in the adult language
ing the logical problem of language acquisition are formally being acquired (de Boysson-Bardies, Vihman, and Vihman
diagnosing the properties of syntactic and semantic knowledge 1991).
that must be acquired, and assessing varied formal learnability In fact, children are picking out words as formal elements from
theories that may possibly account for the childs mapping from the speech stream even before they understand them. For exam-
actual input data to this knowledge. (e.g., Lightfoot 1989). (See ple, 8-month-olds who were exposed to stories read to them for
also projection principle.) 10 days (30 minutes of prerecorded speech, including three short
stories for children) during a two-week period, two weeks later
distinguished lists of words that had appeared in these stories,
Discoveries
for example, (5a), from those which had not, for example, (5b)
Fortunately, not only have the last decades seen continuous (Jusczyk and Hohne 1997).
theoretical developments in areas of linguistics regarding the
(5) a. sneeze
nature of language knowledge, and continuous sharpening
elephant
of the debate on the nature of language development, but the
python
ield of language acquisition has also produced a wealth of new
peccaries
empirical discoveries, all of which promise to inform the crucial
b. aches
questions and debates in the ield, and to eventuate in a more
apricot
comprehensive theory of language acquisition than has yet been
sloth
available. Ill exemplify only some of these here, all of which are
burp
foci of intense current research. (See Lust 2006 for a more com-
prehensive review.) Given the age of the children and the nature of the words
tested, it is clear that children are engaged both in word segmen-
THE FIRST 12 MONTHS. Informed by developed methodologies tation and in long-term storage of these formal elements, even
for investigating infant perception of language, we now know without their semantic content. Moreover, like the acquisition of
that neonates show early sensitivities to language variations phonology, childrens sensitivities to the language-speciic struc-
and categorize these variations (e.g., Ramus et al. 2000; Mehler ture of words begins to show reinement after the sixth month.
et al. 1996, 1988). For example, given varying sound stimuli from American 9-month-olds, though not 6-month-olds, listened
Japanese and varying sound stimuli from English speech, new- longer to English words than to Dutch words, while Dutch
born infants distinguish variation from Japanese to English sig- 9-month-old infants showed the reverse preference (Jusczyk
niicantly more than variation within either English or Japanese, et al. 1993).
including speaker variation. In this sense, the infant is seen to Simultaneously, and in parallel, in the area of syntax, infants
be categorizing variation across languages. (See infantile have also been found to be laying the formal foundations for
responses to language.) More speciically, we also know language knowledge even within the irst 12 months. hey are
that formal aspects of language (phonetic, phonological, and carving out the formal elements that will form the basis for the
syntactic) begin to develop at birth, even before language com- syntax of the language they are acquiring. Precise sensitivities
prehension, and provide the foundations for the appearance of to linear order, as well as to constituent structure, have
overt language production and comprehension in the child at been discovered in these irst few months of life. For example,
about 12 months of age. Now we also know something of how infants as young as 4 months of age have been found to dis-
that development proceeds even during the irst 12 months of tinguish natural well-formed clause structure, like (6a) from
life. non-natural ones, like (6b), in stories read to them (where /
In the area of acquisition of phonology, we have discovered represents clause breaks through pauses, and the stimuli are
the very ine, precise, and extensive phonetic sensitivities of the matched in semantic and lexical content) (Hirsh-Pasek et al.
newborn, sensitivities that appear to be exactly the right ones 1987).
to underlie all potential cross-linguistic phonological contrasts,
(6) a. Cinderella lived in a great big house/ but it was sort of dark/
for example, contrasts in voicing or place and manner features
because she had this mean, mean, mean stepmother
that characterize speech sounds. Once again, these sensitivities
b. in a great big house, but it was/ sort of dark because she
are categorical (as in categorical perception; see speech per-
had/ this mean
ception in infants). We know that by about 6 months, these
sensitivities reveal appropriate cross-linguistic selection and Experimental research has begun to reveal how infants
modulation, and by 12 months, this process is nicely attuned to accomplish this discrimination in the speech stream, suggest-
the childs irst productive words of their speciic language (e.g., ing that the mapping of prosodic phrasing to linguistic units may
Werker 1994; see Jusczyk 1997 and de Boysson-Bardies 1999 for play an important role.
reviews). For example, while the 12-month-old infant acquir- Again, sensitivities become more language speciic after the
ing Hindi will have maintained sensitivity to Hindi contrasts in sixth month. Although 6-month-olds did not distinguish natu-
aspiration, the infant acquiring English will show diminished ral (e.g., 7a) and non-natural phrasal (verb phrase) structures
response to such distinctions, which are not linguistically con- (e.g., 7b), that is, phrasal constituents smaller than the clause,
trastive in English. Although labials appear in irst words across 9-month-olds did:
59
(7) a. he little boy at the piano/ is having a birthday party category sensitivities appear to be available even within the irst
b. he little boy at the piano is having/ a birthday party 12 months (e.g., Shady 1996; Demuth 1994).
Contrary to the widespread view that the contentful lexicon
In all these cases, both phonological and syntactic develop-
(involving nouns and verbs) is the privileged basis for acquisition
ment does not reduce either to simple loss of initial sensitivities
of syntax in early language acquisition, these results are begin-
or to simple accrual or addition of new ones, but a gradual inte-
ning to suggest that, in fact, functional categories are fundamen-
gration of a speciic language grammatical system.
tal (see lexical learning hypothesis).
More recently, research has begun to reveal even more pre-
cisely how these early sensitivities are related to later language
PRINCIPLES AND PARAMETERS. Principles and parameters that
development, thus foreshadowing a truly comprehensive theory
are hypothesized to provide the linguistic content of UG and
of language acquisition (Newman et al. 2006; Kuhl et al. 2005).
of the human language faculty provide leading hypotheses for
language acquisition research. (See principles and param-
BEYOND FIRST WORDS: LINGUISTIC STRUCTURE. Continuous with
eters theory and language acquisition.) Investigators
advances in our understanding of early infant development, we
continue to search not only for theoretical motivation for such
are now also seeing a potential revolution in our understand-
principles and parameters but also for empirical evidence of the
ing of the early stages of overt language acquisition, that is,
role of UG-based principles and parameters in early language
those periods within the irst three years of life, where the child
acquisition. A wide range of empirical research has now accrued
is beginning to overtly manifest language knowledge in terms
in this paradigm, paralleling theoretical developments (e.g.,
of language production and comprehension. Child language in
Snyder 2007; Crain and Lillo-Martin 1999; Guasti 2002; Roeper
these early periods has traditionally been referred to as holo-
2007; Lust 2006, among others). his research reveals a wide
phrastic or, a little later, telegraphic (see two-word stage)
array of evidence regarding very fundamental linguistic prin-
in nature. Even in these early periods, numerous studies are now
ciples, including the central and perhaps most fundamental UG
revealing childrens very early sensitivity to functional categories
principle of structure dependence:
(FCs) in language, that is, to grammatical elements that function
formally to a large degree, often with little or no semantic con- he rules operate on expressions that are assigned a certain struc-
tent. hese functional categories play a critical role in deining ture in terms of a hierarchy of phrases of various types. (Chomsky
constituent structure in language, and in deining the locus of 1988, 45)
syntactic operations. hus, the evidence that infants and tod-
Evidence for this fundamental linguistic principle has been
dlers are accessing these FCs in their early language knowledge
adduced in studies of childrens acquisition of several areas,
begins to provide crucial data on the foundations for linguistic
including various types of question formation (e.g., Crain and
systems in the child. (Such FC are relected in diferent ways
McKee 1985; Crain and Nakayama 1987; deVilliers, Roeper, and
across languages. In English they are relected in determiners
Vainikka 1990), empty category and pronoun interpretation (e.g.,
such as the, auxiliary verbs like do, complementizers like
Cohen Sherman and Lust 1993; Nuez del Prado, Foley, and Lust
that, or inlection of verbs, for example.)
1993; Lust and Cliford 1986), and quantiier scope (e.g., Chien
Early research had revealed that young children perceive and
1994).
consult functional categories, such as determiners, even when
Results from young children at early stages of development
they are not overtly producing them regularly in their natural
across languages have shown that they very early distinguish
speech. For example, L. Gerken, B. Landau, and R. Remez (1990)
coordinate and subordinate structures and that they diferenti-
showed that 2-year-olds recognize the distinction between
ate syntactic processes in these diferent structures accordingly.
grammatical and ungrammatical function words, contrasting
For example, in English, they very early distinguish sentences like
these, for example (8a) and (8b), in their elicited imitation of
(10a) and (10b) in both comprehension and production (Cohen
these sentences. Gerken and Bonnie McIntosh (1993) showed
Sherman and Lust 1993):
that 2-year-olds used this knowledge in a picture identiication
task, discriminating between (9a) and (9b), where grammatical (10) a. [he turtle tickles the skunk] and [0 bumps the car]
forms facilitated semantic reference. b. Tom [promises/tells Billy [0 to eat the ice cream cone]]
(8) a. Pete pushes the dog Chinese children diferentiate coordinate and embedded
b. Pete pusho na dog structures and diferentiate subjects and topics (see topic
and comment) in Chinese accordingly (Chien and Lust 1985).
(9) a. Find the bird for me
Across English, Japanese, and Sinhala, children diferentiate pos-
b. Find was bird for me
sibilities for anaphora according to the embedded or adjoined
More recently, a wide range of experimental researchers work- structure in which proforms appear (Lust and Cliford 1986;
ing with expanded infant testing methods have replicated these Oshima and Lust 1997; Gair et al. 1998; Eisele and Lust 1996).
results, and also revealed similar functional category distinctions In general, very early linguistic knowledge including knowl-
even in younger children. For example Yarden Kedar, Marianella edge of language involving diminished content, where direct
Casasola, and Barbara Lust (2006) showed that infants as young overt phonetic information is not available is attested in stud-
as 18 months also distinguish sentences like (9a) and (9b) in a ies of childrens acquisition of sentences with ellipses. For exam-
preferential looking task, and again, object reference is facili- ple, in sentences such as (11), young children have been found
tated by the grammatical form. Precursors to these functional to reveal not only competence for empty category interpretation
60
learning by hanging on to some aspect of the input that it can

(in the does too clause, where the second clause does not access. Research is beginning to provide evidence on how and
state what Bert did), but also competence for construction of the when these forms of bootstrapping may work in the young child
multiple interpretations allowed in this ambiguous structure (as (e.g, work of Gleitman 1990 and others on syntactic bootstrap-
in 11ad), and constraint against the ungrammatical possibili- ping and its role in early lexical learning and the the collection of
ties (as in 11 ei). (See Foley et al. 2003 for an example.) In other papers in Morgan and Demuth 1996). Precise hypotheses are now
words, they evidence early control of and constraints on empty being formed regarding the mechanisms by which certain param-
category interpretation and on other forms of ellipsis. Studies eters may be set very early by the infant, even before the irst word,
of Chinese acquisition show similar results (Guo et al. 1996). by consulting prosodic and other aspects of the speech stream
Without structure dependence and grammatical computation (Mazuka 1996).
over abstract structure underlying such sentences (e.g., recon-
structing the verb phrase [VP] in the second clause), children CROSS-SPECIES COMPARATIVE METHOD. Advances in cross-
would not be expected to evidence this competence (see Foley species comparisons now provide an additional dimension to the
et al. 2003). All interpretations are pragmatically possible. study of language acquisition, allowing reinement of our speci-
(11) Oscar bites his banana and Bert does too. ication of what is particularly human and of what is particularly
linguistic about human acquisition of language (e.g., Hauser,
Possible Interpretations:
Chomsky, and Fitch 2002; Call and Tomasello 2007; Ramus et al.
a. O bites Os banana and B bites Bs banana.
2000). For example, comparative studies with cotton-top tamarin
b. O bites Os banana and B bites Os banana.
monkeys revealed certain capabilities in this species to discrimi-
c. O bites Bs banana and B bites Bs banana.
nate language stimuli (e.g., Dutch and Japanese) that were com-
d. O bites Es banana and B bites Es banana.
parable to human infants (Ramus et al. 2000). his implied that
Impossible Interpretations a general auditory process was involved in the discrimination. In
*e. O bites Os banana and B bites Es banana. contrast, other processes discovered in early acquisition of phonol-
*f. O bites Bs banana and B bites Os banana. ogy have been found not to generalize. (See, for example, the work
*g. O bites Bs banana and B bites Es banana. of Kuhl et al. 2005.)
*h. O bites Es banana and B bites Os banana.
*i. O bites Es banana and B bites Bs banana. RESILIENCE. Finally, the tremendous resilience of the language
acquisition feat in the face of varying input, including dearth of
As is the case in this study of VP ellipsis acquisition, empiri-
input, has been revealed through important work on young deaf
cal research results in the Principles and Parameters framework
childrens spontaneously created home sign (Goldin-Meadow
mutually inform theoretical development; they contribute to and
2003). he role of community in sign language creation is also
help to resolve theoretical debates on the representation of this
revealed through in-depth studies of the creation of Nicaraguan
area of linguistic knowledge.
Sign Language in young children (Senghas 1995; Kegl, Senghas,
Evidence for early parameter setting with regard to linear
and Coppola 1999).
order in natural language is now provided by a wide range of
cross-linguistic studies showing very early language-speciic
sensitivity to the clausal head direction and directionality of Toward the Future
RECURSION in the language being acquired (e.g., Lust, in prep- FIRST AND SECOND LANGUAGE IN CHILD AND ADULT. Current and
aration; Lust and Chien 1984). Very early diferentiation of the future studies that seek to triangulate the language faculty by
pro drop (i.e., argument omission wherein subjects may not be precise comparative studies between child irst language acquisi-
overtly expressed) possibilities of a language have been attested tion and adult second language and between child monolingual
across languages (e.g., Italian and English, Spanish and English) irst language acquisition and multilanguage acquisition promise
(Valian 1991; Austin et al. 1997, after Hyams 1986). In fact, chil- to be able to dissociate factors related to biological maturation,
dren have been found to critically consult the complementizer universal grammar, and speciic language grammar in ways not
phrase, and the subordinate structure domain in order to make achievable by studies of irst language acquisition alone (e.g.,
this diferentiation (Nuez del Prado, Foley, and Lust 1993). Flynn and Martohardjono 1994; Yang and Lust 2005).
FROM DATA TO GRAMMAR. Our understanding of how the infant, PRAGMATIC AND COGNITIVE DEVELOPMENT AND GRAMMATICAL
from birth, consults the surrounding speech stream is now also DEVELOPMENT. Studies of the integration of childrens developing
expanding quickly. Research has revealed very ine sensitivities pragmatic knowledge with their grammatical knowledge have only
to particular aspects of the speech stream (e.g., Safran, Aslin, begun; yet this integration characterizes every aspect of a childs
and Newport 1996), and research has begun to isolate the pre- use of language (cf. Clark 2003). Such studies may critically inform
cise role of particular cues in this process, for example, STRESS, our understanding of childrens early syntax development (e.g.,
phonotactic constraints, and statistical distributions (e.g, Johnson Blume 2002). Similarly, studies involving interactions between
and Jusczyk 2001). Various forms of bootstrapping may be avail- general cognition and speciic aspects of linguistic knowledge (e.g.,
able to the child (phonological or prosodic or syntactic bootstrap- Gentner and Goldin-Meadow 2003) will be critical for resolving
ping, for example). Here, bootstrapping generally refers to that issues of modularity in language acquisition. (See constraints
process by which the child, in the initial state, might initiate new in language acquisition.)
61
A NEUROSCIENCE PERSPECTIVE AND BEYOND. Current advances line more fully with studies of the language acquisition process.
in brain imaging methods (such as fMRI, and neurophysiologi- (See Baker 2005 for an example of an argument for representa-
cal measures, such as EEG [electroencephalography] and MEG tion of this integration.) Many current disputes regarding the
[magnetoencephalography]) and in genome mapping (Greally fundamentals of language acquisition cannot be resolved until
2007) promise new discoveries regarding fundamental issues disputes regarding the nature of the adult state of language
still open in the ield of language acquisition: How are language knowledge are further resolved, and until the ield of Linguistics
knowledge and language acquisition represented in the brain? and Psychology are more fully integrated in the ield of Cognitive
(See neuroimaging and genes and language). More pre- Science.
cisely, what is the content of the initial state and how is it biologi-
cally represented? How is development in language knowledge
either determined by biological changes or a cause of them? WORKS CITED AND SUGGESTIONS FOR FURTHER READING
here have been many neuroscientiic results regarding language Aitchison, Jean. 2007. he Articulate Mammal. 5th ed. London: Routledge.
in the adult state (e.g., Friederici 2002; Binder and Price 2001; Almi, C. R., M. J. Rivkin, R. C. McKinstry, and Brain Development
see syntax, neurobiology of; phonetics and phonology, Cooperative Group. 2007. he NIH MRI study of normal brain devel-
neurobiology of; and semantics, neurobiology of, for opment (Objective-2): Newborns, infants, toddlers, and preschoolers.
example), including multilingual language (e.g. Paradis 2000; see NeuroImage 35: 30825.
bilingualism, neurobiology of). Research on brain devel- Anderson, Stephen R., and David W. Lightfoot. 2002. he Language
opment has also begun to yield fundamental descriptive results Organ. Cambridge: Cambridge University Press.
Austin, Jennifer, Maria Blume, David Parkinson, Zelmira Nuez del
(e.g., Casey, Galvan and Hare 2005). Nevertheless, it appears that
Prado, Rayna Proman, and Barbara Lust. 1997. he status of pro-
the issue of the language-brain relationship during early develop-
drop in the initial state: Results from new analyses of Spanish. In
ment is a terra incognita. he issue of how brain development
Contemporary Perspectives on the Acquisition of Spanish, Vol 1:
and cognitive development in the area of language development Developing Grammars, ed. Anna Perez-Leroux and W. Glass, 3551.
co-occur in early development and over a lifetime will be one of Boston: Cascadilla Press.
the key issues in the coming decades of the third millennium Baker, Mark C. 2005. Mapping the terrain of language learning.
(Friederici 2000, 66). Advances in the description of brain devel- Language Learning and Development 1.1: 93129.
opment (e.g., Almi et al. 2007) have not yet been related precisely Baron, Naomi. 1977. Language Acquisition and Historical Change.
to advances in language acquisition. Amsterdam: North Holland.
Not only will advances in this area require technology appli- Binder, Jefrey R., and Cathy J. Price. 2001. Functional neuroimaging of
cable to a young and changing brain but also advances in the language. In Handbook of Functional Neuroimaging of Cognition, ed.
Robert Cabeza and Alan Kingstone, 187251. Cambridge, MA: MIT
interdisciplinary ield of cognitive science. hey will depend
Press.
on the development of strong linguistic theory, but also on the
Blake, Joanna. 2000. Routes to Child Language: Evolutionary and
development of a theory that maps the elements of linguistic the- Developmental Precursors. Cambridge: Cambridge University Press.
ory to the activation of language in the brain. At present, there is Blume, Maria. 2002. Discourse-morphosyntax in Spanish non-inite
still no theory of how an NP relates to a neuron (Marshall 1980) verbs: A comparison between adult and child grammars. Unpublished
or how an abstract linguistic principle like structure dependence Ph.D. diss., Cornell University.
or a particular linguistic constraint could be represented at a Boysson-Bardies, Bndicte de. 1999. How Language Comes to
neural level. Most imaging results to date provide evidence on Children: From Birth to Two Years. Cambridge, MA: MIT Press.
the processing of language, rather than on the knowledge of, or Boysson-Bardies, Bndicte de, B. Vihman, and M. M. Vihman. 1991.
representations of, language per se. Adaptation to language: Evidence from babbling and irst words in
four languages. Language 67.2: 297319.
Call, Josep, and Michael Tomasello. 2007. he Gestural Communication
METHODOLOGY. Advances in this area will also require advance-
of Apes and Monkeys. Mahwah, NJ: Lawrence Erlbaum.
ment on issues of methodology: Simply put, we need to under-
Casey, B. J., Adriana Galvan, and Todd Hare. 2005. Changes in cere-
stand, precisely, how our behavioral methodologies relect on, bral functional organization during cognitive development. Current
and interact with, the language processes we are attempting to Opinion in Neurobiology 15: 23944.
study and deine (Swinney 2000, 241). Advancement in this Chien, Yu-Chin. 1994. Structural determinants of quantiier scope: An
area is a prerequisite to advancement in our understanding of experimental study of Chinese irst language acquisition. In Syntactic
language processing, language knowledge, and their interaction heory and First Language Acquisition: Cross-Linguistic Perspectives.
in language acquisition, as well as in the interpretation of brain Vol 2: Binding, Dependencies, and Learnability. Ed. B. Lust, G. Hermon,
imaging results. Many persistent sources of disagreement in and J. Kornilt, 391416. Mahwah, NJ: Lawrence Erlbaum.
the ield of language acquisition depend on resolution of issues Chien, Yu-Chin, and Barbara Lust. 1985. he concepts of topic and
related to methodology (e.g., Crain and Wexler 1999; Lust et al. subject in irst language acquisition of Mandarin Chinese. Child
Development 56: 135975.
1999), including those surrounding new infant-based methods
Chierchia, Gennaro. 1999. Linguistics and language. In he MIT
related to preferential looking.
Encyclopedia of the Cognitive Sciences, ed. R. Wilson and F. Keil, xci
cix. Cambridge, MA: MIT Press.
LINGUISTIC THEORY AND LANGUAGE ACQUISITION. Finally, the Chomsky, Noam. 1988. Language and Problems of Knowledge.
study of language acquisition will be advanced when linguistic Cambridge, MA: MIT Press.
studies of the knowledge that must underlie the adult state of Clark, Eve V. 2003. First Language Acquisition. Cambridge: Cambridge
language knowledge, no matter what language, are brought into University Press.
62
Cohen Sherman, Janet, and Barbara Lust. 1993. Children are in control. Gleitman, Lila. 1990. he structural sources of verb meanings. Language
Cognition 46: 151. Acquisition 1.1: 355.
Crain, Stephen. 1991. Language acquisition in the absence of experi- Goldin-Meadow, Susan. 2003. he Resilience of Language: What Gesture
ence. Behavioral and Brain Sciences. 14.4: 597650. Creation in Deaf Children Can Tell Us About How All Children Learn
Crain, Stephen., Takuya Gloro, and Rosalind hornton. 2006. Language Language. New York: Psychology Press.
acquisition is language change. Journal of Psycholinguistic Research . 2005. What language creation in the manual modality tells us
35: 3149. about the foundations of language. Linguistic Review 22: 199226.
Crain, Stephen, and Diane Lillo-Martin. 1999. An Introduction to Linguistic Greally, John M. 2007. Encyclopaedia of humble DNA. Nature
heory and Language Acquisition. Malden, MA: Basil Blackwell. 447: 7823.
Crain, Stephen, and Cecile McKee. 1985. Acquisition of structural restric- Grimes, Barbara, ed. 2000. Ethonologue: Languages of the World.
tions on anaphors. In Proceedings of the Sixteenth Annual Meeting Dallas: SIL International.
of the North Eastern Linguistics Society, ed. S. Berman, J. Chloe, and Guasti, Maria Teresa. 2002. Language Acquisition: he Growth of
J. McDonough, 94110. Montreal: McGill University. Grammar. Cambridge, MA: MIT Press.
Crain, Stephen, and Mineharu Nakayama. 1987. Structure dependence Guo, F. F., C. Foley, Y.-C. Chien, B. Lust, and C.-P. Chiang. 1996.
in grammar formation. Language 63: 52243. Operatorvariable binding in the initial state: A cross-linguistic
Crain, Stephen, and Paul Pietroski. 2002. Why language acquisition is a study of VP ellipsis structures in Chinese and English. In Cahiers de
snap. Linguistic Review 19: 16384. Linguistique Asie Orientale 25.1: 334.
Crain, Stephen, and Kenneth Wexler. 1999. Methodology in the study Hauser, Marc, Noam Chomsky, and Tecumseh Fitch. 2002. he fac-
of language acquisition: A modular approach. In Handbook of First ulty of language: What is it, who has it, and how did it evolve. Science
Language Acquisition, ed. William C. Ritchie and Tej K. Bhatia, 387 298: 156979.
426. San Diego, CA: Academic Press. Hirsh-Pasek, Kathy, Diane Kemler-Nelson, Peter Jusczyk, K. Wright-
deGraf, Michele. 1999. Creolization, language change and language Cassidy, B. Druss, and L. Kennedy. 1987. Clauses are perceptual units
acquisition: A prolegomenon. In Language Creation and Language for young children. Cognition 26: 26986.
Change, ed. M. deGraf, 146. Cambidge, MA: MIT Press. Hirsh-Pasek, Kathy, and Roberta Michnick Golinkof. 1996. he Origins of
Demuth, Katherine. 1994. On the underspeciication of functional cat- Grammar. Cambridge, MA: MIT Press.
egories in early grammars. In Syntactic heory and First Language Hyams, Nina. 1986. Language Acquisition and the heory of Parameters.
Acquisition: Cross-Linguistic Perspectives. Vol. 1. Ed. B. Lust, M. Suner, Dorcrecht and Boston: Reidel.
and J. Whitman, 11934. Hillsdale, NJ: Lawrence Erlbaum. Johnson, Elizabeth, and Peter Jusczyk. 2001. Word segmentation by
deVilliers, Jill, Tom Roeper, and Ann Vainikka. 1990. he acquisi- 8-month-olds: When speech cues count more than statistics. Journal
tion of long-distance rules. In Language Processing and Language of Memory and Language 44: 54867.
Acquisition, ed. Lynn Frazier and Jill deVilliers, 25798. Dordrecht, the Jusczyk, Peter. 1997. he Discovery of Spoken Language. Cambridge,
Netherlands: Kluwer. MA: MIT Press.
Edelman, Shimon. 2008. Computing the Mind: How the Mind Really Jusczyk, W., A. D. Friederici, J. M. I. Wessels, V. Y. Svenkerud, and A. M.
Works. Oxford: Oxford University Press. Jusczyk. 1993. Infants sensitivity to the sound patterns of native lan-
Eisele, Julie, and Barbara Lust. 1996. Knowledge about pronouns: A guage words. Journal of Memory and Language 32: 40220.
developmental study using a truth-value judgment task. Child Jusczyk, Peter W, and Elizabeth A. Hohne. 1997. Infants memory for
Development 67: 3086100. spoken words. Science 277: 19846.
Elman, Jefrey L., Elizabeth A. Bates, Mark H. Johnson, Annette Kedar, Yarden, Marianella Casasola, and Barbara Lust. 2006. Getting
Karmilof-Smith, Domenico Parisi, and Kim Plunkett. 1966. Rethinking there faster: 18- and 24-month-old infants use of function words to
Innateness. Cambridge, MA: MIT Press. determine reference. Child Development 77.2: 32538.
Flynn, Suzanne, and Gita Martohardjono. 1994. Mapping from the ini- Kegl, Judith, Ann Senghas, and M. Coppola. 1999. Creation through con-
tial state to the inal stage: he separation of universal principles and tact: Sign language emergence and sign language change in Nicaragua.
language speciic principles. In Syntactic heory and First Language In Language Creation and Language Change: Creolization, Diachrony
Acquisition: Cross Linguistic Perspectives, ed. Barbara Lust, Magui and Development, ed. M. DeGraf, 179237. Cambridge, MA: MIT
Suner, and John Whitman, 31936. Hillsdale, NJ: Lawrence Erlbaum. Press.
Foley, Claire, Zelmira Nuez del Prado, Isabella Barbier, and Barbara Kuhl, Patricia, Barbara Conboy, Denise Padden, Tobey Nelson, and
Lust. 2003. Knowledge of variable binding in VP-ellipsis: Language Jessica Pruitt. 2005. Early speech perception and later language devel-
acquisition research and theory converge. Syntax 6.1: 5283. opment: Implications for the critical period. Language Learning and
Friederici, A. D. 2000. he developmental cognitive neuroscience of lan- Development 1.3/4: 23764.
guage: A new research domain. Brain and Language 71: 658. Lightfoot, David. 1989. he childs trigger experience: Degree 0 learn-
. 2002. Towards a neural basis of auditory sentence processing. ability. Behavioral and Brain Sciences 12: 32134.
Trends in Cognitive Science 6.2: 7884. Lust, Barbara. 1999. Universal grammar: he strong continuity hypoth-
Gair, James, Barbara Lust, Lewala Sumangala, and Milan Rodrigo. 1998. esis in irst language acquisition. In Handbook of First Language
Acquisition of null subjects and control in some Sinhala adverbial Acquisition, ed. William C. Ritchie and Tej K. Bhatia, 11155. San
clauses. In Studies in South Asian Linguistics: Sinhala and Other South Diego, CA: Academic Press.
Asian Languages, 27185. Oxford: Oxford University Press. . 2006. Child Language: Acquisition and Growth. Cambridge:
Gentner, Dedre, and Susan Goldin-Meadow, eds. 2003. Language in Cambridge University Press.
Mind. Cambridge, MA: MIT Press. . Universal Grammar and the Initial State: Cross-Linguistic Studies
Gerken, L., B. Landau, and R. Remez. 1990. Function morphemes in of Directionality. In preparation.
young childrens speech perception and production. Developmental Lust, Barbara, and Yu-chin Chien. 1984. he structure of coordination in
Psychology 26: 20416. irst language acquisition of Mandarin Chinese. Cognition 17: 4983.
Gerken, Louann, and Bonnie McIntosh. 1993. Interplay of function mor- Lust, Barbara, and Teresa Cliford. 1986. he 3-D study: Efects of
phemes and prosody in early language. Developmental Psychology depth, distance and directionality on childrens acquisition of ana-
29.3: 44857. phora: Comparison of prepositional phrase and subordinate clause
63
embedding. In Studies of the Acquisition of Anaphora: Deining the Roeper, Tom. 2007. he Prism of Grammar: How Child Language
Constraints, ed. B. Lust, 20344. Dordrecht, the Netherlands: Reidel Illuminates Humanism. Cambridge, MA: MIT Press.
Press. Safran, Jenny R., Richard N. Aslin, and Elissa L. Newport. 1996. Statistical
Lust, Barbara, Suzanne Flynn, and Claire Foley. 1996. What children learning by 8-month-old infants. Science 274: 19268.
know about what they say: Elicited imitation as a research method. Santelmann, Lynn, Stephanie Berk, Shamita Somashekar, Jennifer
In Methods for Assessing Childrens Syntax, ed. Dana McDaniel, Cecile Austin, and Barbara Lust. 2002. Continuity and development in the
McKee, and Helen Cairns. Cambridge, MA: MIT Press. acquisition of inversion in yes/no questions: Dissociating movement
Lust, Barbara, Suzanne Flynn, Claire Foley, and Yu-Chin Chien. 1999. How and inlection. Journal of Child Language 29: 81342.
do we know what children know? Problems and advances in establish- Senghas, A. 1995. Childrens Contribution to the Birth of Nicaraguan Sign
ing scientiic methods for the study of language acquisition and linguis- Language. Cambridge, MA: MIT Press.
tic theory. In Handbook of First Language Acquisition, ed. William C. Seuss, Dr. [1957] 1985. Cat in the Hat. New York: Random House.
Ritchie and Tej K. Bhatia, 42756. San Diego, CA: Academic Press. . [1963] 1991. Hop on Pop. New York: Random House.
Marshall, John C. 1980. On the biology of language acquisition. Shady, M. E. 1996. Infants sensitivity to function morphemes. Ph.D.
In Biological Studies of Mental Processes, ed. D. Caplan, 30120. diss., State University of New York at Bufalo.
Cambridge, MA: MIT Press. Snyder, William. 2007. Child Language. Oxford: Oxford University Press.
Mazuka, Reiko. 1996. Can a grammatical parameter be set before the irst Swinney, David. 2000. Understanding the behavioral-methodology/
word? Prosodic contributions to early setting of a grammatical param- language-processing interface. Brain and Language. 71: 2414.
eter. In Signal to Syntax: Bootstrapping from Speech to Grammar in Tomasello, Michael. 1995. Language is not an instinct. Cognitive
Early Acquisition, ed. James Morgan and Katherine Demuth, 31330. Development 10: 13156.
Hillsdale, NJ: Lawrence Erlbaum. . 2002. he New Psychology of Language. Cognitive and Functional
Mehler, Jacques, Emmanuel Dupoux, hierry Nazzi, and G. Dehaene- Approaches to Language Structure. Mahwah, NJ: Lawrence Erlbaum.
Lambertz. 1996. Coping with linguistic diversity: he infants view- . 2003. Constructing a Language: A Usage-Based heory of Language
point. In Signal to Syntax: Bootstrapping from Speech to Grammar in Acquisition. Cambridge: Harvard University Press.
Early Acquisition, ed. James L. Morgan and Katherine Demuth, 10116. . 2005. Beyond formalities: he case of language acquisition.
Mahwah, NJ: Lawrence Erlbaum. Linguistic Review 22: 18398.
Mehler, Jaques, Peter Jusczyk, Ghislaine Lambertz, Nilofar Halsted, Tomasello, Michael, Ann Cale Kruger, and Hilary Horn Ratner. 1993.
Josiane Bertoncini, and Claudine Amiel-Tison. 1988. A precursor of Cultural learning. Behavioral and Brain Sciences 16: 495552.
language acquisition in young infants. Cognition 29: 14478. Valian, Virginia. 1991. Syntactic subjects in the early speech of American
Morgan, James L., and Katherine Demuth, eds. 1996. Signal to and Italian Children. Cognition 40: 2149.
Syntax: Bootstrapping from Speech to Grammar in Early Acquisition. . 1999. Input and language acquisition. In Handbook of Child
Mahwah, NJ: Lawrence Erlbaum. Language Acquisition, ed. William Ritchie and Tej Bhatia, 497530. San
Newman, Rochelle, Nan Bernstein Ratner, Ann Marie Jusczyk, Peter W. Diego, CA: Academic Press.
Jusczyk, and Kathy Ayala Dow. 2006. Infants early ability to segment Van Valin, Robert D. 1991. Functionalist theory and language acquisi-
the conversational speech signal predicts later language development: A tion. First Language 11: 740.
retrospective analysis. Developmental Psychology 42.4: 64355. Werker, Janet F. 1994. Cross-language speech perception:
Nuez del Prado, Zelmira, Claire Foley, and Barbara Lust. 1993. he sig- Developmental change does not involve loss. In he Development
niicance of CP to the pro-drop parameter: An experimental study of of Speech Perception: he Transition from Speech Sounds to Spoken
Spanish-English comparison. In Proceedings of the Twenty-Fifth Child Words, ed. J. Goodman and H. Nusbaum, 93120. Cambridge, MA:
Language Research Forum, ed. Eve Clark, 14657. Stanford, CA: CSLI. MIT Press.
Oshima, Shin, and Barbara Lust. 1997. Remarks on anaphora in Japanese Werker, Janet F., and Richard C. Tees. 1984. Cross-language speech per-
adverbial clauses. In Papers on Language Acquisition: Cornell ception: Evidence for perceptual reorganization during the irst year of
University Working Papers in Linguistics, ed. Shamita Somashekar, life. Infants Behavior and Development 7: 4963.
Kyoko Yamakoshi, Maria Blume, and Claire Foley. Ithaca, NY: Cornell Wexler, Kenneth. 1999. Maturation and growth of grammar. In
University Press. Handbook of First Language Acquisition, ed. T. Bhatia and Wm. Ritchie,
Paradis, Michel. 2000. he neurolinguistics of bilingualism in the next 55110. San Diego, CA: Academic Press.
decades. Brain and Language 71: 17880. Yang, Charles D. 2006. he Ininite Gift. New York: Scribner.
Piatelli-Palmarini, Massimo, ed. 1980. Language and Learning: he Yang, S., and B. Lust. 2005. Testing efects of bilingualism on execu-
Debate Between Jean Piaget and Noam Chomsky. Cambridge: Harvard tive attention: Comparison of cognitive performance on two non-
University Press. verbal tests. BUCLD 29 Proceedings Online Supplement. Somerville,
Pinker, Steven. 1994. he Language Instinct. New York: W. W. Morrow. MA: Cascadilla. Available online at http://www.bu.edu/linguistics/
Radford, Andrew. 1990. Syntactic heory and the Acquisition of English APPLIED/BUCLD/supp29.html.
Syntax. Cambridge: Cambridge University Press. . 2007. Cross-linguistic diferences in cognitive efects due to
Ramus, Frank, Marc D. Hauser, Cory Miller, Dylan Morris, and Jaques bilingualism: Experimental study of lexicon and executive attention
Mehler. 2000. Language discrimination by human newborns and by in 2 typologically distinct language groups. BUCLD 31 Proceedings.
cotton-top tamarin monkeys. Science 288.5464: 34951. Somerville, MA: Cascadilla.
64
theories of language have been developed most prominently
within ordinary language philosophy. he ideas of Ludwig
Wittgenstein and the principles of speech-act theory have
been taken up by literary theorists, such as Mary Louise Pratt and
7 Stanley Fish. Some philosophers working in other areas of the
analytic philosophy of language, such as Donald Davidson, have
also had considerable inluence (see Dasenbrock 1993).
While intentionalist theories of language have certainly been
ELABORATING SPEECH AND WRITING: inluential in literary study, their use has been limited and, so
VERBAL ART to speak, instrumental. hey tend to be taken up for particular
interpretive aims. his is often the case with literary borrowings
Patrick Colm Hogan from linguistics and the philosophy of language. However, the
uses of representationalist theories have been diferent. In these
cases, the literary and linguistic theories have been much more
thoroughly integrated. his is the result of two factors. First, the
research areas of the linguistic and literary theories overlap.
Second, there has been collaboration between linguists and lit-
erary theorists in treating these areas.
he past half century has seen considerable interaction between More exactly, there are two important representationalist
the sciences of language and the study of literature. But this schools that have had signiicant impact in literary study. One
interaction has been largely unidirectional, with inluence is cognitive linguistics. Some of the most important work in
lowing from language science to literature. his may be seen cognitive linguistics has treated metaphor. While cognitive lin-
most clearly in the massive impact of Ferdinand de Saussure on guists were initially interested in ordinary uses of metaphor, they
literary study since the 1960s. However, generative gram- quickly extended their analyses to poetic metaphor. his was
mar, cognitive linguistics, connectionism, and other facilitated by the collaboration of a linguist, George Lakof, and a
approaches have also had efects on poetics and literary theory. literary theorist, Mark Turner.
In the following pages, I wish to consider the general issues of A similar point may be made about Chomskyan or generative
what distinguishes verbal art as an object of study in language representationalism. One part of Chomskyan theory treats pho-
science. However, before I turn to this, it is important to get a nology. Patterns in sound and stress are of obvious importance
sense of how the analysis of literature and the analysis of lan- in verse. hus, certain aspects of poetic form may be included
guage have been interconnected since the current phase of lan- within the framework of generative linguistic theories. Work in
guage study began 50 years ago. this area has been facilitated by collaborations between linguists
and literary critics as well (see Fabb and Halle 2006). (heorists
have used Chomskyan generative principles as a model for other
THEORIES OF LANGUAGE AND THEORIES
aspects of literary theory also; see generative poetics.)
OF LITERATURE
Brain-related theorization on language and literature is less
When the inluence of language science on literary theory is developed, in part because it is highly technical and in part
considered, it is helpful to begin with a division between literary because neuroscientiic theorization about language itself is
theorists who have drawn on broad methodological principles much more recent. Turning again to the divisions in the Preface,
and literary theorists who have taken up particular linguistic we may distinguish between connectionist approaches and neu-
theories. For example, my own work on literary universals robiological approaches proper.
(Hogan 2003) does not rely on any particular account of lan- here has been some work on verbal art and parallel distrib-
guage universals. However, I do follow the general methodologi- uted processing. For example, some writers have used connec-
cal principles for isolating genetic and areal distinctness (though tionist models to discuss creativity (see Martindale 1995) and
see areal distinctness and literature), distinguishing there has been some work on connectionism and metaphor
diferent varieties of statistically unexpected cross-cultural pat- (e.g., Chandler 1991). hough limited, the work done in this ield
terns, and so on. is generally well integrated into connectionist theories (i.e., it is
his type of inluence, however, is the exception rather than not solely instrumental).
the rule. Other writers have drawn on particular theories of Recently, there has been considerable interest in neuro-
language, using them literally or analogically to explore liter- biology and art. his work has addressed many nonlinguistic
ature. Taking up the structure set out in the Preface, we may aspects of brain function. However, some has focused specii-
distinguish neurobiological, mentalistic, and social theories, cally on language. Much of this addresses hemispheric speciali-
as well as theories that bear on acquisition and evolution. zation, exploring distinctive features of verbal art (see Kane 2004;
Mentalistic theories have been the most prominent. As noted poetic language, neurobiology of).
in the Preface, within mentalistic theories, we may distinguish Given the general orientation of literary critics, it is unsur-
intentionalist accounts of language (referring to human subjec- prising that social aspects of speech and language have igured
tivity and intention) from representationalist accounts (treat- more prominently in literary study. At the level of dialogue, lit-
ing algorithmic operations on mental symbols). Intentionalist erary theorists have drawn on, for example, Paul Grices account
65
of conversational implicature, as well as various ideas of me please, Im trying to cook. I havent got enough potatoes
Mikhail Bakhtin (see dialogism and heteroglossia). In (Biber, Conrad, and Reppen 2006, 69); and 2) the following
terms of larger groups, literary theory has been highly inluenced excerpt from Shakespeares Sonnet 97 How like a winter hath
by historian Michel Foucaults ideas about discourse and social my absence been/From thee, the pleasure of the leeting year!
power (see discourse analysis [foucaultian]). hey he irst piece of speech calls to mind a particular, active con-
have also drawn on the sociological ideas of Pierre Bourdieu text. he second is removed from any such context. Perhaps,
(see field; market, linguistic; and habitus, linguistic) then, works of verbal art are more separable from their imme-
and others. Broader social movements, such as Marxism and diate context. As a irst approximation, we might say that verbal
feminism (see marxism and language, gender and lan- art has other verbal art as its primary, distinctive context, and
guage, and sexuality and language) and their associated the current material context has largely inhibitory force. In other
theories, have also contributed importantly to literary discus- words, our understanding and evaluation of a work of verbal art
sion, though not necessarily in a way that bears particularly on are distinguished from other sorts of understanding and evalua-
language science. tion, irst of all, by their relation to other works of verbal art our
he literary use of most social theories has tended to be sense of storytelling techniques, our awareness of the larger story
instrumental. However, more narrowly sociolinguistic analy- cycle in which a particular narrative occurs, our expectations
ses of literature have been integrated into research programs about characters, and so forth.
in sociolinguistics. his is largely because, here too, the areas of his does not mean that verbal art is entirely insensitive to
research overlap, and language scientists have been involved in immediate context. Our response to stories may be inlected, pri-
research along with literary interpreters. For example, William marily in an inhibitory way, by current material circumstances.
Labovs studies of oral narrative and the researchers of writers in Consider jokes, a form of verbal art. One standard type of joke
corpus linguistics have contributed to the advancement of is ethnic. he standard formats of such jokes (e.g., How many
both sociolinguistics and literary study. xs does it take to screw in a light bulb?), the general function of
he same general point holds for the study of acquisition. jokes, and so forth provide the broad, distinctive context for inter-
here has been valuable work done on, for example, the acqui- pretation and response. he most obvious function of the immedi-
sition of metaphor and the development of verbal ate, material context comes when a member of the relevant ethnic
humor. community is present and that function is usually inhibitory.
Finally, evolutionary theory has inspired many literary theo- Removal from immediate context cannot be quite the crucial
rists in recent years (see verbal art, evolution and). Its property, however. Consider, for example, the present essay.
advocates propose sweeping evolutionary explanations for a Physically, I am alone as I am writing. Anyone who reads this
wide range of literary phenomena. It is not clear that this pro- essay will be removed from the material context in which I am
gram has gone beyond the stage of conjecture. In any case, it is writing, and that material context will be irrelevant to the readers
general and often not tied speciically to language. response and understanding. But that does not make this essay
As I have suggested, much work in the preceding areas is verbal art. Perhaps, then, the most important diference between
very signiicant. However, much of it begins, like the classical the aforementioned speech actions is not context per se but
European epic, in medias res. It does not set out a clear ield of something closely related to context. he joke suggests that this
study for a language science of verbal art. Rather, the most suc- has something to do with the audience. Perhaps the diference is
cessful work tends to focus on those areas of verbal art that fall a matter of the way the speaker addresses his or her audience.
within the purview of nonliterary research programs. Put difer- To consider this, we might return to those cases. It is clear that
ently, it tends to treat verbal art as a case of something else (e.g., the person who says, Go away, Im cooking, is talking to his or
cognitive metaphor or phonology). In the remainder of this essay, her audience. I, too, am addressing my readership in writing this
then, I do not explore these particular approaches and connec- essay even if my idea of that readership is rather amorphous.
tions in more detail. Such a review is, in any case, redundant, as But the sonnet, as a socially circulated poem, is not addressing
this material is covered in the following entries. Instead, I con- its readership. Even if Shakespeare initially drafted the poem as
sider what is distinctive about verbal art and why, as a result, it is a private message to a particular person, he made the decision
an important area of study for language science. that its readership would not be its addressee when he made it
a public poem.
More exactly, works of verbal art tend to be marked by indi-
THE PARTIAL AUTONOMY OF VERBAL ART: INDIRECT
rect address, rather than direct address. When considered from
ADDRESS, SIDE PARTICIPATION, AND PLAY
the perspective of the reader rather than the author, indirect
Perhaps the most obvious diferentiating characteristics of verbal address is roughly the same as side participation, as discussed
art are that it is normative and rare. While all cultures have verbal by Richard Gerrig and Deborah Prentice (1996). Gerrig and
art (see Kiparsky 1987, 1956), few people in any culture produce Prentice distinguish several possible roles in a conversation.
works of verbal art (though, of course, they do produce many con- Obvious roles include speaker, addressee, and overhearer. he
stituents of such works novel metaphors, allusions, wit, and so authors add a fourth role side participant. Suppose I am with
forth). On the other hand, these diferences are not deinitive in my wife at the grocery store. She sees von Humboldt, a colleague
themselves. Rather, they seem to result from other factors. of hers. he colleague says, Oh, that meeting with de Saussure,
Consider two samples of speech actions: 1) the following our new provost, is on the twelfth. When she says the new pro-
excerpt from recorded speech Go away, Im cooking. Excuse vost she is doing so for my beneit. My wife knows perfectly well
66
Elaborating Speech and Writing
who de Saussure is. he conversation does not really concern suppose Jane is in third grade and Jill is in second grade. Jane
me. If I had been standing a few feet away, I would have merely begins to play a third grade teacher. She starts by referring to
been an overhearer and von Humboldt would have said only, what the class did last week. However, she has to keep in mind
Oh, that meeting with de Saussure is on the twelfth. But since I that Jill does not know what the third grade class did last week.
was closer, I became a side participant and von Humboldt had to hus, she may have to say Now, class, you remember that last
take my knowledge and interest into account when speaking. week we began by discussing government and binding
he diiculty with the account of Gerrig and Prentice is that theory. As this suggests, play is a purer case of indirection than
it is, for the most part, analogical. It suggests an area of research ordinary side participation. In the case of side participation,
and theorization. However, it does not develop this in algorith- our adjustments are more likely to be explicit. For example, my
mic speciicity (i.e., spelling out how it will work, step by step) wifes colleague is likely to turn to me when explaining that de
in relation to the structures, processes, and contents of human Saussure is the new provost. In play, explicit adjustments occur
cognition. To explore the idea further, we might consider a sim- when the pretense of play is disrupted. Suppose Jill accidentally
ple model of speech (oral or written). his model begins from addresses Jane as Jane, then explicitly adjusts that to I mean,
the premise that language is a form of action. hus, it has the Frau Doktor Wittgenstein. Jane is likely to get annoyed with this
usual components of action goals, motivations or emotions, breach in the play, perhaps responding, Jill, you should never
anticipated outcomes, and so forth. call your teacher by her irst name. Now you have to sit in the cor-
Here, I wish to isolate two stages in the production of speech. ner and wear the Aversive Stimulus Operant Conditioning Cap.
One is the basic generative phase in which the initial utterance Following this usage, we may say that verbal art is a form of
is formulated. he second is an adjustment phase, which follows indirect speech action in the same sense as one inds in play.
our realization of just what we are saying. Intuitively, we might Indeed, verbal art is, in certain respects, a form of play. Note,
expect that awareness of speech would precede generation. however, that in verbal art, in play, and elsewhere, there is not an
But it does not. Indeed, a moments relection suggests why. In absolute separation between direct and indirect action. Again,
order to realize that I am about to say Hello, I must in some indirect action reduces extrinsic factors. It does not eliminate
sense have already generated the Hello, even if my lips have them. he diference is one of degree.
not yet moved. More importantly, empirical research indicates Indeed, the diference between extrinsic and intrinsic is not
that human action generally involves just this sort of duality. For ine grained enough to clarify the distinctiveness of verbal art.
example, as Henrik Walter points out, our brains initiate or proj- his becomes clear as soon as we notice that almost all writing
ect actions approximately .5 to .7 seconds before the actions are is predominantly intrinsic in that it is not generated or adjusted
performed. We are able to modify or inhibit the action .3 to .5 primarily by reference to external circumstances. Moreover,
seconds after it is projected (see Walter 2001, 24850). In keeping the opportunities for extensive revision in writing allow for virtu-
with this temporal sequence, adjustment may precede, interrupt, ally all adjustments to be implicit in the inal text. How, then, do
or follow the execution of an action. we distinguish ordinary writing from writing that is verbal art?
Suppose someone asks me if I need a ride. I begin to say, No, Here, too, the distinction is a matter of degree. Writing
thanks. Duns is picking me up. I realize that the person does not involves diferent gradations of removal from direct address.
know who Duns is. I may adjust the sentence before speaking Take, for example, a letter. Suppose von Humboldt is not speak-
My cousin Duns is picking me up. Or I may succeed in making ing to my wife about a faculty meeting, but is instead writing her
the change only after beginning the sentence Duns uh, hes a note. In an obvious way, von Humboldts address to my wife is
my cousin hes picking me up. When the adjustment takes indirect and intrinsic. After all, my wife is not present. However,
place before speaking, we might refer to it as implicit. When von Humboldts address to my wife is direct in another way, for it
it occurs in the course of speaking or after speaking, we may is oriented precisely toward her. It remains guided by her as von
refer to it as explicit. Finally, it is important to note that actions Humboldt imagines her.
have two broad sources. One is extrinsic, or externally derived; In order to understand how this works, and how it bears on
the other is intrinsic, or internally derived (see MacNeilage 1998, verbal art, we need to have a clearer understanding of imagina-
2256 on the neural substrates for this division). tion and action. When I begin to act, I tacitly project or imagine
Now we are in a position to clarify the nature of indirect possible outcomes for my action. For example, when I see a car
address or, more generally, indirect action. Indirect action, coming toward me, I run to the curb. his is bound up with a tacit
as I am using the phrase, involves a relative decrease in the imagination of where I should be in order to fulill my purpose
extrinsic aspects of action. hus, the sources of both generation of not being run over. More exactly, we may understand imagi-
and adjustment are more intrinsic than is commonly the case. nation, like speech, as involving two (continuously interacting)
Moreover, when they occur, extrinsic adjustments tend to be processes. One generates possible scenarios. he other makes
implicit rather than explicit. adjustments. I project running to the curb, but then notice a man-
To get a better sense of how indirect action operates, it is use- hole in my path. his leads me to adjust my imagination of the
ful to look at a paradigmatic case of such action play. Indeed, precise trajectory. he nature of the generation and the nature of
play and side participant interaction have a great deal in com- the adjustment will change, depending on the guiding purposes
mon. When Jane and Jill play school, each of them must keep of the action. In some cases of speech action, the purpose involves
in mind that the other person has a real identity outside the role a real addressee. In some cases, it does not. In a face-to-face dia-
she is playing. Moreover, each of them must continually adjust logue, a speaker will continually generate, adjust, and regener-
her speech and behavior to take that into account. For example, ate what he or she is saying in part due to the addressees actual
67
response. In writing a letter, a writer will be guided by his or her his combines with pragmatic information about the presence of
tacit imagination of the addressee. hough that imagination is not cookies in the cookie jar to initiate the action of getting a cookie
adjusted by reference to actual responses, it nonetheless serves as out of the cookie jar. However, before I reach for the cookie jar,
a guide for the generation of the speech. Crucially, this imagined prudential information stops me as I recall the deleterious efects
addressee maintains his or her external, independent existence that cookies are likely to have on my lithe and shapely form.
for the speaker. As such, that addressee is, so to speak, a pseudo- For imaginations or actions to result from pragmatic infor-
extrinsic guide to the generation of the speech. mation, there must be some motivation present as well. In other
Here, then, we may adjust our account of indirect address words, I must have some emotion. Emotion is what leads us to
in verbal art. Verbal art minimizes both extrinsic and pseu- act, whether the action is speech or larger bodily movement. It is
doextrinsic elements in the production and adjustment of also what leads us to refrain from acting. We may divide emotions
speech and in the imaginations that are connected with speech. into two sorts, depending on their function in action sequences.
Moreover, it minimizes explicit markers of adjustments for side he irst might be called the initiating emotion. An initiating
participants. emotion is any emotion that one feels in current circumstances
We began with the idea that verbal art is relatively indepen- and that impels one to act. Action here includes the initiation of
dent of direct, material context. hat context involves authors movement and the initiation of imagination. We imagine various
and readers. In this connection, we have been considering outcomes of our actions. For example, feeling hungry, I swiftly
the relation between the author and readers as one of indirect and unrelectively imagine eating a cookie. But my imagination
address. he other crucial element of material context is, of does not stop there. I may imagine my wife seeing me with the
course, reference. Indeed, verbal art, as commonly understood, crescent of cookie in my hand and the telltale crumbs sticking to
has an even more obvious relation to reference than to address, my lips, then chastising me, reminding me of the doctors warn-
for verbal art is paradigmatically conceived of as iction. As such, ings, and explaining once again that the cookies are for my nieces
it has an unusual degree of referential autonomy. In other words, and nephew. Or I may suddenly see an image of myself with wob-
it tends to be characterized by indirect reference as well as indi- bly love-handles. In each case, I experience what might be called
rect address. his, too, is illustrated most clearly in play. Suppose a hypothetical emotion. A hypothetical emotion is a feeling that
Jane and Jill are playing telephone. Jane picks up a banana, puts I experience in the course of imagining the possible trajectories
it up to the side of her face, then holds it out to Jill and says, Its of my action. While initiating emotions give rise to (or generate)
for you. In saying it, she is not referring to a banana. She is the action sequence initially, hypothetical emotions qualify (or
referring to a telephone. adjust) that action sequence. Hypothetical emotions may inten-
In sum, verbal art is not just partially autonomous with sify the initiating emotion; they may inhibit it; they may respecify
respect to immediate circumstances. It is largely independent of the precise goals of the action sequence (e.g., from eating cook-
the extrinsic and pseudoextrinsic aspects of the generation and ies to eating carrots), or they may afect the precise means that I
adjustment of speech action and associated imagination. his is adopt (e.g., checking that my wife is not around).
true with respect to both address and reference. From the preceding example, it may seem that hypothetical
emotions are all egocentric. However, hypothetical emotions
may also be empathic. For example, I may forego my plan to eat
ART AND ACTION: THE PURPOSES OF VERBAL ART
cookies because I imagine my tiny nephews disappointed face
In discussing action-related imagination, we noted that the pre- when he reaches into the empty jar. Empathic hypothetical emo-
cise nature of imagination varies according to the purposes of the tions may have the same qualifying efects on initiating emotions
action. Generally speaking, the goals of physical action concern and actions as do egocentric hypothetical emotions.
the alteration of some situation. To a great extent, speech actions Hypothetical emotions are critical for all types of action,
have the same function. One crucial diference between speech including verbal action. Consider again the case where von
actions and bodily actions is that speech actions have their efects Humboldt explains that de Saussure is the new provost. In doing
only through minds. However much I plead with the packet of this, von Humboldt tacitly imagines the conversation from my
noodles, they wont turn themselves into pad thai. But, if I sweetly perspective and, sensing that I may not follow and, more impor-
ask someone to make the noodles into pad thai, perhaps that tantly, that I might feel left out she provides the information.
person will do so. Speech actions, then, tend to aim at altering If I wish to alter someones emotions through speech, I will
the world by altering peoples actions. In order to alter peoples appeal primarily to initiating emotions. hus, the alteration of
actions, they aim at altering two things irst, the way those peo- initiating emotions is usually a central purpose of speech action.
ple understand the world and, second, the way they feel about it, However, certain sorts of hypothetical emotions are important
including the way they feel about the speaker himself or herself. as well. For example, when requesting a favor, I may foreground
More exactly, we may distinguish two psychological pur- how grateful I will be. One purpose is to encourage my addressee
poses of speech actions. hese are informational and emotional. to imagine my gratitude and experience the related hypothetical
Informational purposes may be further divided into pragmatic emotion (roughly, feeling appreciated).
and regulative. Pragmatic information concerns factual, direc- In sum, ordinary speech actions involve an informational
tional, or other information that facilitates our pursuit of goals. aim and an emotional aim, usually coordinated to produce some
Regulative information concerns broad ethical, prudential, or change in the world, currently or at some time in the future. he
related principles, which tend to serve an adjusting function. informational aim involves both pragmatic and regulatory com-
For example, feeling hungry, I form the goal of eating something. ponents, though in ordinary speech, the pragmatic component
68
is probably dominant. he emotional aim involves both initiat- and integrated with other information; see encoding) and
ing and hypothetical emotions. In ordinary speech, the initiating may even draw attentional focus. his occurs most obviously
emotion is probably dominant. in the phonetic/phonological aspect of language. Poets pattern
In being removed from direct address and direct reference, stress beyond what occurs spontaneously. hey organize vowel
works of verbal art commonly do not have a goal of directly alter- sounds and initial consonants to produce assonance and allit-
ing particular material conditions. Nonetheless, verbal art is ani- eration. he point is less clear in morphology, though corpus
mated by the same two psychological goals that animate other linguistic studies have pointed to diferential tendencies (see
forms of speech action. Verbal art, too, has an informational Biber, Conrad, and Reppen 2006, 5865)
component and an emotional component. Indeed, cross-cultur- Syntax is a complex and thus particularly interesting case.
ally, aestheticians and critics tend to see verbal art as successful Verbal art does undoubtedly tend to pattern syntactic usage in
insofar as it afects us emotionally and insofar as it develops sig- encodable ways. But the poetic patterns in these cases tend to
niicant ideas or themes. be intensiications of the ways in which ordinary speech actions
he emphasis in verbal art, however, tends to be diferent pattern syntax. For example, in ordinary speech, we have some
from that of other speech actions in both cases. Speciically, ver- tendency to use parallel syntactic structures. In poetry, we are
bal art commonly does not stress pragmatic aspects of informa- more likely to use these. A more obtrusive foregrounding of syn-
tion. Certainly, there are, for example, political works that set tax occurs when we violate expectations of patterning in verbal
out to give the audience pragmatic information. However, this art. Such violation is the other primary way in which verbal art
is not the general tendency of verbal art. Rather, verbal art tends treats linguistic levels more autonomously. In the case of syn-
to develop regulative concerns. hese regulative concerns make tax, there are several ways in which this may occur. One obvious
their appearance in literary themes. When we interpret a work way is through disjoining syntactic units and line units, such that
and seek to understand its point, we are commonly looking line breaks do not coincide with syntactic breaks (see Tsur 2006,
for a theme, which is to say, some sort of regulative informa- 14652). Another is by rejecting standard principles of word
tion. Moreover, in terms of emotion, verbal art tends to inspire order.
not initiating emotions but hypothetical emotions particularly Verbal art also manifests distinctive tendencies in semantics.
empathic hypothetical emotions. hese are found most obviously in lexical preferences and meta-
phor. In the case of lexical preferences, verbal art may draw on
rarer or more unexpected words. Perhaps more importantly, it
THE MAXIMIZATION OF RELEVANCE
may pattern the suggestions and associations of terms. In ordi-
Up to this point, I have been speaking of the content of speech nary speech, we tend to concern ourselves with the associative
action generally and verbal art in particular. But the form of ver- resonance of a term only in extreme cases (e.g., when its con-
bal art is widely seen as crucial, perhaps its deinitive feature. he notations may be ofensive to a particular addressee). In verbal
content/form division is somewhat too crude to form the basis art, however, the writer is much more likely to organize his or her
for a sustained analysis of verbal art. However, it does point to lexical choices so that the suggestions of the terms are consistent
an important aspect of verbal art, and an important diferentiat- (e.g., in emotional valence; see dhvani and rasa).
ing tendency the partial autonomy and patterning of distinct As to metaphor, our use of tropes in ordinary speech is surpris-
linguistic levels in verbal art. ingly constrained. here are aspects of everyday metaphor that
All linguistic levels phonology, morphology, syn- are creative. However, on the whole, we follow well-worn paths.
tax, and so forth are, of course, relevant to speech actions of Lakof and Turner have argued that a great deal of literary meta-
all types. However, most components are, in most cases, only phor draws on the same broad structures as ordinary speech.
instrumentally relevant. Morphology is relevant for communi- However, literary metaphors extend, elaborate, and combine
cating whether I want one cookie or two cookies. But it has no these structures in surprising ways (see poetic metaphor).
separate function. Put diferently, the speciication of phonology, Finally, we ind parallel tendencies in discourse practices.
morphology, and syntax is a sort of by-product of my pragmatic Consider the principles of conversation articulated by Grice
aims. I want to get two cookies. In English, I signal that I want (see cooperative principle). hese principles form a set of
two cookies rather than one by using the word two, putting practical conditions for any sort of interaction between speak-
that word before cookie, adding s to cookie, and so forth. ers. For example, it is a fundamental principle of conversation
I do not set out to do anything with phonology, morphology, or that one should not say things that are irrelevant to the con-
syntax. One might even argue that I do not set out to do anything versation. Grice points out, however, that one may lout these
special with semantics or with discourse principles. I just run principles, violating them in gross and obvious ways. Flouting a
my language processors to achieve the active goal. Features of principle of conversation gives rise to interpretation. Jones and
phonology and so forth are relevant to my action only insofar Smith are discussing who should be hired as the new assistant
as my speech processor makes them relevant. hey have, in this professor in the Hermeneutics of Suspicion. Jones says, heres
way, minimal relevance. a new applicant today Heidegger. What do you think of him?
A number of literary theorists, prominently the Russian Smith replies, Nice penmanship. Since penmanship is clearly
Formalists, have stressed that verbal art foregrounds its lan- irrelevant to the topic, Smith may be said to be louting the prin-
guage (see foregrounding). In cognitive terms, we might say ciple of relevance. Jones is likely to interpret Smiths comment
that verbal art tends to enhance linguistic patterns to the point as indicating a dim view of Heideggers qualiications. Literary
where they are likely to be encoded by readers (i.e., perceived works often lout conversational principles.
69
All these points indicate signiicant diferences between ver- attention on the friend and simultaneously directs my attention
bal art and other sorts of speech action. Again, these diferences to ways in which I can reach him. If I see some sort of animal, I
do not create a hard-and-fast division between verbal art and feel fear. hat fear keeps my attention on the animal and simulta-
all other types of speech. here is a continuum here, with many neously directs my attention to ways that I can escape.
parameters and degrees of variation. Nonetheless, there is a clear he arousal of interest is obviously crucial to literary experi-
diferential tendency. ence. It is also important to other speech actions. However, in
he diferences we have just been considering are a matter nonliterary cases, a great deal of the interest comes from the
of the various linguistic levels of a literary text bearing autono- direct relation between the people involved (thus, the speaker
mously or separately on our understanding and experience of and addressee), as well as the practical situation referenced in
the work. his is irst of all and most obviously a matter of creat- the speech action. In other words, a great deal comes from direct
ing patterns. However, given the violations of syntactic rules and address and direct reference, both of which are greatly inhibited
the louting of conversational principles, it seems clear that ver- in verbal art. Nonetheless, verbal art has other means of produc-
bal art does not simply create extra patterns on top of the usual, ing interest. We have just been discussing two such means the
instrumental patterns produced by ordinary language pro- multiplication of non-normal patterns and violations of normal
cesses. It also violates patterns of ordinary language processes. or expected patterns at various linguistic levels. Both are precisely
Most importantly, in both cases, the result renders the linguis- the sorts of deviation from normalcy that produce interest.
tic level in some way directly (rather than just instrumentally) Speciic emotional efects are fostered most obviously by
relevant to our experience of the work. hus, it maximizes the semantics and pragmatics. For example, a great deal of our
relevance of language features. emotional response to verbal art seems to be bound up with the
But in what way are these features relevant? As with any sort patterning of associative networks that spread out from lexical
of speech action, relevance is, irst of all, relevance to the aims of items (see Oatley 2002 and suggestion structure). he
the action. Again, the primary aims of verbal art are thematic and extension and elaboration of metaphorical structures are clearly
emotional. hus, the maximization of relevance is the thematic consequential in this regard, particularly as the concreteness
or emotional use of (ordinarily unexpected) noninstrumental of metaphors often enhances associations with concrete expe-
patterns or violations of (ordinarily expected) instrumental riential memories, including emotional memories. he speciic
patterns from diferent linguistic levels. emotional impact of phonological, morphological, and syntactic
In touching on these issues, literary critics have tended to features is less obvious, but no less real, at least in some cases.
emphasize thematic relevance. Some writers have seen patterns For example, the interplay between nonlinguistic organization
and violations of patterns in phonology, morphology, and syntax (e.g., in line divisions) and linguistic organization (e.g., in sen-
as consequential for interpretation. It is probably true that such tence divisions) may serve to convey a sense of a speakers voice
formal features are thematically interpretable in some cases. and, along with this, an emotional tone.
However, it seems doubtful that such features are generally rele- hus, once more, we see both continuity and diference
vant to interpretation. In contrast, extended patterns or violations between verbal art and other sorts of speech action. Most speech
in semantics and pragmatics are almost always interpretively actions involve a minimal, incidental patterning of linguistic lev-
relevant. In these aspects, the main diference between verbal art els. In contrast, the general tendency of literary art is toward the
and other forms of speech action is where the interpretive pro- maximization of the relevance of diferent linguistic levels. his
cess ends. We will consider this issue later. relevance is a function of the main purposes of the text, thematic
If anything, the maximization of relevance applies more fully and emotional. Again, this need not be relevance that increases
to the communication of emotion than to the communication of informational content. Indeed, I suspect that it most often is not.
themes. here are two types of afective response that bear impor- It need not even contribute to the speciic emotions of the text.
tantly on verbal art, and thus on the maximization of relevance. We It may be a matter of enhancing interest or of qualifying the spe-
might refer to the irst as pre-emotional and the second as speciic ciic emotions fostered by the text. In each case, though, there is
emotional. Pre-emotional efects are efects of interest. Interest is some partially autonomous variation in the organization of the
pre-emotional in two senses. First, it is often an initial stage in an linguistic level in question, through the addition of unexpected
emotion episode. Second, it is a component of all speciic emo- patterning, through the violation of expected patterning, or both.
tions after those emotions have been activated. Speciic emotions
are simply our ordinary feelings sorrow, joy, disgust, and so on.
ON INTERPRETATION AND THE USES OF TEXTS
More exactly, interest is the activation of our attention system.
hat activation occurs whenever we experience something new In the preceding section, I referred briely to the point at which
or unexpected (see Frijda 1986, 2723, 318, 325, 386). Such acti- interpretation ends. When verbal art is considered in relation to
vation prepares us for events that have emotional signiicance. interpretation, the irst thing to remark is that verbal art is notori-
It directs our attention to aspects of the environment or our own ous for its hermeneutic promiscuity (see philology and her-
bodies that may be emotion triggers. Once a speciic emotion is meneutics). It is widely seen as almost ininitely interpretable.
activated, that emotion system reactivates our attention system, At one level, this is surprising. Novels, for example, are devel-
focusing it on properties relevant to that emotion in particular. oped in great detail and with considerable elaboration of the
For example, suppose I am out in the woods and hear something characters attitudes and actions. It might seem that this would
move. hat arouses my attention. I carefully listen and look. If constrain interpretation relative to the much vaguer and more
I see a friend at a distance, I feel some joy. hat joy keeps my elliptical speech actions of ordinary life. But that is not generally
70
believed to be the case. his suggests that the degree of interpret- someone for the irst time, I may describe my outit so that he
ability of a text is not a function of its elaboration. Rather, inter- or she can recognize me. It has pragmatic consequences. But
pretability appears to be a function of a texts distance from direct if an author describes a characters outit, that description may
address and direct reference to practical conditions, particularly bear on our understanding of the characters emotions, class
as these are connected with pragmatic information. Put difer- status, or religious beliefs. hose features may in turn bear on
ently, the limits of interpretation are not so much a matter of our broader understanding of human relations, class conlict,
the words themselves. hey are, irst of all, a matter of action. or religious practice as portrayed in the work. In short, ordi-
Here as elsewhere, action is animated by the agents goals, narily incidental details may have thematic (thus regulative)
emotions, expectations, and so forth. In ordinary life, then, we consequences.
usually understand validity in interpretation as a function of Moreover, literary narratives tend to develop subtle and vari-
the speakers intention (see intentionality). More techni- able ainities and ambivalences (see Hogan 2003, 12251). In
cally, our prototype of interpretive validity almost certainly some cases, the development of these ainities actually runs con-
includes the intention of the speaker or author as a critical norm. trary to the authors self-conscious sense of his or her own aims.
(he point is related to the argument of Steven Knapp and Walter he most famous case may be Miltons portrayal of Satan. Satan
Benn Michaels that we invariably interpret for speakers inten- has often been seen as the most engaging igure in Paradise Lost,
tion, though it is not identical, for there are cases where we do but this was certainly not Miltons self-conscious intention.
not adhere to this prototype and thus do not interpret for the his is part of a larger point that intention is not a single, uni-
speakers intention.) However, intention is an abstract criterion. ied operation. here are diferent sorts of intention with diferent
We do not have access to speakers intentions. So even interpret- objects, constraints, and processes. Perhaps the most important
ing for intention, we need an operational criterion for validity form of intention for verbal art is what we might call aestheti-
as well. hat is where our own action enters. Action commonly cal intent (see Hogan 1996, 16393). he aesthetical intent of an
guides our sense of when we have gotten an interpretation right. author is to produce a work that has the right sort of experiential
At the dinner table, someone says, Could you pass the that? I efect. his right efect is not something that the author is likely
am not sure what he means by that. Unable to read his mind, I to be able to articulate separately from the work itself. It is simply
engage in an action either a bodily action (passing the beans) or what he or she experiences when he or she feels that the work
a speech action (asking if he meant the beans). When I pass the is now complete. In composing the work, the author generates
beans, he knows that I have understood his intention. When he and adjusts the text, testing the outcome against his or her own
accepts the beans, I infer that I understood his intention. response. he authors sense that the work is complete need not
he interpretation of verbal art is as removed from such prac- mean that the work conforms to the authors self-conscious atti-
tical action as possible. hus, our ordinary operational criterion tudes and commitments.
is rendered inefective. he only obvious practical behaviors One way of explaining aesthetical intent is that the author
relating to literary interpretation are professional for example, adopts an aesthetical attitude toward the work, or a dhvani atti-
the acceptance or rejection of articles in academic journals. tude, as Anand Amaladass put it. his is a matter of broaden-
(he point is codiied in Fishs contention that validity in liter- ing ones attention to the work, expanding ones encoding of the
ary interpretation is deined by the norms of interpretive work, to include its multiple resonances and emotive patterns.
communities.) In short, it involves approaching the text as a work in which dif-
A number of further problems for intentional inference arise ferent linguistic levels are maximally relevant. Correlatively, it
in connection with this removal of literary interpretation from involves approaching the text as a work that is removed from
practical action. First, many texts are read or performed, and constraints of direct reference or address, constraints that ordi-
thus interpreted, far from their authors and even after their narily orient our judgment of informational relevance and our
authors are dead. If we take a strict intentionalist view of valid- construal of emotional bearing. When approaching a work as
ity, then the author is the only one who has the authority to verbal art, readers and audience members adopt this attitude as
determine that a given action does indeed satisfy an operational well to varying degrees.
criterion. Suppose Jones leaves instructions for his funeral. he he mention of readers approaches to texts brings us to a
funeral director follows them as well as she can. But Jones is not inal complication. We have been considering ways in which
around to conirm that she got things right. Obviously, there works produced as verbal art tend to be diferent from other
are things that she might do to ascertain Joness intention. For speech actions. But the nature of a given text is not determined
example, she might talk to Joness friends and relatives or she solely by authorial intent. Despite our prototypical concern for
might try to learn something about Joness religion and ethnic authorial intent, we are free to approach works of verbal art in
background. hese are the sorts of concerns that lead writers pragmatic ways and to approach other works with an aesthetical
such as Hans-Georg Gadamer to stress tradition as a crucial attitude. As writers in cultural studies have stressed, practices of
guide to interpretation. literary analysis may be applied to a wide range of texts that were
A second problem is more distinctively connected with ver- not initially intended to be literary. In short, literariness may
bal art per se. Both informational and afective patterns are be deined by interpretation or reception no less than it may be
more complex in verbal art than in most other speech actions. deined by production; it may be deined by readers no less than
For example, a literary work communicates thematically rel- by authors. In this way, the usual characteristics of verbal art may
evant information by maximizing the relevance of virtually be extended to other texts or withdrawn from texts (as when a
every semantic and discursive detail in the text. If I am meeting novel is studied for its authors psychology).
71
On the other hand, the expansion of hermeneutic liberality in play, be summer (e.g., a time of professional success for the
results from the existence of verbal art. Interpretive autonomy speaker). In short, summer becomes a metaphor.
arises in the irst place through the removal of speech action he mention of metaphors points us toward the maximization
from direct address and direct reference, along with the attenu- of relevance. However, before going on to this, it is important to
ation of pragmatic information and initiating emotion. In this remark on some other contextual features of the poem, contex-
way, interpretive practices that bridge literary and nonliterary tual features that are themselves bound up with metaphor. As I
speech actions are themselves a distinctive product of literary indicated earlier, the entire tradition of poetry forms an implicit
speech action. context for any given poem (a point stressed by T. S. Eliot). One
obvious way in which Shakespeare suggests a context of verbal
SHAKESPEAREAN INDIRECTION art is in his use of seasonal imagery. Winter is a standard literary
image of romantic separation. his is, of course, related to the
As the preceding discussions have been rather abstract, it is conceptual metaphor, discussed by Lakof and others: LIFE
valuable to end with a more developed example. Consider IS A YEAR. As Lakof and Turner explain, there are several pro-
Shakespeares Sonnet 97: cesses poets use to create novel instances of these schemas.
Shakespeare is clearly creating such a novel instance when he
How like a winter hath my absence been
maps the source metaphor of winter onto the target, summer
From thee, the pleasure of the leeting year!
(see source and target). Moreover, in making summer into
What freezings have I felt, what dark days seen!
(metaphorical) winter, he is intensifying the efect of the meta-
What old Decembers bareness every where!
phor by contrast. But he does not say summer only. He expands
And yet this time removed was summers time,
the target time to summer and autumn. In a way, this is pecu-
he teeming autumn big with rich increase,
liar. he contrast would have been more obviously enhanced
Bearing the wanton burthen of the prime,
by making the target time period spring and summer. Why does
Like widowed wombs after their lords decease:
Shakespeare choose autumn? here are several reasons. Two
Yet this abundant issue seemd to me
are closely interrelated. First, he wishes to intensify the tacit
But hope of orphans, and unfathered fruit,
emplotment of the speakers isolation. hat speaker is now
For summer and his pleasures wait on thee,
anticipating the very worst, for if summer was like winter, how
And thou away, the very birds are mute;
devastating will winter be? Second, he wishes to hold out hope
Or if they sing, tis with so dull a cheer
for something beyond that period of terrible loneliness spring.
hat leaves look pale, dreading the winters near.
hat hope is possible only through an indirect reference to the
he poem presents a straightforward instance of indirect broader, literary context in which spring is the season when the
address. here is, of course, a narratee in the poem, an explicit lovers are inally reunited.
thee. But the poem only plays at addressing this thee. he he indirection of the poem makes it unlikely that it will
point is clear, for example, when the reader is brought in as a side have any pragmatic information as its aim. Pragmatic informa-
participant with the otherwise superluous information, this tion most often involves reference to particular situations, or to
time removed was summers time. If there were a deining, mate- forms of general knowledge. Of course, there is not a strict divi-
rial context and if there were direct address in this speech action, sion between general pragmatic knowledge and, say, prudential
the speaker would not need to explain that the separation had regulatory information. But to the degree that the poem com-
occurred over the summer. he beloved would surely remember municates thematic points, those points do incline toward pru-
this. he point is reinforced more subtly by the shifts in spatial ori- dential regulatory information. For example, suppose summer
entation in the poem (what some writers see as particular types of is metaphorical (i.e., winter is metaphorical for summer,
deixis [see Stockwell 2002, 4157]). hese may be understood as a which is itself metaphorical for, say, professional success). hen
matter of indirect reference. If this speech action were grounded the poem suggests that the enjoyment of external conditions
in a particular situation, then there would almost certainly be (e.g., professional success) is lost when it cannot be shared a
some ixed reference point, a speciic home that would deine point with clear regulatory consequences.
which of the lovers was stationary and which had departed. But he indirection of the poem also indicates that it is unlikely
the poem is contradictory on this score. In the opening lines, to be aimed at the elicitation of initiating emotions. his can be
the speaker refers to my absence /From thee and his time seen if we contrast it with a similar letter, sent by the poet to his
removed. But toward the end of the poem, he reverses the spatial beloved. Such a letter could be aimed at convincing the beloved
orientation and related direction of movement. Now, the beloved to return home. he poem, in contrast, does not have any such
is away and summer will wait for her return. initiating emotional aim. Its emotional aim is, rather, conined to
he introduction of summer in this context suggests some- provoking empathic hypothetical emotions.
thing else. he poet is not only like little Jane, covertly explaining he maximization of relevance contributes to the poems
what the third graders did in their last class. He may also be like achievement of its primary aims. As usual, this maximization is
Jane in referring to a telephone by way of a banana. Put difer- most obvious at the phonological level. Consider only the irst
ently, the indirectness of both address and reference make the two lines. he poem is in iambic pentameter. However, when
reference of summer itself uncertain. Without a context, read- spoken naturally, these lines do not follow the meter. Indeed,
ers are free to associate summer tacitly with anything that can, there is considerable tension between spontaneous stress
72
patterns and the meter. One natural way of speaking the lines Verbal art both fosters the proliferation of interpretations
would be as follows: and sharpens our sense of the human embodiment that lim-
its those interpretations. We ind the same tension between
Hw lke wntr hth m bsnce ben
sameness and diference in all the characteristics we have con-
Frm the, th plasre f th letng yar!
sidered. Although I have been stressing diference, the same-
here are several things one might remark on here. First, in ness is no less consequential. For example, in the context of
natural speech, the lines are in a somewhat irregular tetram- an encyclopedia of language sciences, it is important that in
eter. However, this irregularity is not unpatterned. here is a studying verbal art, we are likely to isolate properties and rela-
striking rhythmic motif that occurs in the middle of both lines. tions in language that we might otherwise have passed over
he sequences lke wntr hth m bsnce and the, th properties and relations of address, reference, informational
plasre f th letng have the same stress pattern. his is not structure and orientation, and type and force of emotional
monotonous because the lines also manifest three variations. consequence or function. In short, verbal art is a critical ele-
First, a caesura appears in only one of the sequences. Second, ment of human life. As such, it is a critical object of study in its
the irst word of the line changes from stressed to unstressed. own right. It is also a crucial part of human speech action. As
Finally, the last word changes from unstressed to stressed. hus such it is a crucial, if sometimes neglected, part of the language
we ind novelty, therefore the triggering of at least mild interest, sciences as well.
at two levels.
In addition, there are interpretive and speciic emotional con-
sequences. he disjunction of syntactic and verse breaks may help WORKS CITED AND SUGGESTIONS FOR FURTHER READING
to give the poem a sense of voice. Speciically, it suggests to me a
Amaladass, Anand. 1984. Philosophical Implications of Dhvani:
speaker who pauses before saying what is most painful about his
Experience of Symbol Language in Indian Aesthetics. Vienna: De Nobili
absence. Being away from home means that one is absent from Research Library.
many things. But, here, there is only one crucial attachment. he Biber, Douglas, Susan Conrad, and Randi Reppen. 2006.
line break imitates the emotion that so often interrupts ones Corpus Linguistics: Investigating Language Structure and Use.
speech when such occasions are real and directly addressed. Cambridge: Cambridge University Press.
here may also be efects of tempo in these lines. Hw lke Chandler, Steven. 1991. Metaphor comprehension: A connection-
wntr is heavy with accents and thus slower than th letng ist approach to implications for the mental lexicon. Metaphor and
yar. his provides an instance of sound echoing sense. It also Symbolic Activity 6.4: 22758.
suggests an interpretive point time lies in the sense that life Dasenbrock, Reed Way, ed. 1993. Literary heory After Davidson.
University Park: Pennsylvania State University Press.
passes quickly, leaving us little time together; but, being apart, we
Eliot, T. S. 2001. Tradition and the individual talent. In he Norton
experience each moment of that leeting time as a slow drag.
Anthology of heory and Criticism, ed. Vincent B. Leitch, 10928. New
We would have many further points to interpret if we were
York: W. W. Norton.
to consider lexical choice, metaphorical patterning, the louting Fabb, Nigel, and Morris Halle. 2006. Metrical complexity in Christina
of conversational principles, and so forth. Each of these directs Rossettis verse. College Literature 33.2: 91114.
us toward the endless interpretability of verbal art. As in other Fish, Stanley. 1980. Is here a Text in his Class? he Authority of
works, there is no clear operational criterion that would tell us Interpretive Communities. Cambridge: Harvard University Press.
that we have reached the end of our interpretation or that our Frijda, Nico. 1986. he Emotions. Cambridge: Cambridge University
interpretation is correct. Of course, as Fish indicates, there are Press.
professional constraints on our judgments in these areas. A Gadamer, Hans-Georg. 1989. Truth and Method. 2d ed. Trans. Joel
psychoanalytic critic might discuss how the poet seems to shift Weinsheimer and Donald Marshall. New York: Crossroad.
Gerrig, Richard, and Deborah Prentice. 1996. Notes on audience
between the position of a separated lover and that of an orphan,
response. In Post-heory: Reconstructing Film Studies, ed. David
suggesting an oedipal relation to the beloved. A writer in queer
Bordwell and Nol Carroll, 388403. Madison: University of Wisconsin
theory might stress that the poet puts his beloved in the position
Press.
of the (deceased) father, not the mother, thus suggesting a male Grice, Paul. 1989. Studies in the Way of Words. Cambridge: Harvard
beloved. University Press.
his brings us back to interpretation and reception. Again, we Hogan, Patrick Colm. 1996. On Interpretation: Meaning and Inference
are always free to take indirect address and put it into a more in Law, Psychoanalysis, and Literature. Athens: University of Georgia
directly referential context. For example, we may seek to read Press.
through the sonnets to Shakespeares own life and sexual feel- . 2003. he Mind and Its Stories: Narrative Universals and Human
ings. Indeed, as Walter Ong pointed out many years ago, we have Emotion. Cambridge: Cambridge University Press.
a strong tendency to do just that, placing the decontextualized Jakobson, Roman. 1987. Language in Literature, ed. Krystyna Pomorska
and Stephen Rudy. Cambridge, MA: Belknap Press.
voice of the poem back in a human body at a particular place in a
Kane, Julie. 2004. Poetry as right-hemispheric language. Journal of
particular time. hat sense of concrete human embodiment is
Consciousness Studies 11.5/6: 2159.
itself no doubt a crucial part not only of literary response but also
Kiparsky, Paul. 1987. On theory and interpretation. In he Linguistics
of all human communication. of Writing: Arguments Between Language and Literature, ed. Nigel
he preceding point suggests once again the distinctive- Fabb, Derek Attridge, Alan Durant, and Colin MacCabe, 18598.
ness of verbal art and its continuity with other speech actions. New York: Methuen.
73
Knapp, Steven, and Walter Benn Michaels. 1985. Against theory. In Ong, Walter J., S.J. he jinnee in the well wrought urn. In he
Against heory: Literary Studies and the New Pragmatism, ed. W. J. T. Barbarian Within and Other Fugitive Essays and Studies, 1525. New
Mitchell, 1130. Chicago: University of Chicago Press. York: Macmillan.
Labov, William. 1972. Language in the Inner City: Studies in the Black Pratt, Mary Louise. 1977. Toward a Speech Act heory of Literary Discourse.
English Vernacular. Philadelphia: University of Pennsylvania Press. Bloomington: Indiana University Press.
Lakof, George, and Mark Turner. 1989. More han Cool Reason: A Field Shakespeare, William. 2006. he Sonnets, ed. G. Blakemore Evans.
Guide to Poetic Metaphor. Chicago: University of Chicago Press. Cambridge: Cambridge University Press.
MacNeilage, Peter. 1998. Evolution of the mechanism of language Stockwell, Peter. 2002. Cognitive Poetics: An Introduction.
output: Comparative neurobiology of vocal and manual commu- London: Routledge.
nication. In Approaches to the Evolution of Language: Social and Tsur, Reuven. 2006. Kubla Khan: Poetic Structure, Hypnotic Quality and
Cognitive Bases, ed. James Hurford, Michael Studdert-Kennedy, and Cognitive Style A Study in Mental, Vocal and Critical Performance.
Chris Knight, 22241. Cambridge: Cambridge University Press. Amsterdam: John Benjamins.
Martindale, Colin. 1995. Creativity and connectionism. In he Creative Walter, Henrik. 2001. Neurophilosophy of Free Will: From Libertarian
Cognition Approach, ed. Steven Smith, homas Ward, and Ronald Illusions to a Concept of Natural Autonomy. Trans. Cynthia Klohr.
Finke, 24968. Cambridge, MA: MIT Press. Cambridge, MA: MIT Press.
Oatley, Keith. 2002. Emotions and the story worlds of iction. In Narrative
Impact: Social and Cognitive Foundations, ed. Melanie Green, Jefrey
Strange, and Timothy Brock, 3969. Mahwah, NJ: Erlbaum.
74
THE CAMBRIDGE ENCYCLOPEDIA OF
THE LANGUAGE SCIENCES

Abduction Absolute and Statistical Universals
A
and system of language that following generations infer often
difer from the system earlier generations are using. his often
results in semantic change, syntactic change, and sound
change.
ABDUCTION
Albert Atkin
Abduction is a form of reasoning irst explicated by the nine-
teenth-century philosopher C. S. Peirce. he central concept WORKS CITED AND SUGGESTIONS FOR FURTHER READING
he wishes to introduce is that of generating new hypotheses to
Anderson, Henning. 1973. Abductive and deductive change. Language
explain observed phenomena partly by guesswork or specu- 49: 76593.
lation. In his early work, Peirce tried to explain abductive rea- Burks, Arthur. 1946. Peirces theory of abduction. Philosophy of Science
soning, as distinct from deductive and inductive reasoning, by 13: 3016.
reference to syllogistic form. For instance, the following schema McMahon, April. 1994. Understanding Language Change.
is an example of deductive reasoning: Cambridge: Cambridge University Press.
Peirce, C. S. 1935. he Collected Papers of Charles S. Peirce. Vol. 5.
All the beans in the bag are white Cambridge: Harvard University Press.
hese beans came from this bag
herefore, these beans are white
ABSOLUTE AND STATISTICAL UNIVERSALS
his is distinct from inductive reasoning which, Peirce argues,
Language universals are statements that are true of all lan-
follows this pattern:
guages; for example, all languages have stop consonants. But
hese beans came from this bag beneath this simple deinition lurks deep ambiguity, and this
hese beans are white triggers misunderstanding in both interdisciplinary discourse
herefore, all the beans in this bag are white and within linguistics itself. A core dimension of the ambiguity
is captured by the opposition absolute versus statistical uni-
And both these forms are distinct from abductive reasoning versal, although the literature uses these terms in varied ways.
which, Peirce argues, follows this pattern: Many textbooks draw the boundary between absolute and statis-
hese beans are white tical according to whether a sample of languages contains excep-
All the beans in this bag are white tions to a universal. But the notion of an exception-free sample
herefore, the beans came from this bag is not very revealing, even if the sample contained all known
languages: here is always a chance that an as yet undescribed
In later work, however, Peirce felt that trying to it abduc- language, or an unknown language from the past or future, will
tive reasoning into such a strict syllogistic form was restric- provide an exception.
tive, and instead he opted for the following schema to explain It is impossible, in principle, to survey all languages of our spe-
abduction: cies. If we nevertheless want to make claims about all languages,
he surprising fact C is observed only two routes are open: a priori deduction of necessarily true
But if A were true, C would be a matter of course statements or statistical extrapolation from empirical samples to
Hence, there is a reason to suspect that A is true. the entire set. Absolute universals can then be deined as those
(Peirce 1935, 189) that are necessarily true, statistical universals as those that are
extrapolated from samples.
For example, suppose I observe that my car will not start.
One good explanation for this would be that it is out of fuel. Absolute Universals
Consequently, it seems that we have a good reason to think that For statements to be necessarily true, they must follow from a
my cars refusal to start is due to its being out of fuel. Of course, priori assumptions. he assumptions that linguists make are
we may very quickly discover that my car has plenty of fuel, diverse and heavily debated. An example is the assumption that
and a diferent hypothesis must be adopted, but Peirce always words consist of morphemes, that is, minimal form-mean-
intended that abductive reasoning was fallible and conjectural, ing pairs. If one accepts this, then it is necessarily true that all
awaiting conirmation from other testing. languages have morphemes, and there cannot be exceptions.
Peirces account of abduction has been widely adopted in Why? Suppose someone claims to have discovered a language
the philosophy of science, but it has also been of some interest without morphemes. One can of course simply analyze the lan-
to linguists. One particularly prominent use of abduction has guage without mentioning morphemes, but obviously that can-
been in historical linguistics for explaining language not challenge the universal just because one can always defend
change (see, for instance, Anderson 1973). he systematic it by reanalyzing the language with morphemes. he only true
features of a language that govern the use of one generation challenge would be to show that analyzing some data in terms
are opaque to the following generation as they acquire that lan- of morphemes leads to structures that are in conlict with other
guage the only access is through language output. It appears, assumptions, for example, that form-meaning pairs combine
then, that following generations must use abductive inferences exclusively by linear concatenation. he conlict can be illustrated
to access the rules of language before applying those rules to by languages with morphologies like the English plural geese,
new cases. And, of course, since abduction is fallible, the rules where the meanings plural and goose do not correspond to linear
77
Absolute and Statistical Universals
strings of morphemes. Confronted with such data, there are his view of absolute universals is highly controversial: Many
three options: linguists limit absolute universals to what is descriptively necessary
in every language; many psychologists propose that children apply
(i) give up the notion of morpheme;
diferent and much more general principles in acquiring a lan-
(ii) give up the assumption of linear concatenation; guage than those found in linguists metalanguages; and to date,
(iii) add additional assumptions that reconcile the conlict. no absolute universal has been conirmed by genetic research.
On any of these options, the universal remains exception-

less: On solution (i), no language has morphemes; on solu- Statistical Universals
tions (ii) and (iii), all languages have morphemes. As a result, What is not an absolute universal is a variable (or character, or
absolute universals can never be falsiied by individual data. parameter): some languages have a certain structure or they
heir validity can only be evaluated by exploring whether they dont have it, or to diferent degrees. It is interesting to note that
are consistent with other absolute universals that are claimed most variables show some skewing in their distribution; some val-
simultaneously. ues of a variable are favored only in certain geographical areas (rel-
Absolute universals can also be thought of as those aspects ative pronouns in Europe) or only in certain families (stem-internal
of ones descriptive metalanguage often called a theoreti- inlection in Afroasiatic). But some values are globally favored (e.g.,
cal framework that are necessarily referred to in the analysis nasals) or, what is more typical, globally favored under certain
of every language, that is, that constitute the descriptive a priori. structural conditions (e.g., postnominal relative clauses among
Depending on ones a priori, this includes, apart from the mor- languages with objects following the verb). hese global prefer-
pheme, such notions as distinctive feature, constituent (see con- ences are called unconditional (unrestricted) and conditional
stituent structure), argument, predicate (see predicate (restricted) statistical universals, respectively. (An alternative term
and argument), reference, agent, speaker, and so on. In some for conditional statistical universals is implicational univer-
metalanguages, the a priori also includes more speciic assump- sals, but this invites confusion because their probablistic nature
tions, for example, that constituents can only be described by diferentiates them from logical implications; cf. Cysouw 2005)
uniform branching (all to the left, or all to the right), or only by Statistical universals are mostly motivated by theories of how
binary branching, and so on. languages develop, how they are used, how they are learned,
he status of absolute universals is controversial. For many and how they are processed. One such theory, for example, pro-
linguists, especially in typology and historical linguis- poses that processing preferences in the brain lead to a universal
tics, absolute universals are simply the descriptive a priori, increase in the odds for postnominal structures among verb-ob-
with no additional claim on biological or psychological real- ject languages (Hawkins 2004).
ity. he choice between equally consistent universals/meta- Statistical universals take the same forms as statistical hypoth-
languages for example, among options (i), (ii), and (iii) in eses in any other science for example, they can be formulated
the previous example is guided by their success in describing in terms of regression models. hey can be tested with the same
structures and in deining variables that capture distributional range of statistical methods as in any other science, and, again
patterns, an evaluation procedure comparable to the way in as in other sciences, the appropriate choice of models, popula-
which technical instruments for analyzing objects are evaluated tion assumptions, and testing methods is an issue of ongoing
in the natural sciences. In the morphology problem, typologists research (e.g. Cysouw 2005; Janssen, Bickel, and Ziga 2006;
would most likely chose option (ii) because it allows for deining Maslova 2008).
a variable of stem-internal versus aixal plural realization that A central concern when testing statistical universals is to
has an interesting distribution (suggesting, for example, that ascertain true globality, that is, independence of area and fam-
within-stem realization is favored by a few families in Africa and ily. Areas can be controlled for by standard factorial analysis, but
the Near East). it is an unsettled question just what the relevant areal relations
In generative grammar, by contrast, absolute universals are; for example, should one control for the inluence of Europe
are thought of not only as descriptively a priori but also as bio- or the entire Eurasia or both? A quick solution is to assume a
logically given in what is called universal grammar: they are standard set of ive or six macroareas in the world and accept as
claimed to be innate (see innateness and innatism) and to universal if a distribution is independent of these areas (Dryer
be identical to the generalizations that a child makes when learn- 1989). But the rationale for such a set is problematic, and this
ing language. hus, if the morpheme is accepted as a universal, has led to a steep surge of interest in research on areas and their
that is, a priori term of our metalanguage, it will also be claimed to historical background (e.g., Nichols 1992; Haspelmath et al.
be part of what makes languages learnable (see learnability) 2005).
and to be part of our genetic endowment. An immediate conse- Controlling for family relations poses another problem.
quence of such an approach is that something can be claimed as Under standard statistical procedures, one would draw ran-
universal even if it is not in fact necessary in the analysis of every dom samples of equal size within each family and then model
language. For example, even if some language (e.g., the Rotokas families as levels of a factor. However, over a third of all known
language of Bougainville) lacks evidence for nasal sounds, one families are isolates, containing only one member each. And
could still include a distinctive feature [ nasal] in Universal picking one member at random in larger families is impossible
Grammar. Rotokas speakers are then said to have the feature as if at the same time one wants to control for areas (e.g., admitting
part of their genetic endowment even if they dont use it. an Indo-European language from both Europe and South Asia).
78
Absolute and Statistical Universals Accessibility Hierarchy
In response to this problem, typologists seek to ensure represen- Maslova, E.. 2000. A dynamic approach to the veriication of distribu-
tativity of a sample not by random selection within families but tional universals. Linguistic Typology 4: 30733.
by exhaustive sampling of known families, stratiied by area. In . 2008. Meta-typological distributions. Sprachtypologie und
order to then control for unequal family sizes, one usually admits Universalienforschung 61: 199207.
Newmeyer, F. J. 2005. Possible and Probable Languages: A Generative
only as many data points per family as there are diferent values
Perspective on Linguistic Typology. New York: Oxford University Press.
on the variables of interest (Dryer 1989; Bickel 2008).
Nichols, J. 1992. Language Diversity in Space and Time. Chicago: he
Samples that are not based on random sampling do not University of Chicago Press.
support parametric inference by statistical tests. An alternative Pinker, S., and R. Jackendof. 2005. he faculty of language: Whats spe-
to this is randomization methods (Janssen, Bickel, and Ziga cial about it? Cognition 95: 20136.
2006): he null hypothesis in these methods is that an observed Plank, F., and E. Filimonova. 2000. he Universals Archive: A
preference can be predicted from the totals of the sample (e.g., brief introduction to prospective users. Sprachtypologie und
that an observed 90% postnominal relatives in VO [verb-object] Universalienforschung 53: 10923.
languages could be predicted if 90% of the entire sample had Plank, F., ed. 2007. Linguistic Typology 11.1 (Special issue treating the
postnominal relatives) not that the sample stems from a popu- state of typology.)
lation without the observed preference. Extrapolation to the total
population (the entire of set of human languages) can then only ACCESSIBILITY HIERARCHY
be based on plausibility arguments: If a preference signiicantly Edward L. Keenan and Bernard Comrie (1972, 1977) introduce
deviates from what is expected from the totals of the observed the accessibility hierarchy (AH) as a basis for several cross-
sample, it is likely that the preference holds in all languages. A linguistic generalizations regarding the formation of relative
key issue in such argumentation is whether the tested variables clauses (RCs).
are suiciently unstable over time so that a present sample can
be assumed to not relect accidental population skewings from AH: SUBJ > DO > IO > OBL > GEN > OCOMP
early times in prehistory (Maslova 2000). In response to this, he terms of the AH are main clause subject, direct object, indi-
typologists now also seek to test universals by sampling language rect object, object of pre- or postposition, genitive (possessor), and
changes instead of language states a move that is sometimes object of comparison. Keenan and Comrie cross-classiied RCs
called the dynamization of typology (Greenberg 1995; Croft along two parameters: 1) the head noun precedes or follows the
2003). restrictive clause (RestCl), and 2) the case of the position relativ-
While the number of proposed statistical universals is ized, NPrel, is pronominally marked or not. In (1), from German,
impressive the Universals Archive at Konstanz has collected the RestCl, underlined, follows the head in (1a) and precedes it
more than 2,000 (Plank and Filimonova 2000) very few of them in (1b). In (2a,b), from Hebrew and Russian, NPrel is pronomi-
have been rigorously tested for independence of area, family, nally marked but not in English.
and time.
(1) a. der Mann, der in seinem Bro arbeitet
Balthasar Bickel the man, who in his study is+working
the man who is working in his study
b. der in seinem Bro arbeitende Mann
Bickel, B. 2008. A reined sampling procedure for genealogical control. the in his study working man
Sprachtypologie und Universalienforschung 61: 2233. the man who is working in his study
Croft, W. 2003. Typology and universals. 2d ed. Cambridge: Cambridge
University Press. (2) a. ha-isha she Dan natan la et ha-sefer
Cysouw, M. 2005. Quantitative methods in typology. In Quantitative the-woman that Dan gave to+her acc the-book
Linguistics: An International Handbook, ed. G. Altmann, R. Khler, and the woman that Dan gave the book to
R. Piotrowski, 55478. Berlin: Mouton de Gruyter. b. devuka, kotoruju Petr ljubit
Cysouw, M, ed. 2008. Special issue on analyzing he World Atlas of girl, who(acc) Peter loves
Language Structures. Sprachtypologie und Universalienforschung 61. the girl who Peter loves
Dryer, M. S. 1989. Large linguistic areas and language sampling. Studies
in Language 13: 25792. A given choice of values for the two parameters deines an
Greenberg, J. H. 1995. he diachronic typological approach to lan- RC-forming strategy. A strategy that applies to SUBJ is called pri-
guage. In Approaches to Language Typology, ed. M. Shibatani and T. mary. German has two primary strategies, a postnominal, +case
Bynon, 14366. Oxford: Clarendon. one, (1a), and a prenominal, case one, (1b). Keenan and Comrie
Haspelmath, M., M. S. Dryer, D. Gil, and B. Comrie, eds. 2005. he World support three hierarchy generalizations:
Atlas of Language Structures. Oxford: Oxford University Press.
Hauser, M. D., N. Chomsky, and W. T. Fitch. 2002. he faculty of lan- (3) a. All languages have a primary strategy
guage: What it is, who has it, and how did it evolve? Science 298: 1569 b. A given RC-forming strategy must apply to a continuous
79. his paper and the response by S. Pinker and R. Jackendof 2005 segment of the AH
launched an ongoing debate on the nature and extent of absolute uni- c. A primary strategy may cease to apply at any position on
versals in generative grammar. the AH
Hawkins, J. A. 2004. Eiciency and Complexity in Grammars.
Oxford: Oxford University Press. For example, many West Austronesian languages, such as
Janssen, D., B. Bickel, and F. Ziga. 2006. Randomization tests in lan- Malagasy (Madagascar), only have primary strategies. So we can
guage typology. Linguistic Typology 10: 41940. only relativize the agent in (4a).
79
Accessibility Hierarchy Acoustic Phonetics
(4) a. Manolotra (m+aN+tolotra) vary ho anny vahiny aminny causativizing a transitive verb, its agent argument may surface as
lovia vaovao ny tovovavy an IO (Jai fait manger les pinards aux enfants I made-eat the
ofers (pres+act+ofer) rice forthe guests onthe dishes new the spinach to the children). Lastly, S. Hawkins and Keenan (1987)
young+woman show psycholinguistically that recall of RCs formed on high posi-
he young woman ofers rice to the guests on the new dishes tions on the AH was better than recall of ones formed on low
b. ny tovovavy (izay) manolotra vary ho anny vahiny aminny positions.
lovia vaovao One interesting modiication to the hierarchy generaliza-
the woman (that) ofers rice tothe guests onthe dishes new tions concerns syntactic ergativity. Keenan and Comrie noted
the young woman who ofers rice to the guests on the new dishes that Dyirbal (Dixon 1972) relativizes absolutives intransitive
c. *ny vary (izay) manolotra ho anny vahiny aminny lovia subjects and transitive objects, but not transitive subjects. A
vaovao ny tovovavy verbal aix (antipassive) derives intransitive verbs from transi-
the rice (that) ofers tothe guests onthe dishes new the tive ones with the agent as subject, hence relativizable. Mayan
young+woman languages such as Jacaltec (Craig 1977, 196) are similar. his is
the rice that the young woman ofers to the guests on the new an elegant solution to the requirement that agents be relativ-
dishes izable, analogous to Bantu applicatives or Austronesian voices
aixes.
he irst four words in (4c) claim that the rice is doing the
ofering a nonsense. Malagasy does not, however, have an (7) a. x s watxe naj hun-ti
expressivity gap here since it has a rich voice system allow- asp 3abs 3erg make cl:man one-this
ing any major NP in a clause as subject. he form of ofer that He made this
takes theme subjects is atolotra, recipient subjects tolorana, b. naj x watxe n hun-ti
and oblique subjects anolorana. (5a,b) illustrate heme and cl:man asp 3abs make ap one-this
Instrument RCs. the man (who) made this
(5) a. ny vary (izay) atolo-dRasoa ho anny vahiny aminny lovia Edward L. Keenan
vaovao
the rice (that) ofered-by+Rasoa forthe guests onthe new WORKS CITED AND SUGGESTIONS FOR FURTHER READING
dishes
Comrie, Bernard. 1976. he syntax of causative constructions: Cross-
the rice that the young+woman ofers to the guests on the
language similarities and divergences. In he Grammar of Causative
new dishes Constructions, Syntax and Semantics 6, ed. Masayoshi Shibatani, 261
b. ny lovia vaovao (izay) anoloran-dRasoa vary ho anny 312. Amsterdam: Academic Press.
vahiny Craig, Colette. 1977. Jacaltec. Austin: University of Texas Press.
the dishes new (that) ofered-by+Rasoa rice for the guests Dixon, Robert M. W. 1972. he Dyirbal Language of North Queensland.
the new dishes on which the young woman ofered rice to the Oxford: Cambridge University Press.
guests Hawkins, Sarah, and Edward L. Keenan. 1987. he psychological validity
of the accessibility hierarchy. In Universal Grammar: 15 Essays, ed. E.
Bantu languages, such as Luganda, (6), illustrate the DO cutof. L. Keenan, 6089. London: Croom Helm.
Only subjects and objects are directly relativizable. Obliques can Keenan, Edward L. 1975. Variation in Universal Grammar. In Analyzing
be promoted to object using applicative aixed verbs. So the Variation in English, ed. R. Fasold and R. Shuy, 13648. Washington,
instrumental in (6a) is only relativizable from (6c). DC: Georgetown University Press.
Keenan, Edward L., and Bernard Comrie. 1972. Noun phrase accessibil-
(6) a. John yatta enkoko n (= na) ekiso
ity and Universal Grammar. Paper presented at the Annual Meetings
John killed chicken with knife of the Linguistic Society of America, Atlanta.
b. *ekiso John kye-yatta enkoko (na) knife . 1977. Noun phrase accessibility and Universal Grammar.
John rel-killed chicken (with) Linguistic Inquiry 8.1: 6399.
c. John yattisa (yatt+is+a) ekiso enkoko
John kill+with knife chicken
John killed+with a knife the chicken ACOUSTIC PHONETICS
d. ekiso John kye-yattisa enkoko knife Like the rest of linguistics, acoustic phonetics combines descrip-
John rel-kill+with chicken tion with theory. Descriptions are images of acoustic properties
the knife John killed the chicken with and quantitative measures taken from these images, and theory
Independent support for the AH: Keenan (1975) shows that accounts for the way in which a sounds articulation determines
stylistically simple texts used RCs formed high on the AH proits acoustics.
portionately more than texts independently judged stylistically
hard. Second, Comries work (1976) supports the conclusion that Description
the positioning of demoted subjects in morphological causatives he three most commonly used images of speech are the wave-
tends to assume the highest function on the AH not already illed. form, spectrum, and spectrogram (Figure 1). he waveform dis-
hus in the French Jai fait rire les enfants I made-laugh the chil- plays diferences in sound pressure level (in pascals) over time
dren, the children surfaces as a DO as laugh lacks a DO. But in (Figure 1a, d), the spectrum diferences in sound pressure level
80
Acoustic Phonetics
Figure 1. Waveforms, spectra, and

spectrograms of 30 ms intervals of
the vowel [i] (ac), and the fricative
[s] (df).
(in deciBels) over frequency (Figure 1b, e), and the spectrogram Note that the numerator in (2) is not 1 but instead c, the speed
diferences in frequency over time (Figure 1c, f,); darkness indi- of sound.
cates the sound pressure level at particular frequencies and he spectrum and spectrogram of [i] (Figures 1b, c) show peaks
moments in the spectrogram. and horizontal bands, respectively, known as formants, at roughly
he images in Figures 1ac difer from those in Figure 1df 300, 2,200, and 2,800 Hz. he corresponding images of [s] (Figures
in every conceivable respect: Sound pressure level varies more 1e, f) show a broad energy band spanning 4,0007,000 Hz.
or less regularly and repetitively, every 0.0073 second, in the Whether a sound is periodic and where in its spectrum energy
vowel [i] (as in heed), while in the fricative [s] (as in see), it is concentrated are nearly suicient to distinguish all speech
instead varies nearly randomly. he vowel is thus nearly peri- sounds from one another acoustically, and these two proper-
odic, while the fricative is decidedly aperiodic. his diference ties also relect the two components of the theoretical model for
gives the vowel a clear pitch, while making the fricative instead transforming articulations into acoustics. All voiced sounds are
sound noisy. periodic, as are trills. Sonorants (vowels, glides, liquids, nasals)
A single cycles duration in a periodic sound is its period (T); are usually periodic, while obstruents (fricatives, stops, afri-
the distance it travels in space is its wavelength (). As measures cates) are aperiodic. Voiced obstruents are both periodic and
of a single cycles extent, both period and wavelength are recip- aperiodic. Diferences in vowel quality and consonantal place of
rocally related to frequency (F), the number of cycles per second, articulation are both realized acoustically as diferences in where
or Hz: energy is concentrated in their spectra. he remaining property
is duration, which besides conveying short:long contrasts also
1
(1) F (cycles/sec) = contributes to the conveying of tense:lax contrasts between vow-
T (sec/cycle) els and the voicing and manner contrasts between consonants.
c (cm/sec)
Theory
(2) F (cycles/sec) = Speech sounds are the product of the application of a ilter that
(cm/cycle)
determines the frequencies in which energy is concentrated
81
Acoustic Phonetics
First resonance
released or continuously through a fricatives narrow con-
Glottis = closed Lips = open
striction. In strident sounds, the jet breaks up against a baffle
just downstream, increasing turbulence and noise intensity
considerably.
Minimum Maximum RESONANCE. Both periodic and aperiodic sound sources intro-
duce acoustic energy into the oral cavity across a broad enough
range of frequencies to excite any resonance of the air inside
0 2.5 5 7.5 10 12.5 15 17.5
the oral cavity. If the articulators are in their rest positions
Distance from glottis (cm) (a) and vocal folds are in the voicing position, this cavitys shape
approximates a uniform tube, closed at the glottis and open at
the lips (Figure 2). A resonance is produced by the propaga-
Second resonance
tion of acoustic energy away from the source and its relection
back and forth of the two ends of the tube, which establishes a
standing wave. In a standing wave resonance, the locations of
zero and maximum pressure variation are ixed.To understand
how air resonates, it is easier to consider the change in pressure
level in the standing wave, rather than pressure level itself, that
is, the extent to which the air molecules are being displaced
0 2.5 5 7.5 10 12.5 15 17.5 longitudinally, or equivalently the velocity of the pressure
Distance from glottis (cm) (b) change, rather than the extent of their instantaneous compres-
sion or rarefaction. Air is most freely displaced longitudinally
at the open end, the lips, and least freely at the closed end, the
Third resonance glottis. As a result, the standing waves that it best inside the
oral cavity are those whose wavelengths, and thus frequencies,
are such that they have a velocity maximum (antinode) at the
lips, and a velocity minimum (node) at the glottis. Because the
air resonates more robustly at these frequencies than at oth-
ers, the oral cavitys resonant response ilters the sound source,
passing energy in the source at some frequencies and stopping
it at others.
0 2.5 5 7.5 10 12.5 15 17.5
Figures 2ac show the three lowest-frequency standing
Distance from glottis (cm) (c) waves that fit these boundary conditions. How are their fre-
Figure 2. The oral cavity as a tube closed at one end and open at the quencies determined? Figures 2ac show that one-quarter
other: (ac) standing waves corresponding to the irst three resonances, of the first resonances wavelength spans the distance from
each with a velocity minimum at the closed end and a velocity maximum the glottis to the lips (the oral cavitys length, Loc), three-
at the open end. quarters of the seconds, and five-quarters of the thirds. More
generally:
(3) 2n 1
Loc = n
in the sounds spectrum to a periodic and/or aperiodic sound 4
source.
where n is the resonance number. Solving for wavelength and
SOUND SOURCES. Sound sources are produced by using valves substituting into (2) yields:
that control air low through the vocal tract to transform the
energy in that low into sound.In periodic sound sources, the c
(4) Fn =
low of air causes a valve to open and close rapidly and regularly, 4
Loc
which in turn causes air pressure to rise and fall just downstream. 2n 1
he repeated opening and closing of the glottis, known as vocal
fold vibration, is the most common periodic sound source; oth- Substituting 35,000 cm/sec for c and 17.5 cm for Loc (the aver-
ers are uvular, alveolar, and bilabial trills. age adult males oral cavity length) yields 500, 1,500, and 2,500
Aperiodic sound sources are produced by keeping a valve Hz as the irst three resonances frequencies, values close to
completely or nearly closed, in stops and fricatives, respec- schwas.
tively. Either way, oral air flow is obstructed enough that Because the variable Loc is in the denominator, resonance
oral air pressure rises behind the obstruction. This pressure frequencies are lower in adults and mens longer oral cav-
rise speeds up flow enough to turn it into a turbulent and ities than in childrens or womens shorter ones, and like-
thus noisy jet, in either a brief burst when a stop closure is wise when the lips are protruded in a rounded vowel, such
82
Acoustic Phonetics
Figure 4. Spectrograms of the irst 150 ms of the words (a) bad, (b)
Figure 3. Spectra of the vowels (a) [i] as in heed, (b) [u] as in whod, dad, and (c) gad. The onsets of F1F3 are labeled.
and (c) [a] as in hod. The individual peaks are the harmonics of the fun-
damental frequency (F0) of voice sound source, and the formants are the
ranges of ampliied harmonics. Peaks corresponding to F1F3 are labeled
at the top of each panel.
is higher in front unrounded [i] than schwa but lower in low
back unrounded [a] and especially high back rounded [u]. F1
varies inversely with tongue height, and F2 varies directly with
as the [u] in whod, rather than spread in an unrounded one, tongue backness and lip rounding. The formant frequencies
such as the [i] in heed. These observations yield the length of all other vowel qualities lie between the extremes observed
rule: Resonance frequencies vary inversely with resonating in these vowels, just as all other vowels lingual and labial
cavity length. articulations lie between those of these vowels. F1 starts low
he irst three formants of [i] (Figures 1b, c) difer decid- and rises following [b, d, g] (Figure 4), both F2 and F3 start low
edly in frequency from those of schwa, because raising the following [b] (Figure 4a), both formants start higher following
tongue body toward the front of the palate decreases the oral [d] (Figure 4b), and they diverge from very similar frequencies
cavitys cross-sectional area there and increases it in the phar- following [g] (Figure 4c). Although consonants are articulated
ynx, while spreading the lips, which shortens the oral cavity. at other places of articulation, these three places are distin-
Although the length rule predicts how shortening changes for- guished in nearly all languages, and many distinguish only
mant frequencies, an additional rule is needed to predict how these three.
decreasing and increasing cross-sectional area afects formant he irst heuristic treats the constriction as dividing the oral
frequencies. cavity into separate resonating cavities (Figure 5), and it applies
The predictions of two heuristics that are widely used for the length rule independently to each of them. he irst three for-
this purpose are tested here against the observed formant mants are the three lowest of the six resonances produced by the
frequency differences between the three vowels [i, u, a], and two cavities. his heuristic may be called the cavity association
between the three places of articulation of the stops in [b, heuristic because each formant can be associated with the cavity
d, g]. F1 is lower in the high vowels [i, u] (Figures 3a, b) from which it came. here are two complications. First, the cav-
than in schwa, but higher in the low vowel [a] (Figure 3c). F2 ity behind the constriction is efectively closed at both ends, and
83
Acoustic Phonetics
constriction
back 8 cm2 1 cm2 front
3 cm
6 cm 8.5 cm
Figure 5. The coniguration of the oral cavity with a constriction partway along its length.
so its resonances must have velocity minima at both ends. heir

frequencies are predicted by:
c
(5) Fn =
2
n Lrc
where Lrc is the length of the resonating cavity. he second com-

plication is that the acoustic interaction of the constriction with
the cavity behind it produces a Helmholtz resonance. Its fre-
quency (Fh) is:
c Ac
(6) Fh =
2 Ab Lb Lc
Figure 6. The irst three resonance frequencies of the back cavity (illed
symbols) and front cavity (empty symbols) and the Helmholtz resonance
Ac is the constrictions cross-sectional area, Lc is its length, and (crosses) produced by incremental movement of the constriction in
Ab and Lb are the cross-sectional area and length of the cavity Figure 5.
behind the constriction. If a 3 cmlong constriction with a cross-
sectional area of 1 cm2 is moved incrementally from 3 cm above
the glottis to 0.5 cm back of the lips along a 17.5 cm oral cavity, efects of lip rounding, which closes the front cavity at both ends
the back and front cavities produce the resonance frequencies and introduces another Helmholtz resonance. None of the reso-
displayed in Figure 6, along with the Helmholtz resonance. he nances produced by this front cavity are lower than the back cav-
arrows projected down from the intersections between back and ity resonances, but the additional Helmholtz resonance is low
front cavity resonances show where F2 and F3 change associa- enough to constitute the F2 observed in [u] (657 Hz if the labial
tion from the front to the back cavity. constriction has a cross-sectional area of 1 cm2 and a length of 2
he constriction centers in [a, u, i] are roughly one-quarter cm, and the front cavity is 4.5 cm long).
(4.5 cm), two-thirds (11.5 cm), and three-quarters (13 cm) of the In the second, perturbation, heuristic, a constrictions prox-
distance from the glottis to the lips. he constriction of [g] is close imity to a resonances velocity minimum or maximum deter-
to [u]s, while [d]s is about seven-eighths the distance from the mines how it perturbs that resonances frequency away from its
glottis (15.5 cm), and [b]s is of course at the lips (17.5 cm). schwa-like value: A constriction near a velocity minimum raises
he Helmholtz resonance is lowest for all constriction loca- the formants frequency, while one near a maximum lowers it
tions and thus constitutes F1. It also lowers progressively as the instead (expansions have the opposite efects). Figures 4ac show
constriction is moved forward because the cavity behind the that minima occur at even quarters of a resonances wavelength
constriction lengthens. he cavity-association heuristics suc- and maxima at odd quarters, and that their locations are at ixed
cessful predictions include the following : 1) he low or pharyn- proportions of the length of the oral cavity. Because constric-
geal vowel [a] has a higher F1 than the high or velar and palatal tion locations are also a certain proportion of the distance from
vowels [u, i]; 2) F1 is low following [g, d, b]; 3) [a]s F2 (the front the glottis to the lips, whether they coincide with a minimum or
cavitys irst resonance) is low, 4) F2 and F3 (the front and back maximum can be calculated by multiplying both sides of (4) by
cavities irst resonances) start at very similar frequencies follow- the proportion of the oral cavitys length that corresponds to the
ing [g], because a velar constriction is close to where the front constrictions location and rounding the result on the right-hand
and back cavities irst resonances cross at 11 cm from the glottis; side to the nearest quarter (Table I).
5) F2 and F3 start low following [b] (the back cavitys irst and he perturbation heuristic successfully predicts the efects of
second resonances). It incorrectly predicts: 6) he F2 of [i] (the the bilabial, palatal, velar, and pharyngeal constrictions on all
back cavitys irst resonance) is low, indeed lower than [u]s; and three formants of [b, i, g, a], and likewise the efects of the alveo-
7) F2 and F3 (the back cavitys irst and second resonances) are lar and velar constrictions on F1 and F3 in [d, u], but it fails to
low following [d]. For [u], the calculations leave out the acoustic predict F2 raising after [d], and F2 lowering in [u]. he latter can
84
Adaptation
Table 1. Calculating a constrictions proximity to a resonances velocity minimum or maximum from the constrictions proportional distance from
the glottis to the lips.
Place of Segment Proportion of oral Proximity to minimum or maximum

constriction cavity length
Calculation Odd/even Lower/higher
Labial b 1 1*1/4=1/4 Odd F1 lower
1*32/4=3/4 Odd F2 lower
1*53/4=1/4 Odd F3 lower
Alveolar d 7/8 7/8*1/4=7/321/4 Odd F1 lower
7/8*32/4=21/323/4 Odd F2 lower
7/8*53/4=35/324/4 Even F3 higher
Palatal i 3/4 3/4*1/4=3/161/4 Odd F1 lower
3/4*32/4=9/162/4 Even F2 higher
3/4*53/4=15/164/4 Even F3 higher
Velar g, u 2/3 2/3*1/4=2/121/4 Odd F1 lower
2/3*32/4=6/122/4 Even F2 higher
2/3*53/4=10/123/4 Odd F3 lower
Pharyngeal a 1/4 1/4*1/4=1/160/4 Even F1 higher
1/4*32/4=3/161/4 Odd F2 lower
1/4*53/4=5/161/4 Odd F3 lower
again be predicted once the acoustic efects of lip rounding are If a characteristic lacks any of these features, it is not an adap-
added, as the simultaneous labial constriction, together with the tation. An adaptation is not, therefore, simply anything in an
protrusion of the lips, lowers F2 along with all other formants. individual with a good or functional outcome, or that has
useful efects by intuitive standards. Rice cultivation, useful as it
Summary is, is not a biological adaptation because it lacks a speciic genetic
Speech sounds articulations produce sound sources by trans- basis. Similarly, the English language is not an adaptation,
forming aerodynamic energy into acoustic form, and those however useful it might be. In contrast, if a mutation occurred
sound sources in turn cause air inside the oral cavity to resonate, that modiied a neural structure so that the vocal chords could
at frequencies determined by the length of the resonating cavi- more reliably produce distinct phonemes, and this gene spread
ties and where they are constricted. throughout the species because its bearers prospered due to the
John Kingston advantages resulting from a lifetime of more eicient communi-
cation, then the modiied neural structure would qualify as an
WORKS CITED AND SUGGESTIONS FOR FURTHER READING adaptation.
Researchers judge whether something is an adaptation by
Fant, C., and M. Gunnar. 1960. Acoustic heory of Speech Production. he assessing how likely or unlikely it is that its functional organiza-
Hague: Mouton.
tion was produced by random mutation and spread by genetic
Jakobson, Roman, C. Fant, M. Gunnar, and Morris Halle. 1952.
drift. For example, the eye has hundreds of elements that are
Preliminaries to Speech Analysis, Cambridge, MA: MIT Press.
Ladefoged, Peter, and Ian Maddieson. 1996. Sounds of the Worlds
arranged with great precision to produce useful visual inputs. It is
Languages. Oxford: Blackwell Publishers. astronomically unlikely that they would have arrived at such high
Stevens, Kenneth N. 1998. Acoustic Phonetics. Cambridge, MA: MIT Press. levels of mutual coordination and organization for that function
unless the process of natural selection had diferentially retained
them and spread them throughout the species. Consequently,
ADAPTATION
the eye and the visual system are widely considered to be obvi-
An adaptation is a characteristic in an organism that evolved ous examples of adaptations. For the same reason, evolutionary
because it helped the organism or its relatives to survive and scientists consider it overwhelmingly likely that many neurocog-
reproduce. Examples include the vertebrate eye, claws, mam- nitive mechanisms underlying language are adaptations for
mary glands, the immune system, and the brain structures that communication (a proposition that Noam Chomsky has dis-
underlie the human capacity for language. More completely, puted; see Lyle Jenkinss essay, Explaining Language, in this
an adaptation is 1) a reliably developing set of characteristics volume). Language competence reliably develops, is believed
2) whose genetic basis became established and organized in the to have a species-typical genetic basis, and exhibits immensely
species (or population) over evolutionary time because 3) the complex internal coordination that is functionally organized to
adaptation interacted with recurring features of the body or envi- produce eicient communication, which vastly enhances the
ronment 4) in a way that, across generations, typically caused achievement of instrumental goals, plausibly including those
this genetic basis to increase its gene frequency. linked to itness.
85
Ad hoc Categories
Within the evolutionary sciences, the concept of adaptation relevant goal by organizing the current situation in a way that
plays an indispensable role not only in explaining and under- supports efective goal pursuit.
standing how the properties of organisms came to be what they Ad hoc categories contrast with thousands of well-established
are, but also in predicting and discovering previously unknown categories associated with familiar words (e.g., cat, eat, happy).
characteristics in the brains and bodies of species.Evolutionary Extensive knowledge about these latter categories resides in
psychologists, for example, analyze the adaptive problems our memory and may often become active even when irrelevant to
ancestors were subjected to, predict the properties of previ- current goals. When ad hoc categories are used frequently, how-
ously unknown cognitive mechanisms that are expected to have ever, they, too, become highly familiar and well established in
evolved to solve these adaptive problems, and then conduct memory. he irst time that someone packs a suitcase, the cate-
experimental studies to test for the existence of psychological gory things to pack in a suitcase is ad hoc. Following many trips,
adaptations with the predicted design (see evolutionary however, it becomes entrenched in memory.
psychology). An understanding that organisms embody sets Ad hoc categories constitute a subset of role categories,
of adaptations rather than just being accidental agglomerations where roles provide arguments for verbs, relations, and sche-
of random properties allows organisms to be properly studied as mata. Some role categories are so familiar that they become
functional systems. If language is accepted as being the product lexicalized (e.g., seller, buyer, merchandise, and payment name
of adaptations, then there is a scientiic justiication for studying the agent, recipient, theme, and instrument roles of buy). When
the underlying components as part of a functional system. the conceptualization of a role is novel, however, an ad hoc cate-
he concept of adaptation became more contentious when gory results (e.g., potential sellers of gypsy jazz guitars). Pursuing
human behavior and the human psychological architecture goals requires the constant speciication and instantiation of
began to be studied from an adaptationist perspective. Critics roles necessary for achieving them. When a well-established cat-
have argued that not every characteristic is an adaptation an egory for a role doesnt exist, an ad hoc category is constructed
error adaptationists also criticize. More substantively, critics to represent it.
have argued that it is impossible to know what the past was like Both conceptual and linguistic mechanisms appear central to
well enough to recognize whether something is an adaptation. forming ad hoc categories. Conceptually, people combine exist-
Adaptationists counter that we know many thousands of things ing concepts for objects, events, settings, mental states, proper-
about the past with precision and certainty, such as the three- ties, and so on to form novel conceptual structures. Linguistically,
dimensional nature of space and the properties of chemicals, the people combine words in novel ways to index these concepts.
existence of predators, genetic relatives, eyes, infants, food and Sometimes, novel concepts result from perceiving something
fertile matings, and the acoustical properties of the atmosphere, novel and then describing it (e.g., seeing a traditional opera set in
and that these can be used to gain an engineers insight into why a modern context and describing this newly encountered genre
organisms (including humans) are designed as they are. as modernized operas). On other occasions, people combine
words for conceptual elements before ever encountering an
Julian Lim, John Tooby, and Leda Cosmides
actual category instance (e.g., describing mezzo sopranos who
have power, tone, and lexibility before experiencing one). he
conceptual and linguistic mechanisms that formulate ad hoc
Gould, S. J., and R. C. Lewontin. 1979. he spandrels of San Marco and categories are highly productive, given that components of these
the Panglossian paradigm: A critique of the adaptationist programme. categories can be replaced systematically with alternative values
Proceedings of he Royal Society of London, Series B 205.1161: 58198. from semantic ields (e.g., tourist activities to perform in X, where
Pinker, Steven. 1994. he Language Instinct. New York: Morrow.
X could be Rome, Florence, Venice, etc.). Syntactic structures
. 2003. Language as an adaptation to the cognitive niche. In
are also central for integrating the conceptual/linguistic com-
Language Evolution, ed. M. Christiansen and S. Kirby, 1637. New
ponents in these categories (e.g., the syntax and accompanying
York: Oxford University Press.
Tooby, John, and I. DeVore. 1987. he reconstruction of hominid closed class words in tourist activities to perform in Rome).
behavioral evolution through strategic modeling. In he Evolution Lawrence Barsalou (1983) introduced the construct of ad hoc
of Primate Behavior: Primate Models. ed. W. G. Kinsey, 183237. New categories in experiments showing that these categories are not
York: SUNY Press. well established in memory and do not become apparent without
Williams, George C. 1966. Adaptation and Natural Selection: A Critique context. Once constructed, however, they function as coherent
of Some Current Evolutionary hought. Princeton, NJ: Princeton categories, exhibiting internal structures as indexed by typicality
University Press. gradients. Barsalou (1985) showed that these gradients are orga-
nized around ideal values that support goal achievement and
also around frequency of instantiation. He also showed (1987)
AD HOC CATEGORIES
that these internal structures are generally as stable and robust
An ad hoc category is a novel category constructed spontane- as those in familiar taxonomic categories.
ously to achieve a goal relevant in the current situation (e.g., con- Barsalou (1991) ofered a theoretical framework for ad hoc
structing tourist activities to perform in Beijing while planning a categories (see also Barsalou 2003). Within this framework, ad
vacation). hese categories are novel because they typically have hoc categories provide an interface between roles in knowl-
not been entertained previously. hey are constructed sponta- edge structures (e.g., schemata) and the environment. When a
neously because they do not reside as knowledge structures in role must be instantiated in order to pursue a goal but knowl-
long-term memory waiting to be retrieved. hey help achieve a edge of possible instantiations does not exist, people construct
86
Ad hoc Categories Adjacency Pair
an ad hoc category of possible instantiations (e.g., when going WORKS CITED AND SUGGESTIONS FOR FURTHER READING
camping for the irst time, constructing and instantiating Barsalou, L. W. 1983. Ad hoc categories. Memory & Cognition
activities to perform on a camping trip). he particular instan- 11: 21127.
tiations selected relect their it with a) ideals that optimize . 1985. Ideals, central tendency, and frequency of instan-
goal achievement and b) constraints from the instantiations tiation as determinants of graded structure in categories. Journal
of other roles in the knowledge structure (e.g., activities to of Experimental Psychology: Learning, Memory, and Cognition
perform on a camping trip should, ideally, be enjoyable and 11: 62954.
safe and should depend on constraints such as the vacation . 1987. he instability of graded structure: Implications for the nature
location and time of year). Once established, the instantiations of concepts. In Concepts and Conceptual Development: Ecological
of an ad hoc category are encoded into memory and become and Intellectual Factors in Categorization, ed. U. Neiser, 10140.
increasingly well established through frequent use (e.g., estab-
. 1991. Deriving categories to achieve goals. In he Psychology of
lishing touring back roads and socializing around the camp-
Learning and Motivation: Advances in Research and heory. Vol. 27.
ground as instances of activities to perform on a camping trip). Ed. G. Bower, 164. San Diego, CA: Academic Press.
Barsalou (1999) describes how this framework can be realized . 1999. Perceptual symbol systems. Behavioral and Brain Sciences
within a perceptual symbol system. Speciically, categories 22: 577660.
(including ad hoc categories) are sets of simulated instances . 2003. Situated simulation in the human conceptual system.
that can instantiate the same space-time region of a larger Language and Cognitive Processes 18: 51362.
mental simulation (where a simulation is the reenactment of Cech, C. J., E. J. Shoben, and M. Love. 1990. Multiple congru-
modality-speciic states, as in mental imagery). ity efects in judgments of magnitude. Journal of Experimental
Ad hoc categories have been studied in a variety of empiri- Psychology: Learning, Memory, and Cognition 16: 114252.
Chrysikou, E. G. 2006. When shoes become hammers: Goal-derived cat-
cal contexts. S. Glucksberg and B. Keysar (1990) proposed that
egorization training enhances problem-solving performance. Journal
ad hoc categories underlie metaphor (e.g., the metaphor jobs are
of Experimental Psychology: Human Learning and Performance
jails conceptualizes the category of conining jobs). C. J. Cech, E.
32: 93542.
J. Shoben, and M. Love (1990) found that ad hoc categories are Glucksberg, S., and B. Keysar. 1990. Understanding metaphorical com-
constructed spontaneously during the magnitude comparison parisons: Beyond similarity. Psychological Review 97: 318.
task (e.g., forming the ad hoc category of small furniture, such Lucariello, J., and K. Nelson. 1985. Slot-iller categories as mem-
that its largest instances anchor the upper end of the size dimen- ory organizers for young children. Developmental Psychology
sion). F. Vallee-Torangeau, S. H. Anthony, and N. G. Austin 21: 27282.
(1998) found that people situate taxonomic categories in back- Medin, D. L., N. Ross, S. Atran, D. Cox, J. Coley, J. Proitt, and S. Blok.
ground settings to form ad hoc categories (e.g., situating fruit 2006. Folkbiology of freshwater ish. Cognition 99: 23773.
to produce fruit in the produce section of a grocery store). E. G. Ross, B. H., and G. L. Murphy. 1999. Food for thought: Cross-classiication
and category organization in a complex real-world domain. Cognitive
Chrysikou (2006) found that people rapidly organize objects into
Psychology 38: 495553.
ad hoc categories that support problem solving (e.g., objects that
Valle-Tourangeau, F., S. H. Anthony, and N. G. Austin. 1998. Strategies
serve as platforms).
for generating multiple instances of common and ad hoc categories.
Research has also addressed ad hoc categories that become Memory 6: 55592.
well established in memory, what Barsalou (1985, 1991) termed
goal-derived categories (also called script categories, slot iller
ADJACENCY PAIR
categories, and thematic categories). J. Luciarello and K. Nelson
(1985) found that children acquire goal-derived categories associ- conversation analysis, an inductive approach to the micro-
ated with scripts before they acquire taxonomic categories (e.g., analysis of conversational data pioneered by Harvey Sacks
places to eat). B. H. Ross and G. L. Murphy (1999) examined how (1992), attempts to describe the sequential organization of
taxonomic and goal-derived concepts simultaneously organize pieces of talk by examining the mechanics of the turn-taking sys-
foods (e.g., apples as belonging simultaneously to fruit and snack tem. Adjacency pairs relect one of the basic rules for turn-taking
foods). D. L. Medin and colleagues (2006) found that goal-derived (Sacks, Scheglof, and Jeferson 1974), in which a speaker allo-
categories play central roles in cultural expertise (e.g., tree experts cates the conversational loor to another participant by uttering
form categories relevant to their work, such as junk trees). the irst part of a paired sequence, prompting the latter to pro-
Although ad hoc and goal-derived categories are ubiquitous vide the second part. Examples are question-answer, greeting-
in everyday cognition, they have been the subject of relatively greeting as in (1), and complaint-excuse:
little research. Much further study is needed to understand
(1) A: Hi there
their structure and role in cognition. Important issues include
B: Oh hi
the following: How do productive conceptual and linguistic
mechanisms produce ad hoc categories? How do these catego- he constitutive turns in adjacency pairs have the following
ries support goal pursuit during situated action? How do these structural characteristics:
categories become established in memory through frequent use?
(i) hey are produced by two diferent speakers.
How does the acquisition of these categories contribute to exper-
tise in a domain? (ii) hey are, as the term suggests, adjacent. his is not a strict
requirement, as the two parts can be separated by a so-called
Lawrence W. Barsalou insertion sequence, as in (2):
87
Age Groups
(2) A: Whats the time now? (Question 1) Static versus Dynamic Theories
B: Dont you have a watch? (Question 2) Mundane observations make it abundantly clear that language
A: No. (Answer 2) change is not punctual but gradual, and not categorical but
B: I think its around three. (Answer 1) variable. Traditional views of language change imposed meth-
(iii) hey are organized as a irst and a second part, that is, odological restrictions to avoid viewing change while it was pro-
they are nonreversible. his is the case, incidentally, even in gressing. Linguists had little conidence in their ability to discern
ostensibly identical irst and second parts, such as the greet- change precisely and accurately amid the apparent lawlessness
ing-greeting pair in (1), where reversing the order results in of social phenomena, as Edward Sapir incisively put it (1929,
an aberrant sequence. 213). However, in the 1960s, linguists began studying language in
(iv) hey are ordered, so that a particular irst part requires a its social context. By that time, economics, anthropology, sociol-
relevant second part (e.g., greetings do not follow questions). ogy, and other social sciences were well established, and linguis-
tic studies belatedly admitted sociolinguistics, its social science
he fact that the second part is conditionally relevant on the irst adjunct (Chambers 2002b).
part does not mean that only one option is available; in fact, cer- Viewing language changes as they progressed entailed the
tain irst parts typically allow for a range of possible second parts. admission of coexisting linguistic entities as data. Linguists
If two (or more) options are possible, one will be the more socially were required to study the social distribution of, for example,
acceptable, preferred response, the other(s) being dispreferred; both sneaked and snuck as variants of the past tense of sneak
this phenomenon is known as preference organization, as in: (as in the example to be discussed). Dealing coherently with
(3) A: Have a piece of cake (irst part) variables necessitated determining the distribution of vari-
B1: Great thanks I will (preferred second) ants with certain social factors, including the age of the speak-
B2: Ehm actually Ive just eaten but thanks anyway (dispre- ers. Uriel Weinreich, William Labov, and Marvin I. Herzog, in
ferred second) the document that became the manifesto for viewing language
change in its social context, said: A model of language which
As illustrated in (3), dispreferred second parts tend to be structur- accommodates the facts of variable usage and its social and
ally diferent from preferred seconds (B2 being indirect, includ- stylistic determinants not only leads to more adequate descrip-
ing an explanatory account, and containing hesitation markers, tions of linguistic competence, but also naturally yields a theory
unlike B1). of language change that bypasses the fruitless paradoxes with
Another, related phenomenon that merits mention here is which historical linguistics has been struggling for half a cen-
presequencing: Certain adjacency pairs can be introduced or tury (1968, 99). Primary among those paradoxes, of course,
foreshadowed by a preceding exchange, as in: were concepts of change as punctual, categorical, and static. By
(4) A1: Do you sell fresh semiskimmed milk? admitting social variables, it became possible to view change as
B1: We sure do gradual, variable, and dynamic, consistent with commonsense
A2: Ill have two bottles then please observations.
B2: OK
Real-Time Change
his whole exchange forms one unit, in the sense that the occur- he study of the progress of change can be carried out in real time
rence of the question-answer pair A1-B1 is only interpretable by revisiting survey sites at intervals and observing changes in the
given the subsequent request-compliance adjacency pair A2-B2. social distribution of variants from one time to the next. Because
Phenomena such as this are indicative of a level of sequential changes will not necessarily be completed in the interval but will
organization in conversation beyond two-turn sequencing (see be continuing, their progress must be calculated by some kind
Scheglof 2007). of proportional measure, such as the percentage of the variants,
their relative frequency, or their probabilities. he proportional
Ronald Geluykens
diferences from one visit to the next provide a quantitative mea-
sure of the progress of the change.
Studying change in real time has a number of methodologi-
Sacks, Harvey. 1992. Lectures on Conversation. Oxford: Blackwell. cal disadvantages. Most obvious is the time required before
Sacks, Harvey, Emanuel A. Scheglof, and Gail Jeferson. 1974. A sim- attaining a result. Locating subjects on subsequent visits also
plest systematics for the organization of turn-taking in conversation. poses obvious problems because of mobility, cooperation, or
Language 50: 696735.
death. Furthermore, subsequent visits require the addition of
Scheglof, Emanuel A. 2007. he Language of Turn and Sequence.
new subjects at the youngest age group each time. his is nec-
essary because of the manner in which linguistic innovations
typically difuse (see diffusion ) throughout the population.
AGE GROUPS Instead of spreading outward from the source and afecting
Age is one of the primary independent variables in almost everybody in their sphere of inluence, as infectious
sociolinguistics, along with social class, sex, ethnicity, and diseases do in epidemics and technological adoptions do
region. Age is the primary social correlate for determining that when, say, spin dryers replace washboards, linguistic inno-
language is changing and for estimating the rate of change and vations tend to be stratiied. Under ordinary circumstances,
its progress. people acquire their accents and dialects in their formative
88
Age groups
100
90
80
70
60
50
Figure 1. Percentage of people in different age groups who
40 say snuck, not sneaked, as past tense of sneak in the
30 Golden Horseshoe, Canada (Chambers 2002a, 36466).
20
10
0
0ver 80
7079
6069
5059
4049
3039
2029
1419
years, between 8 and 18, and maintain them throughout their currency. It gained currency steadily thereafter, however, and
lives. People who grow up saying sneaked as the past tense accelerated most rapidly in the speech of the 50-year-olds,
of sneak tend to use that form all their lives, even after they people born in 193342, increasing by some 25 percent and
come to know that younger people in their region say snuck becoming the variant used by almost 70 percent of them. In
instead. Because of this stratiication, the progress of a linguis- subsequent decades, it was adopted by ever-greater numbers.
tic change is not measurable in the life span of an individual In the 1980s, the formative years for people born in 19738, the
or one age group but only in comparison between individuals teenagers in this survey, snuck virtually eliminated sneaked as
whose formative years are not the same, that is, between dif- a variant.
ferent age groups. Changes normally take place beneath the level of conscious-
ness. Young people seldom have a sense of the history of the
Change in Progress variants. In this case, people under the age of 30 often con-
Correlating linguistic variants with their use by age groups in sider the obsolescent form sneaked to be a mistake when they
the community as the change is taking place is a way of measur- hear someone say it. here is no communal sense that sneaked
ing its progress. Inductively, change is evident when a linguistic was the historical standard and accepted form for centuries.
variant occurs with greater frequency in the speech of younger Occasionally, changes become self-conscious in the early stages
people than in the speech of their elders. here are some excep- if teachers, writers, or parents openly criticize them. Such criti-
tions (such as age grading), but the inference of change can be cisms almost never succeed in reversing trends, though they
made with reasonable conidence when the frequency of the may slow their momentum. When the incoming variant gains
variant is stratiied from one age group to the next (Labov 2001). enough currency, usually around 2030 percent, its use accel-
hat is, linguistic changes are almost never bimodal, with one erates rapidly. It then slows again as it nears completion. he
variant occurring in the speech of younger people and a difer- graphic pattern is known as the S-curve in innovation difusion,
ent one in the speech of older people. Instead, the variants are with relatively slow (or lat) movement up to 2030 percent, a
typically dispersed along the age continuum in a progressive rapid rise through the middle stages, and lattening again in the
gradation. inal stages.
Figure 1 provides a case study. he variable is the past tense of Lack of communal consciousness of linguistic changes
the verb sneak, with variants sneaked and snuck. he community in progress is a consequence of its social stratiication (see
is the Golden Horseshoe, the densely populated region in south- inequality, linguistic and communicative ). Changes
ern Ontario, Canada. Figure 1 shows the percentage of people generally progress incrementally, so that diferences between
who say snuck, not sneaked, correlated with their age, from octo- the most proximate age groups are small and barely noticeable.
genarians to teenagers. he correlation shows a progression from In Figure 1, 30-year-olds difer from 40-year-olds by about 10
18 percent in the oldest group to 98 percent in the youngest, with percent and from 20-year-olds almost not at all. he difer-
gradation in the intermediate age groups (29 percent of 70s, 42 ence between 30-year-olds and 70-year-olds, by contrast, is
percent of 60s, and so on). over 60 percent. Social relations are closest among age-mates,
Other things being equal, it is possible to draw historical and the gradation of diferences so that proximate age groups
inferences from apparent-time displays like Figure 1. The sur- are most like one another blunts the perception of genera-
vey from which the data are drawn took place in 1992. Among tion gaps within the community. By minimizing awareness of
people born 80 or more years prior, that is, before 1913, changes as they progress, social gradation is a unifying force in
sneaked was the standard variant and snuck had very little communities.
89
Age groups Aging and Language
Apparent-Time Hypothesis of how languages change, who the agents of change are, and how
Figure 1 draws historical inferences of change based on the changes difuse throughout communities. he sociolinguistic
behavior of age groups surveyed at the same time. he replace- perspective on language change as dynamic, progressive, and
ment of sneaked by snuck is not directly observed as it would be variable represents an advance in language studies. he prin-
in real-time studies, in which researchers go to communities at cipal theoretical construct, the apparent-time hypothesis, pro-
intervals and track the changes from one time to the next. Instead, vides a comprehensive view of historical sequences from a single
the inference of change is based on the assumption that under methodological vantage point.
normal circumstances, people retain the accents and dialects
J. K. Chambers
acquired in their formative years. hat assumption is known as
the apparent-time hypothesis. Common experience tells us that
the hypothesis is not without exceptions. People are sometimes
aware of expressions that they once used and no longer do, and Chambers, J. K. 2002a. Patterns of variation including change. In he
sometimes they will have changed their usage after their forma- Handbook of Language Variation and Change, ed. J. K. Chambers,
tive years. If such linguistic capriciousness took place throughout Peter Trudgill, and Natalie Schilling-Estes, 34972. Oxford: Blackwell
the entire community, it would invalidate historical inferences Publishing.
drawn from apparent-time surveys. . 2002b. Studying language variation: An informal epistemol-
ogy. In he Handbook of Language Variation and Change, ed.
However, community-wide changes beyond the formative
J. K. Chambers, Peter Trudgill, and Natalie Schilling-Estes, 314.
years are rare. Real-time evidence, when available, generally
Oxford: Blackwell Publishing.
corroborates apparent-time inferences. In the case of sneaked/ . 2009. Sociolinguistic heory: Linguistic Variation and Its Social
snuck, for instance, earlier surveys made in the same region in Signiicance. 3rd ed. Oxford: Blackwell.
1950 and 1972 show proportional distributions of the variants Labov, William. 2001. Principles of Linguistic Change: Social Factors.
that approximate the apparent-time results. However, inferring Oxford: Blackwell Publishing.
linguistic changes from the speech of contemporaneous age Sapir, Edward. 1929. he status of linguistics as a science. Language
groups is not a direct observation of that change. It remains a 5: 20714.
hypothesis, and its validity must be tested wherever possible. Weinreich, Uriel, William Labov, and Marvin I. Herzog. 1968. Empirical
foundations for a theory of language change. In Directions for
Historical Linguistics: A Symposium, ed. Winfred P. Lehmann and
Age-Graded Changes
Yakov Malkiel, 95188. Austin: University of Texas Press.
Deeper understanding of linguistic change in progress should
ultimately lead to predictable classes of deviation from the
apparent-time hypothesis. One known deviation is age-graded AGING AND LANGUAGE
change. hese are changes that are repeated in each generation,
It is well-documented that in healthy aging, some aspects of lin-
usually as people reach maturity (Chambers 2009, 200206).
guistic ability, for example, phonology, syntax, and vocab-
Age-graded changes are usually so gradual as to be almost
ulary, remain generally preserved into very old age. However,
imperceptible, so that tracking their progress is challenging.
other domains of language, for example, naming, comprehen-
As an example, in all English-speaking communities, there is a
sion and spoken discousre, undergo declines, albeit only at very
rule of linguistic etiquette that requires compound subject noun
late stages in the adult life span. It is interesting to note that
phrases (NPs) to list the irst-person pronoun (the speaker) last.
although these linguistic changes occur generally in the older
Adults say Robin and I went shopping, and never say I and
adult population as a group, there are many older adult individu-
Robin went shopping. here is no linguistic reason for this rule
als who do not experience these deicits but, rather, continue to
(that is, the sentences mean the same thing either way), but
perform as well as younger individuals. hus, the inding of great
putting oneself irst is considered impolite (see politeness).
inter-individual variability should be considered.
Children, however, do not know this and tend to say Me and
his entry presents a detailed review of the three main linguis-
Robin went shopping. At some as-yet-undetermined age, chil-
tic domains experiencing decline with age. Additionally, a review
dren become aware of the rule and change their usage to con-
of language changes due to cognitive deterioration, for example,
form to adult usage.
dementia, is presented.
Age-graded changes like these violate the apparent-time
hypothesis because the variants used by young people do not
persist throughout their lifetimes. Instead, young people change
Naming
A common complaint among the healthy aging population is the
as they reach maturity and bring their usage into line with adults.
increased frequency of word-inding problems in their everyday
he occurrence of age-graded changes does not refute the appar-
speech. A subset of these naming problems is often colloqui-
ent-time hypothesis, but they provide a well-deined exception
ally described as the tip-of-the-tongue (TOT) phenomenon. In
to it. Failure to recognize them as age-graded can lead to an erro-
terms of cognitive models of lexical access, TOTs are described
neous inference that change is taking place.
as a type of word retrieval failure whereby individuals are able to
access the conceptual, semantic, syntactic, and even some
Age and Language Change phonological/orthographic information (e.g., number of
Introducing age groups into linguistic analysis as an indepen- syllables or initial sound/letters) of the target word, not enough
dent variable yielded immediate insights into the understanding information, however, to fully phonologically encode the word
90
Aging and Language
for articulation. Research clearly supports the view that there is a that older adults beneit more from context in noise than do their
general breakdown in phonological encoding (James and Burke younger cohorts.
2000). However, since this stage of processing involves various Lexical comprehension is one aspect of language that is
substages (e.g., segment retrieval, syllable encoding) that occur largely preserved or even enhanced with age. Studies on vocabu-
at very fast rates, as per Willem J. M. Levelt, Ardi Roelofs and lary comprehension in older adults using a picture selection task
Antje S. Meyers (1999) model, behavioral methods are limited for auditorily presented words show that they are comparable to
in their ability to identify the locus of processing diiculty. younger adults in this task (Schneider, Daneman, and Pichora-
Evidence from priming studies has strongly demonstrated Fuller 2002). However, lexical-semantic integration at the level of
that TOTs are due to a failure in transmission of available sentence comprehension may be afected in older adults.
semantic information to the phonological system, as explained Sentence comprehension in the elderly is known to be poor
in the transmission deicit hypothesis (TDH; Burke et al. 1991). in comparison to younger listeners. A number of reasons, both
his theory proposes that older people are especially prone to linguistic and cognitive, have been discussed, including decline
word-retrieval problems due to weakened connections at the in auditory perceptual skills and lexical-semantic and syntactic
phonological level. he phonological level, as compared to processing capacity, as well as working memory capacity, speed
the semantic level, is particularly vulnerable to breakdowns of processing, and ability to process with competing stimuli and
in retrieval because this level generally has fewer connections inhibit noise (Wingield and Stine-Morrow 2000). Studies on syn-
(e.g., phoneme-sound), whereas the semantic system has mul- tactic comprehension of language in older adults demonstrate
tiple joined connections (e.g. many words/concepts linked to that they are slower at judging sentences that are syntactically
a given word). Factors such as word frequency or recency of complex and semantically improbable (Obler et al. 1991). It has
use inluence the strength of phonological connections; that also been found to be relatively more diicult for this population
is, the lower the word frequency and the less recently used the than for younger adults to take advantage of constraining context
word, the weaker the connections, leading to greater retrieval in a sentence. his leads to diiculties in situations where older
diiculties. adults need to rely on context but are not able to do so skillfully.
Both younger and older individuals beneit from phonologi- However, in other aspects of sentence comprehension such as
cal primes as opposed to unrelated primes during moments of a disambiguation, no age-related diferences has been reported.
TOT state, supporting the TDH model. he inding that priming It is apparent that sentence comprehension is largely medi-
leads to better retrieval is consistent with the claim that priming ated by the ability to hold these sentences in working memory
strengthens the inherently weak phonological connections and while they are being processed. Working-memory decline has
thus facilitates resolution of the TOT state. Furthermore, stud- been reported in older adults (Grossman et al. 2002). Moreover,
ies using semantic versus phonological cues have demonstrated syntactic-processing diiculties have also been attributed to
that in both younger and older people, provision of phonological reduction in working-memory capacity (Kemptes and Kemper
information was more efective, as a retrieval aid, than semantic 1997). Executive functions such as inhibition and task switching
cues (Meyer and Bock 1992). his illustrates that in older indi- have been reported as negatively afected in the elderly.
viduals, semantic information is intact, although there is some Comprehension problems eventually afect the older adults
degradation in the use of phonological information. discourse abilities. Often, the elderly ind that it gets harder to
In summary, much evidence supports the TDH and the claim follow discourse with advancing age. Studies have shown that
that the locus of the breakdown in TOT states is at the phono- older adults are signiicantly poorer than younger adults in fully
logical stage. he exact phonological substage responsible for understanding and inferring complex discourse, and this dii-
this problem still remains unclear; however, there are indica- culty is enhanced further with increased perceptual or cognitive
tions from phonological cuing studies, self-reports of individuals load (Schneider, Daneman, and Pichora-Fuller 2002), such as
experiencing TOT states (Brown 1991), and an electrophysiolog- noise and length or complexity of the material. A combination of
ical study of lexical retrieval in healthy younger and older adults general cognitive decline and a deterioration of speciic linguis-
(Neumann 2007) that the irst two substages (segmental and syl- tic and sensory-perceptual processes contribute to the general
labic retrieval) are particular points at which breakdowns occur. slowing observed in the elderly while they engage in conversa-
tion and discourse.
Language Comprehension
It is relatively well known that comprehension in older adults Spoken Discourse
is compromised in comparison to younger adults. hese prob- Results from many types of tasks requiring sentence produc-
lems in comprehension may arise from individual or combined tion, such as cartoon description tasks (Marini et al. 2005), have
efects of decline in their sensory/perceptual, linguistic, or cog- indicated that older adults tend to produce grammatically sim-
nitive domains. Current research is focused on disambiguating pler, less informative and more fragmented sentences than do
the efects of these processes on phonological, lexical-semantic, younger adults. Middle-aged and young elderly adults tend to
and syntactic aspects of language decline in the elderly. be better and more eicient story constructors than younger
Research shows that speech perception in older adults is espe- or older adults. However, older adults usually produce a larger
cially afected by noise. his means that even with normal hear- number of words in their narrative speech, but they can show
ing, older adults experience diiculties understanding speech diiculties in morpho-syntactic and lexico-semantic processing
(e.g., sentences) under noisy conditions such as a cocktail party by making more paragrammatic errors and semantic parapha-
or cafeteria. Experiments focusing on sentence perception show sias than younger adults.
91
Aging and Language
Older adults also show a decreased ability to coherently link Garrard, Peter, Matthew A. Lambon Ralph, John R. Hodges, and Karalyn
adjacent utterances in a story. Adults older than 75 use a larger Patterson. 2001. Prototypicality, distinctiveness, and intercorrela-
number of missing or ambiguous referents and more units of tions: Analysis of the semantic attributes of living and nonliving con-
irrelevant content that afect the coherence of the narratives cepts. Cognitive Neuropsychology 18.2: 12574.
Grossman, Murray, Ayanna Cooke, Christian DeVita, David Alsop, John
(Marini et al. 2005). Older people show a huge variation in story-
Detre, Willis Chen, and James Gee. 2002. Age-related changes in
telling abilities, and they can also compensate for their storytell-
working memory during sentence comprehension: An fMRI study.
ing diiculties due to their greater accumulated life experience, NeuroImage 15: 30217.
which they can use to combine diferent themes and to empha- James, Lori E., and Deborah M. Burke. 2000. Phonological priming
size relevant details. efects on word retrieval and tip-of-the-tongue experiences in young
In spoken discourse, some changes in conversational-in- and older adults. Journal of Experimental Psychology: Learning,
teraction style can occur in young elderly people (6074 years), Memory, and Cognition 26.6: 137891.
but the most noticeable changes are likely to take place in older James, Lori E., Deborah M. Burke, Ayda Austin, and Erika Hulme. 1998.
elderly people (7788 years), who show excessive verbosity, fail- Production and perception of verbosity in younger and older adults.
ure to maintain topic, poor turn-taking, and unclear referencing Psychology and Aging 13: 35567.
(James et al. 1998). Kemper, Susan, Ruth E. Herman, and Chiung-Ju Liu. 2004. Sentence
production by young and older adults in controlled contexts. Journal
Diiculties in grammatical processing in the aging can be
of Gerontology Series B: Psychological Sciences and Social Sciences
attributed to cognitive deterioration involving reduced working
59: 2204.
memory (Kemper, Herman, and Liu 2004) and inhibitory deicits Kemptes, Karen A., and Susan Kemper 1997. Younger and older adults
(James et al. 1998), but they can also be a sign of impaired access on-line processing of syntactically ambiguous sentences. Psychology
to the lemma level during lexical retrieval (Marini et al. 2005). and Aging 12: 36271.
In summary, there are clear diferences between younger and Levelt, Willem J. M., Ardi Roelofs, and Antje S. Meyer 1999. A theory
older adults in their sentence production, storytelling, and con- of lexical access in speech production. Behavioral Brain Sciences
versational abilities. hese diferences are manifest as changes 22: 175.
in morpho-syntactic and lexico-semantic processing, excessive Marini, Andrea, Anke Boewe, Carlo Caltagirone, and Sergio Carlomagno.
verbosity, and reduced informativeness and coherence. 2005. Age-related diferences in the production of textual descrip-
tions. Journal of Psycholinguistic Research 34: 43963.
Meyer, Anje S., and Kathryn Bock. 1992. he tip-of-the-tongue phe-
Dementia
nomenon: Blocking or partial activation? Memory and Cognition
Diiculties in language production, as well as comprehension,
20: 71526.
become obvious in dementia. Word-inding diiculties are char- Morris, John, Martha Storandt, J. Phillip Miller, Daniel W. McKeel, Joseph
acteristic of mild cognitive impairment (MCI), Alzheimers dis- L. Price, Eugene H. Rubin, and Leonard Berg. 2001. Mild cognitive
ease (AD), and vascular dementia (VaD). impairment represents early-stage Alzheimers disease. Archives of
Arto Nordlund and colleagues (2004) indicated that 57.1 per- Neurology 58.3: 397405.
cent of the individuals with MCI had signiicantly lower scores Neumann, Yael. 2007. An Electrophysiological Investigation of the Efects
in diferent language tasks than did typical aging adults. In AD, of Age on the Time Course of Segmental and Syllabic Encoding dur-
language-processing diiculties are early signs of the disease. ing Implicit Picture Naming in Healthy Younger and Older Adults.
In particular, AD appears to cause a breakdown of the seman- Publications of the Department of Speech, Language, and Hearing
Sciences. New York: City University of New York.
tic domain of language, which can be relected in the impaired
Nordlund, Arto, S. Rolstad, P. Hellstrm, M. Sjgren, S. Hansen, and A.
comprehension and use of semantic relations between words
Wallin. 2004. he Goteborg MCI study: Mild cognitive impairment is
(Garrard et al. 2001) and in the reduced semantic noun and verb
a heterogeneous condition. Journal of Neurology, Neurosurgery, and
luency performance (Pekkala 2004) in this population. Psychiatry 76: 148590.
Comparative studies between AD and VaD have indicated that Obler, Loraine K., Deborah Fein, Marjorie Nicholas, and Martin L. Albert
cognitive and linguistic performance cannot clearly diferentiate 1991. Auditory comprehension and aging: Decline in syntactic pro-
the two types of dementia from each other. Elina Vuorinen, Matti cessing. Applied Psycholinguistics 12: 43352.
Laine, and Juha Rinne (2000) indicated that both AD and VaD Pekkala, Seija. 2004. Semantic Fluency in Mild and Moderate Alzheimers
involved similar types of semantic deicits early in the disease, Disease. Publications of the Department of Phonetics 47, University of
including diiculties in comprehension, naming, and produc- Helsinki. Available online at: http://ethesis.helsinki.i/.
tion of semantic topics in narrative speech, while word repeti- Reuter-Lorenz, Patricia A., John Jonides, Edward E. Smith, Alan Hartlye,
Andrea Miller, Christina Marshuetz, and Robert A. Koeppe. 2000. Age
tion, oral reading, and luency of speech output were preserved
diferences in the frontal lateralization of verbal and spatial working mem-
in both types of dementia.
ory revealed by PET. Journal of Cognitive Neuroscience 12: 17487.
Yael Neumann, Seija Pekkala, and Hia Datta Schneider, Bruce A., Meredith Daneman, and M. Kathleen Pichora-
Fuller. 2002. Listening in aging adults: From discourse comprehen-
sion to psychoacoustics. Canadian Journal of Experimental Psychology
56: 13952.
Brown, Alan S. 1991. A review of the tip-of-the-tongue experience. Vuorinen, Elina, Matti Laine, and Juha Rinne. 2000. Common pattern
Psychological Bulletin 109.2: 20423. of language impairment in vascular dementia and in Alzheimers dis-
Burke, Deborah M., Donald G. MacKay, Joanna S. Worthley, and ease. Alzheimer Disease and Associated Disorders 14: 816.
Elizabeth Wade. 1991. On the tip of the tongue: What causes word Wingield, Arthur, and Elizabeth A. L. Stine-Morrow. 2000. Language
inding failures in young and older adults? Journal of Memory and and aging. In he Handbook of Cognition and Aging, ed. Craig and
Language 30: 54279. Salthouse, 359416. Mahwah, NJ: Psychology Press.
92
Agreement Agreement Maximization
AGREEMENT WORKS CITED AND SUGGESTIONS FOR FURTHER READING
Agreement is a form of featural dependency between diferent Aoun, J. and D. Sportiche. 1983. On the formal theory of government.
Linguistic Review 2: 21136.
parts of a sentence: he morphological shape of a word is a func-
Barlow, M., and C. Barlow, eds. 1988. Agreement in Natural Language.
tion of particular morphological features of a diferent, often dis-
Stanford, CA: CSLI Publications.
tant, word. Since the Middle Ages, agreement was taken to be in
Boeckx, C., ed. 2006. Agreement Systems, Amsterdam: Benjamins.
complementary distribution with government, and, hence, it Chomsky, N. 2000. Minimalist inquiries: the framework. In Step by
became important to determine both the context of each type of Step, ed. R. Martin, D. Michaels, and J. Uriagereka, 89155. Cambridge,
relation and the reasons why this diference exists (Covington MA: MIT Press
1984). his rich tradition has survived in generative grammar, all Covington, M. 1984 Syntactic heory in the High Middle Ages,
the way up to the minimalist program (see mimimalism), where Cambridge: Cambridge University Press.
it is embodied under the notion agree.
Depending on the features and occurrences in an expression,
agreement can be characterized as external or internal (Barlow AGREEMENT MAXIMIZATION
and Fergusson 1988). External agreement typically involves per- Maximizing, or optimizing, agreement between speaker and
son and number features, taking place between verbs and cor- interpreter is part of the principle of charity (see charity, prin-
responding dependents. We can witness it in you are friendly ciple of) that, according to philosophers of language in the
versus he is friendly. Internal agreement (concord) normally tradition of W. V. O. Quine and Donald Davidson, governs the
involves gender and number features, and typically takes place interpretation of the speech and thought of others and guides
internal to nominal expressions, between adjectives or relative the radical interpretation of a radically foreign language.
clauses and the head noun, freely iterating. Concord is easily According to Davidson, correct interpretation maximizes truth
observed in modiied nominal expressions in Spanish: atractivas and coherence across the whole of the beliefs of a speaker. For an
damas attractive ladies versus atractivos caballeros attractive interpreter, maximizing truth across the beliefs he ascribes to a
gentlemen (agreeing elements are boldfaced). Genitive agree- speaker necessarily amounts to maximizing agreement between
ment internal to nominal expressions normally falls within the himself and the speaker: He can only go by his own view of what
external (not the internal) rubric. is true.
he principles and parameters system concentrated on Take one of Davidsons own examples: Someone says here
external agreement, through the relation (head, speciier) (Aoun is a hippopotamus in the refrigerator, and he continues: Its
and Sportiche 1983). However, since agreement is possible also roundish, has a wrinkled skin, does not mind being touched.
in situations where no such relation seems relevant, the mini- It has a pleasant taste, at least the juice, and it costs a dime. I
malist program (Chomsky 2000) proposes a relation between squeeze two or three for breakfast (1968, 100). he simplest way
a probe and a goal. he probe contains a value-less attribute in of maximizing agreement between this speaker and us will prob-
need of valuation from a distant feature of the same type, which ably be interpreting his expression hippopotamus as meaning
the probing mechanism achieves. he goal cannot be contained the same as our expression orange. Davidson himself, however,
within a derivational cycle (a phase) that is too distant from the soon came to consider maximizing agreement as a confused
probe. When the probe inds an identical category within its com- ideal (1984, xvii) to be substituted with optimizing agreement
plement domain, it attempts to get its valuation from it, thereby (1975, 169). he idea here is that some disagreements are more
sanctioning the relevant agreement. To illustrate, observe the destructive of understanding than others (1975, 169). Very
Spanish example in (1); note also the internal agreement mani- generally, this is a matter of epistemic weight; the more basic a
fested within the noun phrase: belief is, and the better reasons we have for holding it, the more
(1) Parecen [haber quedado [los locos soldados] en la guarnicin] destructive disagreement on it would be: he methodology of
seem-3rd/pl. have remained the-m./pl. crazy-m./pl. interpretation is, in this respect, nothing but epistemology seen
soldiers-m./pl. in the garrison in the mirror of meaning (1975, 169). According to Davidson, it
is impossible to codify our epistemology in simple and precise
(2) Probe1 [ [Goal]]] (plus iteration of -os within the nominal) form, but general principles can be given: [A]greement on laws
and regularities usually matters more than agreement on cases;
( ) person pl. number
agreement on what is openly and publicly observable is more
( ) number m. gender to be favored than agreement on what is hidden, inferred, or ill
observed; evidential relations should be preserved the more they
Agreement adds a strange redundancy to the language faculty.
verge on being constitutive of meaning ([1980] 2004, 157).
In languages where the phenomenon is overt, the extra layer of
Agreement optimization does not exclude the possibility of
manifest dependency correlates with difering surface orders. But
error; speakers are to be interpreted as right only when plau-
it is unclear whether that justiies the linguistic emergence of the
sibly possible (Davidson 1973, 137). In certain situations this
agreement phenomenon, particularly since the probe/goal mech-
prevents the interpreter from ascribing beliefs of his own to
anism can be present without overt manifestations. his results in
the speaker, for instance, perceptual beliefs about objects the
much observed variation, from the almost total lack of overt agree-
speaker is in no position to perceive. Moreover, if the speaker
ment of Chinese to the poly-personal manifestation of Basque.
has other beliefs that provide him or her with very good reasons
Juan Uriagereka for believing something false, optimizing agreement across all
93
Agreement Maximization Alliteration
of his or her beliefs might well require ascription of outright . 1975. hought and talk. In Davidson 1984, 15570.
mistakes. . [1980] 2004. A uniied theory of thought, meaning, and action.
Optimizing agreement provides an interpreter with a method Problems of Rationality, 15166. Oxford: Clarendon Press.
for arriving at correct interpretations because of the way belief . [1983] 2001. A coherence theory of truth and knowledge.
Subjective, Intersubjective, Objective, 13753. Oxford: Clarendon Press.
content is determined, Davidson holds. he arguments for this
. 1984. Inquiries into Truth and Interpretation. Oxford: Clarendon
claim have changed over the years; initially, the idea was that a
Press.
belief is identiied by its location in a pattern of beliefs; it is this . [1991] 2001. hree varieties of knowledge. Subjective,
pattern that determines the subject matter of the belief, what the Intersubjective, Objective, 20520. Oxford: Clarendon Press.
belief is about (1975, 168). Later, however, the role played by . 2001. Externalisms. Interpreting Davidson, ed. P. Kotatko, P.
causal connections between objects and events in the world and Pagin, and G. Segal, 116. Stanford, CA: CSLI.
the beliefs of speaker and interpreter becomes more and more Gler, Kathrin. 2006. Triangulation. he Oxford Handbook of Philosophy
prominent: In the most basic, perceptual cases, the interpreter of Language, ed. E. Lepore and B. Smith, 100619. Oxford: Oxford
interprets sentences held true (which is not to be distinguished University Press.
from attributing beliefs) according to the events and objects in Grandy, Richard. 1973. Reference, meaning, and belief. Journal of
the outside world that cause the sentence to be held true ([1983] Philosophy 70: 43952.
Kripke, Saul. 1972. Naming and Necessity. Cambridge: Harvard University
2001,150). In the later Davidson, the account of content deter-
Press.
mination underlying the method of charitable interpretation
takes the form of a distinctive, social and perceptual meaning
externalism: In the most basic cases, the objects of thought ALLITERATION
are determined in a sort of triangulation as the shared causes
of the thoughts of two interacting persons, for instance, a child Linguistically, alliteration, also known as initial or head rhyme,
and its teacher (cf. Davidson [1991] 2001, 2001). According to is deined as the selection of identical syllable onsets within a
Davidson, such triangulation is a necessary condition for thought speciic phonological, morphosyntactic, or metrical domain.
with empirical content; moreover, he derives a quite far-reaching It is usually coupled with stress, as in the three Rs in educa-
epistemic antiskepticism from it and claims that belief is in its tion: reading, writing, and arithmetic. Etymologically, the term
nature veridical ([1983] 2001, 146; cf. also [1991] 2001, 211f). alliteration (from L. ad- to + littera letter) includes the repe-
Probably the most inluential argument against the idea that tition of the same letters at word beginnings; its dual association
any kind of maximizing agreement results in correct interpreta- with sounds and letters relects a common cognitive crisscross-
tion derives from Saul Kripkes attack on description theories ing between spoken and written language in highly literate
of proper names. According to such theories, the referent of a (see literacy) societies well illustrated by the famous phrase
proper name, for instance, Gdel is determined by a descrip- apt alliterations artful aid, where the alliteration is primarily
tion, or cluster of descriptions, held true by the speaker(s), for orthographic.
instance, the description the discoverer of the incompleteness Alliteration based on the sameness of letters is found in visual
of arithmetic. Kripke argued, among other things, that such poetry, advertising, and any form of playful written language.
theories fail because all of the relevant descriptions, all of the rel- Phonologically based alliteration is a frequent mnemonic and
evant beliefs that a speaker, or even a group of speakers, holds cohesive device in all forms of imaginative language: Examples
about Gdel could turn out to be false (cf. 1972, 83f). Kripke gave from English include idioms (beat about the bush), reduplicative
an analogous argument for natural-kind terms such as gold or word-formation (rifraf), binominals (slowly but surely), catch
tiger, and many philosophers today believe that these arguments phrases, refrains, political slogans, proverbs, and clichs. In
can be generalized even further. While it is quite clear, however, verse, alliteration serves both as ornamentation and as a struc-
that most of the descriptions a speaker associates, for instance, tural device highlighting the metrical organization into feet, cola,
with a name could turn out to be false when taken one by one, verses, and lines. Along with rhyme, alliteration is a common fea-
it is far less obvious that all (or most, or a weighted majority) of ture of folk and art verse in languages as diverse as Irish, Shona,
them could do so at the same time. According to Davidson, for Mongolian, Finnish, and Somali.
instance, optimizing agreement amounts to reference determi- he most frequent type of alliteration requires identity of the
nation by epistemically weighted beliefs. A signiicant number onsets of stressed syllables, which makes it a preferred poetic
of these are very elementary beliefs like the belief that Gdel device in languages with word-initial stress, such as the older
was a man, that he was human, that he worked on logic, that he Germanic languages. Within the Germanic tradition, metrically
lived on Earth, and so on. If a speaker did not believe any of these relevant alliteration occurs on the stressed syllables of the irst
things about Gdel, it has been argued with Davidson, it is far foot of each verse (or half-line), where it is obligatory. For Old
less clear that this speaker was in fact talking about Gdel: Too English, the language of the richest and most varied surviv-
much mistake simply blurs the focus (Davidson 1975, 168). ing alliterative poetry in Germanic, the second foot of the irst
half-line may also alliterate. Alliteration is disallowed on the last
Kathrin Gler stressed syllable in the line.
In Old English verse, alliteration appears with remark-
WORKS CITED AND SUGGESTIONS FOR FURTHER READING able regularity: Only 0.001% of the verses lack alliteration and
Davidson, Donald. 1968. On saying that. In Davidson 1984, 93108. less than 0.05% contain unmetrical alliteration (Hutcheson
. 1973. Radical interpretation. In Davidson 1984, 12539. 1995, 169). Alliteration is, therefore, a reliable criterion used by
94
Ambiguity
modern editors to determine the boundaries of verses and lines, Danny saw the man with the telescope, either Danny used the
though no such divisions exist in the manuscripts. he rein- telescope to help him see the man (3) or the man whom Danny
vented alliterative tradition of fourteenth-century England also saw had a telescope (4).
uses alliteration structurally, while its ornamental function is
3. Danny [VP saw [NP the man] [PP with the telescope]]
enhanced by excessive verse-internal and run-on alliteration.
As a cohesive device in verse, alliteration refers to the under- 4. Danny [VP saw [NP the man [PP with the telescope]]]
lying distinctions in the language and relies on identity of pho- Referential ambiguity occurs when it is not clear which entity
nological categories. he interpretation of onset identity for the in a context is being referred to by the given linguistic expression.
purpose of poetic alliteration varies from tradition to tradition Although deictics (see deixis), such as pronouns, are typical
and can include whole clusters, optionally realized segments, sources of referential ambiguity, full noun phrases and proper
and even the whole syllable up to the coda. In Germanic, all nouns can also give rise to it.
consonants alliterated only with one another, the clusters st-,
sp-, sk- could not be split, and all orthographic stressed vowels 5. (at a boys soccer game) He kicked him! Who kicked who?
alliterated freely among themselves, most likely because their 6. (at a boat race) hat boat seems to be pulling ahead. Which
identity was signaled by the presence of a prevocalic glottal stop one?
in stressed syllable onsets.
7. (in a university corridor) Im of to meet with Dr. Sullivan.
Donka Minkova Chemistry or math? (here are two Dr. Sullivans in difer-
ent departments.)
Scopal ambiguity occurs when a sentence contains more than
Fabb, Nigel. 1997. Linguistics and Literature: Language in the Verbal Arts one quantiied NP and the interpretation depends on the rela-
in the World. Oxford: Blackwell. tive scopes of the quantiiers. For example, Some children saw
Hutcheson, Bellenden Rand. 1995. Old English Poetic Metre. both plays can mean that a) there exist some children such that
Cambridge: D. S. Brewer.
each of them saw both plays or b) both plays were such that each,
Minkova, Donka. 2003. Alliteration and Sound Change in Early English.
individually, was seen by some children but not necessarily the
same children. Phonetic ambiguity arises when a given sound
pattern can convey diferent words, for example, two ~ too ~ to;
AMBIGUITY new deal ~ nude eel.
Although people typically do not notice the ambiguities
Ambiguity refers to the potential of a linguistic expression to have
that they efortlessly resolve through context, they are certainly
more than one meaning. Although many expressions (words,
aware of the potential for ambiguity in language. In fact, such
phrases, and even sentences) are ambiguous in isolation, few
awareness is a precondition for getting the joke in Abbott and
remain so when used in a particular context. In fact, people typi-
Costellos Whos on First? skit or in headlines like Iraqi Head
cally resolve all ambiguities without even detecting the potential
Seeks Arms.
for other interpretations. Ambiguity does not imply vagueness;
Whereas ambiguity does not frequently hinder efective
rather, ambiguity gives rise to competing interpretations, each of
communication among people, it is among the biggest hurdles
which can be perfectly concrete. Although ambiguity is pervasive
for the machine processing of language. his is not surprising if
and unavoidable in natural languages, artiicial languages devel-
one considers how much reasoning is required to resolve ambi-
oped for mathematics, logic, and computer programming strive
guity and how much knowledge of language, the context, and
to eliminate it from their expressions.
the world must underpin such reasoning. As an example of the
Ambiguity can be lexical, structural, referential, scopal,
large scale of the task, consider the short sentence he coach
or phonetic. he examples of these phenomena that follow
lost a set, which you probably interpreted to mean the person
include well-known classics in English.
who is the trainer of some athletic team experienced the loss of
Lexical ambiguity refers to the fact that some words, as writ-
a part of a match in an athletic competition (whether the coach
ten or spoken, can be used in diferent parts of speech (see
was playing or the team was playing is yet another ambiguity).
word classes) and/or with diferent meanings. For example,
Other interpretations are also valid, given speciic contexts. For
duck can be used as a noun or a verb and, as a noun, can refer to
example, the person who is the trainer of some team might have
a live animal or its meat. Structural ambiguity arises when dif-
lost a set of objects (keys, golf clubs) or a railroad car might have
ferent syntactic parses give rise to diferent interpretations. For
lost a set of objects (door handles, ball bearings). If this sentence
example, in addition to being lexically ambiguous, hey saw her
were used as input to an English-Russian machine translation
duck is also structurally ambiguous:
system that relied on a standard English-Russian dictionary,
1. hey saw [NP her duck] (the bird or its meat belongs to her) that system would have to select from among 15 senses of coach,
11 senses of lose, and 91 senses of set a grand total of 15,015
2. hey saw [NP her] [VP duck] (the ducking is an action she carries
combinations, if no further knowledge were brought to bear. Of
out)
course, all machine translation systems incorporate some heu-
A common source of structural ambiguity involves the attach- ristic knowledge, and lexicons developed for natural language
ment site for prepositional phrases, which can be at the level processing typically do not permit the amount of sense splitting
of the nearest noun phrase (NP) or the clause. In the sentence found in dictionaries for people. On the other hand, it is common
95
Amygdala
for sentences to contain upward of 20 words, in which case there cortical visual processing stream. Hence, through its extensive
is still the threat of combinatorial explosion. connectivity with sensory processing regions, the amygdala is
ideally located to inluence perception based on emotion.
Marjorie J. McShane
Language
In order to survive in a changing environment, it is especially
Cruse, D. A. 1986. Lexical Semantics. Cambridge: Cambridge University important for the organism to remember events and stimuli
Press. Includes features of and tests for ambiguity. that are linked with emotional consequences. Furthermore, it is
Small, Steven, Garrison Cottrell, and Michael Tanenhaus, eds. 1988. important to be vigilant of emotional stimuli in the environment
Lexical Ambiguity Resolution: Perspective from Psycholinguistics,
in order to allow for rapid evaluation of and response to these
Neuropsychology and Artiicial Intelligence. San Mateo, CA: Morgan
emotional stimuli. In humans, emotional cues are transmitted
Kaufmann.
Zwicky, Arnold M., and Jerrold M. Sadock. 1975. Ambiguity Tests and
linguistically, as well as through body posture, voice, and facial
How to Fail hem. In Syntax and Semantics, ed. J. Kimball, IV: 136. expression (see gesture).
New York: Academic Press. Discusses tests to distinguish ambiguity In the irst imaging study to examine language and the
from lack of speciication. amygdala, a modiied Stroop task was utilized, along with a
high-sensitivity neuroimaging technique, to target the neu-
ral substrate engaged speciically when processing linguistic
AMYGDALA
threat (Isenberg et al. 1999). Healthy volunteer subjects were
Studies in animals have established a clear role for the amygdala instructed to name the color of words of either threat or neutral
in social and emotional behavior, especially as related to fear valence, presented in diferent color fonts, while neural activity
and aggression (Le Doux 1996). Human studies, including lesion was measured by using positron emission tomography. Bilateral
studies, electrophysiology, and functional neuroimaging, amygdalar activation was signiicantly greater during color nam-
have further elucidated the role of the amygdala in the processing of threat words than during color naming of neutral words
ing of a variety of emotional sensory stimuli, as well as its rela- (see Color Plate 1). Associated activations were also noted in
tionship to behavioral and cognitive responses (Adolphs 2001). sensory-evaluative and motor-planning areas of the brain. hus,
hese responses not only guide social behavior but also aid in the our results demonstrate the amygdalas role in the processing
acquisition of social knowledge. he focus of this entry is on the of danger elicited by language. In addition, the results reinforce
amygdala and its role in the processing of language, in particu- the amygdalas role in the modulation of the perception of, and
lar language relevant to social and emotional behavior (see also response to, emotionally salient stimuli. his initial study further
emotion and language and emotion words). suggests conservation of phylogenetically older mechanisms of
he amygdala is an almond-shaped group of neurons located emotional evaluation in the context of more recently evolved lin-
in the rostral medial temporal region on both left and right sides guistic function. In a more recent study that examines the neural
of the brain (see left hemisphere language process- substrates involved when subjects are exposed to an event that is
ing and right hemisphere language processing). It verbally linked to an aversive outcome, activation is observed in
has reciprocal connections to regions, such as the hypothala- the left amygdala (Phelps et al. 2001, 43741).
mus, that are important for coordinating autonomic responses his activation correlated with the expression of the fear
to complex environmental cues for survival, as well as premo- response as measured by skin conductance response, a periph-
tor and prefrontal areas that are necessary for rapid motor and eral measure of arousal. he laterality of response may relate to
behavioral responses to perceived threat. Visual, somatosen- the explicit nature of the fear, as well as to the fact that the stim-
sory, and auditory information is transmitted to the amygdala ulus is learned through verbal communication. Fears that are
by a series of indirect, modality-speciic thalamocorticoamygda- simply imagined and anticipated nonetheless have a profound
lar pathways, as well as by direct thalamoamygdalar pathways. impact on everyday behavior. he previous study suggests that
Within the amygdaloid complex, information processing takes the left amygdala is involved in the expression of fear when antic-
place along numerous highly organized parallel pathways with ipated and conveyed in language. Another study that sought to
extensive intraamygdaloid connections. he convergence of examine the role of the amygdala in the processing of positive as
inputs in the lateral nucleus enables stimulus representations well as negative valence verbal stimuli also demonstrated activ-
to be summated. Speciic output pathways from the central ity in the left amygdala (Adolphs, Baron-Cohen, and Tranel 2002,
nucleus and amygdalohippocampal area mediate complemen- 126474). During magnetic resonance (MR) scanning, subjects
tary aspects of learning and behavioral expressions connected viewed high-arousal positive and negative words and neutral
with various emotional states. he amydgala is thus well posi- words. In this study, activity was found in the left amygdala while
tioned to play a role in rapid cross-modal emotional recognition. subjects viewed both negative and positive words in comparison
It is important for the processing of emotional memory and for to neutral words. Taken together, these studies suggest that the
fear conditioning. amygdala plays a role in both positive and negative emotional
In addition, anatomical studies of the primate amygdala dem- responses. Furthermore, they suggest that the left amygdala may
onstrate connections to virtually all levels of visual processing in be preferentially involved in the processing of emotion as con-
the occipital and temporal cortex (Amaral 2003). herefore, veyed through language.
the amygdala is also critically placed to modulate visual input, Lesion studies have generally suggested that the amygdala
based on afective signiicance, at a variety of levels along the is not essential for recognizing or judging emotional and social
96
Analogy
information from explicit, lexical stimuli, such as stories (see In the aftermath of analogical reasoning about a pair of cases,
narrative, neurobiology of) (Amaral 2003, 33747). some form of relational generalization may take place, yielding a
However, while recognition of emotional and social information schema for a class of situations (Gick and Holyoak 1983).
may be relatively preserved in amygdalar damage, the aware-
ness that unpleasant emotions are arousing appears to be lost. In Psychological Research
a lesion study, normal subjects judge emotions such as fear and Within psychology, work in the intelligence tradition focused on
anger to be both unpleasant and highly arousing; however, patient four-term or proportional analogies, such as ARM: HAND :: LEG: ?
S. M. 046, who sustained early amygdalar damage, judged these Charles Spearman (1946) reviewed studies that found high cor-
same stimuli to be unpleasant but of low arousal. For example, relations between performance in solving analogy problems and
when told a story about someone driving down a steep mountain the g factor (general intelligence). he ability to solve analogy-
who had lost the car brakes, she identiied the situation as unpleas- like problems depends on a neural substrate that includes sub-
ant but also gave a highly abnormal judgment that it would make areas of the prefrontal cortex (Bunge, Wendelken, and Wagner
one feel sleepy and relaxed. It is interesting to note that S. M. 046 2005; see frontal lobe). Although there have been reports of
was able to judge arousal normally from positive emotions. great apes being successfully trained to solve analogy problems,
he human amygdala is important both for the acquisition these results are controversial (Oden, hompson, and Premack
and for the online processing of emotional stimuli. Its role is dis- 2001). Complex relational thinking appears to be a capacity that
proportionate for a particular category of emotional information, emerged in homo sapiens along with the evolutionary increase
that pertaining to the evaluation of potential threat in the envi- in size of the frontal cortex. he ability to think relationally
ronment. he amygdalas role in enhanced, modality-speciic increases with age (Gentner and Rattermann 1991). Greater
processing required for the rapid evaluation and response to sensitivity to relations appears to arise with age due to a com-
threat is highlighted. Furthermore, this review suggests conser- bination of incremental accretion of knowledge about relational
vation of phylogenetically older limbic mechanisms of emotional concepts (Goswami 1992), increases in working memory
evaluation in the context of more recently evolved language. capacity (Halford 1993), and increased ability to inhibit mis-
leading featural information (Richland, Morrison, and Holyoak
Nancy B. Isenberg 2006). Analogy plays a prominent role in teaching mathematics
(Richland, Zur, and Holyoak 2007).
WORKS CITED AND SUGGESTIONS FOR FURTHER READING Dedre Gentner (1983) developed the structure-mapping the-
Adolphs, R. 2001. he neurobiology of social cognition. Current ory of analogy, emphasizing that analogical mapping is guided
Opinions in Neurobiology 11.2: 2319. by higher-order relations relations between relations. Keith
Adolphs, R., S. Baron-Cohen, and D. Tranel. 2002. Impaired recognition Holyoak and P. hagard (1989) proposed a multiconstraint the-
of social emotions following amygdala damage. Journal of Cognitive ory, hypothesizing that people ind mappings that maximize
Neuroscience 14.8: 126474. similarity of corresponding elements and relations, structural
Amaral, D. G. 2003. he amygdala, social behavior, and danger detection.
parallelism, and pragmatic importance for goal achievement.
Annals of the New York Academy of Sciences 1000 (Dec.): 33747.
Several computational models of human analogical thinking
Freese, J. L., and D. G. Amaral. 2006. Synaptic organization of projections
from the amygdala to visual cortical areas TE and V1 in the macaque
have been developed. Two inluential models are SME (Structure
monkey. Journal of Comparative Neurology 496.5: 65567. Mapping Engine; Falkenhainer, Forbus, and Gentner 1989),
Isenberg, N., D. Silbersweig, A. Engelien, S. Emmerich, K. Malavade, B. based on a classical symbolic architecture, and LISA (Learning
Beattie, A. C. Leon, and E. Stern. 1999. Linguistic threat activates the and Inference with Schemas and Analogies; Hummel and
human amygdala. Proceedings of the National Academy of Sciences Holyoak 2005), based on a neural-network architecture. LISA has
USA 96.18: 104569. been used to simulate some efects of damage to the frontal and
Le Doux, J. 1996. he Emotional Brain: he Mysterious Underpinnings of temporal cortex on analogical reasoning (Morrison et al. 2004).
Emotional Life. New York: Touchstone.
Phelps, E. A. 2006. Emotion and cognition: Insights from studies of the Analogy and Language
human amygdala. Annual Review of Psychology 57: 2753. Analogy is related to metaphor and similar forms of symbolic
Phelps, E. A., K. J. OConnor, J. C. Gatenby, J. C. Gore, C. Grillon, and M.
expression in literature and everyday language. In metaphors,
Davis. 2001. Activation of the left amygdala to a cognitive representa-
the source and target domains are always semantically distant
tion of fear. Nature Neuroscience 4.4: 43741.
(Gentner, Falkenhainer, and Skorstad 1988). Rather than simply
comparing the source and target, the target is identiied with the
ANALOGY source (Holyoak 1982), either directly (e.g., Juliet is the sun)
or by applying a predicate drawn from the source domain to the
Two situations are analogous if they share a common pattern of
target (e.g., he romance blossomed). As a domain-general
relationships among their constituent elements, even though
learning mechanism linked to human evolution, analogy ofers
the elements are dissimilar. Often one analog, the source, is more
an alternative to strongly nativist views of language acquisition
familiar or better understood than the second analog, the target
(Vallauri 2004; see innateness and innatism). Gentner and
(see source and target). Typically, a target situation serves
L. L. Namy (2004) review evidence that analogical comparison
as a retrieval cue for a potentially useful source analog. A map-
plays important roles in speech segmentation, word learning,
ping, or set of systematic correspondences aligning elements
and possibly acquisition of grammar.
of the source and target, is then established. On the basis of the
mapping, it is possible to derive new inferences about the target. Keith Holyoak
97
Analogy: Synchronic and Diachronic
WORKS CITED AND SUGGESTIONS FOR FURTHER READING b2, b3 (etc.). here is some relation R between a1, a2, and a3,
Bunge, Silvia, C. Wendelken, and A. D. Wagner. 2005. Analogical rea-
expressed as R(a1,a2,a3), just as there is another such relation
soning and prefrontal cortex: Evidence for separable retrieval and inte- S between b1, b2, and b3, expressed as S(b1,b2,b3). For A and
gration mechanisms. Cerebral Cortex 15: 23949. B to be analogous, it is required that R and S be exempliica-
Falkenhainer, Brian, K. D. Forbus, and D. Gentner. 1989. he Structure- tions of the same abstract structure X, as evidenced by a map-
mapping engine: Algorithm and examples. Artiicial Intelligence ping between a1/a2/a3 and b1/b2/b3. his is what is meant by
41: 163. saying that analogy (e.g., between A and B) is a structural sim-
Gentner, Dedre. 1983. Structure-mapping: A theoretical framework for ilarity, or a similarity between relations (e.g., R and S). It is not
analogy. Cognitive Science 7: 15570. a material similarity, or a similarity between things (e.g., a1
Gentner, Dedre, B. Falkenhainer, and J. Skorstad. 1988. Viewing meta- and b1). More and more abstract analogies are constituted by
phor as analogy. In Analogical Reasoning: Perspectives of Artiicial
similarities between similarities between relations between
Intelligence, Cognitive Science, and Philosophy, ed. D. Helman, 1717.
things.
Dordrecht, the Netherlands: Kluwer.
Gentner, Dedre, K. J. Holyoak, and B. N. Kokinov , eds. 2001. he Analogical
In its purely synchronic use (see synchrony and dia-
Mind: Perspectives from Cognitive Science. Cambridge, MA: MIT Press. chrony), analogy is understood to be the centripetal force that
his book contains survey articles on topics in analogy. holds the units of a structure together. To simplify an example
Gentner, Dedre, and L.L. Namy. 2004. he role of comparison in chil- given by N. S. Trubetzkoy ([1939] 1958, 6066), in a structure
drens early word learning. In Weaving a Lexicon, ed. D. Hall and S. containing just /p/, /b/, /t/, and /d/, the phoneme /p/ acquires
Waxman, 53368. Cambridge, MA: MIT Press. the distinctive features voiceless and bilabial by being contrasted
Gentner, Dedre, and M. Rattermann. 1991. Language and the career of with, respectively, (voiced) /b/ and (dental) /t/. he relation
similarity. In Perspectives on hought and Language: Interrelations between the pairs /p/ & /b/ and /t/ & /d/ is the same, and so is
in Development, ed. S. Gelman and J. Byrnes, 22577.
the relation between /p/ & /t/ and /b/ & /d/, which means by
deinition that there is in both cases an analogy between the
Gick, Mary, and K. J. Holyoak. 1983. Schema induction and analogical
two pairs. his type of analogy-based analysis applies to any well-
transfer. Cognitive Psychology 15: 138.
Goswami, Usha. 1992. Analogical Reasoning in Children. Hillsdale, articulated structure, linguistic or nonlinguistic: A unit is what
NJ: Erlbaum. the other units are not (as /p/ is neither /b/ nor /t/ nor, of course,
Halford, Graeme. 1993. Childrens Understanding: he Development of /d/); and this otherness is based on corresponding oppositions
Mental Models. Hillsdale, NJ: Erlbaum. (like voiceless vs. voiced and bilabial vs. dental).
Holyoak, Keith. 1982. An analogical framework for literary interpreta- Synchronic analogy may be characterized as analogy-as-
tion. Poetics 11: 10526. structure. Its counterpart is analogy-as-process,that is, discovery,
Holyoak, Keith, and P. hagard. 1989. Analogical mapping by constraint manipulation, or invention of structural similarity. Traditionally,
satisfaction. Cognitive Science 13: 295355. language acquisition was thought to be based on analogy: From
. 1995. Mental Leaps: Analogy in Creative hought. Cambridge,
innumerable sentences heard and understood [the child] will
MA: MIT Press. his book provides a broad introduction to the nature
abstract some notion of their structure which is deinite enough
and uses of analogy.
to guide him in framing sentences of his own (Jespersen [1924]
Hummel, John, and K. J. Holyoak. 2005. Relational reasoning in a neu-
rally-plausible cognitive architecture: An overview of the LISA Project. 1965, 19). After a period of neglect, this traditional view has
Current Directions in Cognitive Science 14: 1537. again become fashionable in some quarters (Pinker 1994, 417;
Morrison, Robert, D. C. Krawczyk, K. J. Holyoak, J. E. Hummel, T. W. Tomasello 2003, 1639).
Chow, B. L. Miller, and B. J. Knowlton. 2004. A neurocomputational Only if analogy-as-process leaves a permanent trace that devi-
model of analogical reasoning and its breakdown in frontotemporal ates from the current norm is there reason to speak of language
lobar degeneration. Journal of Cognitive Neuroscience 16: 26071. change, the province of diachronic linguistics. Traditionally, the
Oden, David, R. K. R. hompson, and D. Premack. 2001. Can an ape rea- term analogical change was restricted to morphology, or to cases
son analogically? Comprehension and production of analogical prob- where irregularities brought about by sound change are elimi-
lems by Sarah, a chimpanzee (Pan Troglodytes). In Gentner, Holyoak,
nated so as to achieve, or to approach, the goal of one meaning
and Kokinov 2001, 47197.
one form. However, this same goal is involved in such large-scale
Richland, Lindsey, R. G. Morrison, and K. J. Holyoak. 2006. Childrens
changes as have generally been ascribed to a need for harmony
development of analogical reasoning: Insights from scene analogy
problems. Journal of Experimental Child Psychology 94: 24971. or symmetry. In syntactic change, analogy consists in extending a
Richland, Lindsey, O. Zur, and K. J. Holyoak. 2007. Cognitive supports reanalyzed structure to new contexts (Anttila 1989, 1024).
for analogy in the mathematics classroom. Science 316: 11289.
Esa Itkonen
Spearman, Charles. 1946. heory of a general factor. British Journal of
Psychology 36: 11731. WORKS CITED AND SUGGESTIONS FOR FURTHER READING
Vallauri, Edoardo. 2004. he relation between mind and language: he
innateness hypothesis and the poverty of the stimulus. Linguistic Anttila, Raimo. 1989. Historical and Comparative Linguistics.
Review 21: 34587. Amsterdam: Benjamins.
Jespersen, Otto. [1924] 1965. Philosophy of Grammar. London: Allen and
Unwin.
ANALOGY: SYNCHRONIC AND DIACHRONIC Pinker, Stephen. 1994. he Language Instinct. New York: Morrow.
Tomasello, Michael. 2003. Constructing a Language. A Usage-Based
Analogy involves two (or more) systems, A and B, which are con- heory of Language-Acquisition. Cambridge: Harvard University
stituted by their respective parts, that is, a1, a2, a3 (etc.) and b1, Press.
98
Analyticity
Trubetzkoy, N. S. [1939] 1958. Grundzge der Phonologie. Gttingen: expressing it. hus, our current linguistic conventions do not
Vandenhoeck and Ruprecht. make it true. hird, our irst example certainly does say some-
thing about the world. Indeed, it says something about every
ANALYTICITY object in the universe. If it is an uncle, then either it is married
or it has a sibling.
Analyticity is a property that a statement has when its truth is Some philosophers have called the sort of analyticity dis-
in some special way determined by its meaning. Many believe cussed so far, where things are said to be true in virtue of mean-
that no such property exists. Led by W. V. O. Quine (1953), phi- ing, metaphysical analyticity. his is distinguished from epistemic
losophers complain that no one has been able to deine the con- analyticity. A statement is epistemically analytic when under-
cept of analyticity in a way that is precise and that also fulills the standing it suices for being justiied in believing it (Boghossian
purposes to which it is typically put. 1996). While these considerations make trouble for metaphys-
To some people, it seems that all uncles are either mar- ical analyticity, they allegedly leave its epistemic counterpart
ried or have a sibling is true just because of the meaning of its untouched.
constituent words, most prominently because of the meaning he purpose of introducing epistemic analyticity is similar to
of uncle. But all uncles are less than eight feet tall is true not that of its older ancestor. he hope is that mathematical, logical,
because of meanings but because of how the world has turned and conceptual truths can be designated as a priori without pos-
out to be. he irst sort of statement is said to be analytic, the tulating a special faculty of reason or intuition. his is done by
second synthetic. building certain kinds of knowledge in as preconditions for pos-
his distinction has far-reaching interest and application. sessing or understanding concepts. If part of what it is to under-
Empiricists have always had diiculty accounting for the seem- stand the word uncle is to be disposed to accept that all uncles
ingly obvious fact that the truths of logic and mathematics are are either married or have a sibling, then it could be argued
both necessary (i.e., they could not be false) and a priori (i.e., they that once we understand that sentence, we know that it is true.
are known independently of sensory experience). For most of the No experience (beyond what is required for understanding) is
twentieth century, it was agreed that analyticity could explain necessary.
this obvious fact away as a merely linguistic phenomenon. he best candidates for epistemically analytic truths are sim-
he idea that all necessity could be explained away by ana- ple truths of logic. But even the most obvious logical truths are
lyticity fell out of fashion when S. Kripke (1980) convinced most not immune to challenge. For example, a few philosophers and
philosophers that some necessary truths are neither analytic nor logicians have claimed that some statements can be both true
a priori (e.g., water = H2O). But as L. BonJour (1998, 28) points and false (Preist 1987) and that modus ponens is invalid (McGee
out, many still think that the unusual modal and epistemic sta- 1985). Yet these sophisticated theoreticians certainly understand
tus of logic and mathematics is due to a special relation between the meanings of their own words. herefore, acceptance of some
truth and meaning. speciic truth of logic is not necessary for understanding any log-
However we apply the concept, trouble for analyticity begins ical concept. And since we might someday have good reason to
when we remind ourselves that every truth depends, to some reject any particular truth of logic while continuing to understand
extent, on the meanings of its constituent terms. If the word our logical concepts, understanding some logical concept is not
uncle had meant oak tree, then both previous examples would suicient for being justiied in believing any particular truth of
be false. In response, it is said that an analytic truth is one whose logic. And if logic is not epistemically analytic, nothing is.
truth depends solely on the meanings of its terms. Our linguistic hese considerations make the existence of analyticity dubi-
conventions alone make the irst example true, and the second is ous. But there still appears to be a deep diference between the
true party because of meaning and partly because of the way the two examples. If there is really a diference, it is not one of true in
world is. But how can linguistic convention alone make some- virtue of meaning versus true in virtue of reality, but one of nec-
thing true? essary and a priori versus contingent and empirical.
We can distinguish the sentence all uncles are either mar-
ried or have a sibling from the proposition that this sentence Michael Veber
now expresses. Meaning or linguistic convention alone makes
this sentence true in the following way. Given that this sentence
expresses the proposition that it does (i.e., given our current lin- BonJour, L. 1998. In Defense of Pure Reason. Cambridge: Cambridge
guistic conventions), it is true. his cannot be said of our other University Press.
example. hat this sentence means what it does is not suicient Boghossian, P. 1996. Analyticity reconsidered. Nous 30.3: 36091.
to determine its truth or falsity (see also sentence meaning). Kripke, S. 1980. Naming and Necessity. Cambridge: Harvard University
he world plays a part. Press.
McGee, V. 1985. A counterexample to modus ponens. Journal of
here are three serious problems. First, if this is what it is for
Philosophy 82: 46271.
a sentence to be true solely in virtue of its meaning, then it is just
Priest, G. 1987. In Contradiction. Boston: Kluwer.
another way of saying that it expresses a necessary truth, and that Quine, W. V. O. 1953. Two dogmas of empricism. In From a Logical
tells us nothing about how we know it. hus, appeal to analyticity Point of View, 2046. Cambridge: Harvard University Press.
cannot explain the necessity and a priority of logic, mathemat- Veber, M. 2007. Not too proud to beg (the question): Why inferential-
ics, or anything else. Second, the proposition now expressed ism cannot account for the a priori. Grazer Philosophische Studien
by our irst example would be true no matter how we ended up 73: 11331. A critique of Boghossian 1996 and similar views.
99
Anaphora Animacy
ANAPHORA restrictive conditions, some of these expressions may yet allow a

free logophoric interpretation.
Languages have expressions whose interpretation may involve
It is an important task to arrive at a detailed understanding
an entity that has been mentioned before: Subsequent reference to
of the ways in which languages encode interpretive relations
an entity already introduced in discourse approximates a general
between their expressions and of the division of labor between
deinition of the notion of anaphora (Sair 2004, 4). his works
the components of the language system involved.
well for core cases of nominal anaphora as in (1) (Heim 1982):
Eric Reuland
(1) a. his soldier has a gun. Will he shoot?
b. Every soldier has a gun. ??Will he shoot? WORKS CITED AND SUGGESTIONS FOR FURTHER READING
He in (1a) can be interpreted as the same individual as this soldier. Chomsky, Noam. 1981. Lectures on Government and Binding. Dordrecht,
In (1b) every soldier is quantiicational, hence does not denote an the Netherlands: Foris.
entity he can refer to, which makes anaphora impossible. Possible Heim, Irene. 1982. he semantics of deinite and indeinite noun phrases.
discourse antecedents are as diverse as soldiers, water, beauty, Ph.D. diss., University of Massachusetts at Amherst.
headaches, dissatisfaction, and so on. In addition to nominal Reinhart, Tanya. 1983. Anaphora and Semantic Interpretation.
expressions, sentences, verb phrases, prepositional phrases, London: Croom Helm.
Reuland, Eric. 2010. Anaphora and Language Design. Cambridge: MIT
adjective phrases, and tenses also admit anaphoric relations.
Press.
hus, the notion discourse entity must be broad enough to cap-
Sair, Ken. 2004. he Syntax of Anaphora. Oxford: Oxford University
ture all these cases of anaphora yet restrictive enough to separate
Press.
them from quantiicational cases such as every soldier.
he notion of anaphora is closely related to the notion of
interpretative dependency. For instance, in (2), he can depend for ANIMACY
its interpretation on every soldier, and here, too, it is said that he
Languages often treat animate and inanimate nouns diferently.
is anaphorically related to every soldier.
Animacy can afect many aspects of grammar, including word
2. Every soldier who has a gun says he will shoot. order, and verbal agreement. For example, in Navajo, the
more animate noun must come irst in the sentence (Hale 1973),
However, (1) versus (2) shows that two diferent modes of inter-
and in some Bantu languages, a more animate object must come
pretation must be distinguished: i) directly assigning two (or
before a less animate object. Verbs are more likely to agree with
more) expressions the same discourse entity from the inter-
more animate nouns (Comrie 1989). Animacy can also afect the
pretation domain (ID) as a value: co-reference as in (1a), and ii)
choice of case, preposition, verb form, determiner (article), or
interpreting one of the expressions in terms of the other by gram-
possessive marker (Comrie 1989).
matical means: binding (Reinhart 1983). his contrast is repre-
What counts as animate difers cross-linguistically. he gram-
sented in (3).
matical category of animates may include certain objectively
inanimate things, such as ire, lightning, or wind. Languages may
make additional distinctions among pronouns, proper nouns,
and common nouns, or between deinites and indeinites, and
these are sometimes viewed as part of an animacy hierarchy by
linguists (Comrie 1989). person and number distinctions may
also be included in an animacy hierarchy in this broader sense.
For example, according to Michael Silverstein (1976), subjects
with features at the less animate end of the Animacy Hierarchy in
(1) are more likely to be marked with morphological case, while
the reverse holds of objects.
Coreference in (3a) is restricted in terms of conditions on dis-
(1) Animacy Hierarchy
course entities, binding in (3b) in terms of grammatical conig-
1pl > 1sing > 2pl > 2sing > 3human.pl > 3human.sing >
uration. Expr1 can only bind expr2 if it c-commands the latter
3anim.pl > 3anim.sing > 3inan.pl > 3inan.sing
(Reinhart 1983). his condition is met in (2), but not in (1b),
hence the contrast. Dyirbal exhibits this pattern in that only third person sub-
Virtually all languages have words and expressions that are ref- jects have morphological case, whereas all human objects do.
erentially defective they cannot be used deictically (see deixis). Silverstein (1976) postulates that the function of such diferen-
In much of the linguistic literature, these are called anaphors, as tial case marking is to lag less animate subjects and more ani-
they appear specialized for anaphoric use. Examples vary from mate objects to avoid ambiguity. It is interesting to note that
English himself, Dutch zich(zelf), Icelandic sig, Russian sebja, the patterns of such diferential animacy marking are far more
Chinese (ta) ziji, to Georgian tav tavis, and so on. Such expres- complex and diverse cross-linguistically for objects than for sub-
sions cannot be assigned a discourse value directly. Rather, they jects (Aissen 2003). his may be traced to a relation between ani-
must be bound, often in a local domain approximately the macy and object shift, which produces an associated change in
domain of the nearest subject but subject to variation in terms case or verbal agreement (Woolford 2000, 2007). he less diverse
of speciic anaphor type and language. Furthermore, under animacy efects on subject case, which do not afect agreement
100
Animal Communication and Human Language
(Comrie 1991), may be purely morphological, markedness efects within this ield (e.g., Bradbury and Vehrenkamp 1998) reveals
(Woolford 2007). a great deal about the communicative behavior of many species
and its origins, but within essentially the same picture of what
Ellen Woolford
constitutes communication in (nonhuman) animals, conined to
unitary signals holistically transmitted and interpreted. Little if
WORKS CITED AND SUGGESTIONS FOR FURTHER READING any of what we have come to know about human linguistic com-
Aissen, Judith. 2003. Diferential object marking: Iconicity vs. economy. munication inds a place here.
Natural Language and Linguistic heory 21: 43583. Biologists have not, in general, paid much attention to the
Comrie, Bernard. 1989. Language Universals and Linguistic Typology. speciics of linguistic research (though their attention has been
Oxford: Blackwell. caught by the notion that human language is importantly based
. 1991. Form and function in identifying cases. In Paradigms, ed. in human biology) and are often not as sophisticated as one
F. Plank, 4155. Berlin: Mouton de Gruyter.
might wish about the complexity of natural language. But the
Hale, Kenneth L. 1973. A note on subject-object inversion in Navajo. In
consequences may not be as serious as linguists are inclined to
Issues in Linguistics: Papers in Honor of Henry and Rene Kahane, ed. B.
think. In fact, the communicative behavior of nonhumans in gen-
B. Kachru et al., 300309. Urbana: University of Illinois Press.
Silverstein, Michael. 1976. Hierarchy of features and ergativity. In eral is essentially encompassed within the simple signal-passing
Grammatical Categories in Australian languages, ed. R. M. Dixon, model. he complexities of structure displayed by human lan-
11271. Canberra: Australian Institute of Aboriginal Studies. guage are apparently quite unique to our species and may not
Woolford, Ellen. 2000. Object agreement in Palauan. In Formal Problems be directly relevant to the analysis of animal communication
in Austronesian Morphology and Syntax, ed. I. Paul , V. Phillips, and L. elsewhere.
Travis, 21545. Dordrecht, the Netherlands: Kluwer.
. 2007. Diferential subject marking at argument structure, syn- What (Other) Animals Do
tax, and PF. In Diferential Subject Marking, ed. H. de Hoop and P. de Communication in the sense of emission and reception of infor-
Swart, 1740. Dordrecht: Springer.
mative signals is found in animals as simple as bacteria (quorum
sensing). Most familiar, perhaps, are visual displays of various
sorts that indicate aggression, submission, invitations to mate,
ANIMAL COMMUNICATION AND HUMAN LANGUAGE
and so on. In some instances, these may involve quite complex
An understanding of the communicative capacities of other ani- sequences of gestures, reciprocal interactions, and the like, as in
mals is important on its face both for an appreciation of the place the case of the nesting and mating behavior of many birds. In
of human language in a broader context and as a prerequisite to others, a simple facial expression, posture, or manner of walking
discussion of the evolution of language (see, for example, evo- may provide the signal from which others can derive information
lutionary psychology). On closer examination, however, about the animals intentions and attitudes.
the diferences between human language and the systems of hese diferences of internal structure are, of course, crucial
other animals appear so profound as to make both projects more for the correct expression and interpretation of a particular sig-
problematic than they appear at irst. nal, but they play little or no role in determining its meaning. hat
In the 1950s and 1960s, ethologists like Konrad Lorenz and is, the individual components of the signal do not in themselves
Niko Tinbergen revolutionized behavioral biologists views of correspond to parts of its meaning, in the sense that varying one
the cognitive capacities of animals, but consideration of ani- subpart results in a corresponding variation in what is signaled.
mal communication focused on the properties of quite simple Animal signals, however complex in form (and however elaborate
systems. A prime example of communication in the texts of the message conveyed), are unitary wholes. An entire courtship
the time was the stickleback. A crucial component of the mat- dance, perhaps extending over several minutes or even longer,
ing behavior of this common ish is the pronounced red color- conveys the sense I am interested in mating with you, providing
ation of the males underbelly when he is in mating condition, a nesting place, and care for our ofspring. No part of the dance
which furnishes a signal to the female that she should follow him corresponds exactly to the providing care part of the message;
to his preconstructed nest, where her eggs will be fertilized. On the message cannot be minimally altered to convey I am inter-
this model, communication was viewed as behavioral or other ested in mating but not in providing care for our ofspring, I was
signals emitted by one organism, from which another organism interested in mating (but am no longer), and so on. Variations in
(typically, though not always, a conspeciic) derives some infor- intensity of expression can convey (continuous) variations in the
mation. he biological analysis of communication thus came to intensity of the message (e.g., urgency of aggressive intent), but
be the study of the ways in which such simple signals arise in the that is essentially the only way messages can be modulated.
behavioral repertoire of animals and come to play the roles they he most widely discussed apparent exception to this gen-
do for others who perceive them. hose discussions make little if eralization is the dance language of some species of honeybees.
any contact with the analysis of human language. he bees dance conveys information about a) the direction, b)
In the intervening half century, we have come to know vastly the distance, and c) the quality of a food source (or potential
more about the nature and architecture of the human lan- hive site), all on quasi-continuous scales and each in terms of
guage faculty and to have good reason to think that much of it a distinct dimension of the dance. Although the content of the
is grounded in human biology. One might expect, therefore, to message here can be decomposed, and each part associated
ind these concerns relected in the behavioral biology literature. with a distinct component of the form of the signal, there is no
A comprehensive modern textbook on animal communication element of free combination. Every dance necessarily conveys
101
Animal Communication and Human Language
exactly these three things, and it is only the relative value on each this regard. In general, their song is learned on the basis of early
dimension that is variable. As such, the degree of freedom avail- exposure to appropriate models, from which they in turn com-
able to construct new messages is not interestingly diferent from pose their own songs. It is interesting to note there appear to be
that involved in conveying diferent degrees of fear or aggression quite close homologies in the neurophysiology of vocal learn-
by varying degrees of piloerection. ing and perhaps even in its underlying genetic basis between
Visual displays do not at all exhaust the modalities in which birds and humans, although what is learned in birds is a uni-
animal communication takes place, of course. Auditory signals tary, holistic signal like those in other nonhuman communica-
are important to many species, including such classics of the ani- tion systems, rather than individual lexical items subject to free
mal communication literature as frog croaks and the calls and recombination to produce diferent meanings.
songs of birds (see birdsong and human language). In here is much variation across bird species, but a clear gen-
some species, portions of the auditory spectrum that are inac- eralization emerges: For each, there is a speciic range of song
cessible to humans are involved, as in the ultrasound communi- structures that individuals of that species can learn. Experience
cation of bats, some rodents, and dolphins, and the infrasound plays a role in providing the models on which adult song is based,
signals of elephants. Chemical or olfactory communication is but (with the exception of a few very general mimics, such as the
central to the lives of many animals, including moths, mice, and lyrebird) this role is quite narrowly constrained by the song-
lemurs, as well our pet cats and dogs. More exotic possibilities learning system of the individual species.
include the modulation of electric ields generated (and per-
ceived) by certain species of ish. What Humans Do, and How It Is Different
In some of these systems, the internal structure of the signal Like the systems of communication of other animals, human
may be quite complex, as in the songs of many oscine songbirds, language is deeply embedded in human biology. Unlike others,
but the general point still holds: However elaborate its internal however, it provides an unbounded range of distinct, discrete
form, the signal has a unitary and holistic relation to the message messages. Human language is acquired at a speciic point in
it conveys. In no case is it possible to construct novel messages development from within a limited range of possibilities, simi-
freely by substitutions or other ways of varying aspects of the sig- lar to the acquisition of song in birds. Unlike the communicative
nals form. signals of other species, human language is under voluntary con-
In most animals, the relation of communicative behavior to trol, with its underlying neurobiology concentrated in cortical
the basic biology of the species is very direct. Perceptual systems structures, as opposed to the subcortical control characteristic of
are often quite precisely attuned to signals produced by con- those other species that have been studied in this regard.
speciics. hus, the frogs auditory system involves two separate Human language is structurally a discrete combinatorial sys-
structures (the amphibian papilla and the basilar papilla) that are tem, in which elements from a limited set combine in a recursive,
sensitive to acoustic signals, typically at distinct frequencies. he hierarchical fashion to make an unlimited number of potentially
frequencies to which they are most sensitive vary across species novel messages (see recursion, iteration, and metarep-
but are generally closely related to two regions of prominence resentation). he combinatorial structure of language is gov-
in the acoustic structure of that species calls. Mice (and many erned by two quite independent systems: A small inventory of
other mammals) have two distinct olfactory organs, projecting individually meaningless sounds combine to make meaningful
to quite distinct parts of the mouse brain. he olfactory epithe- words, on the one hand (phonology), while these words are
lium is responsive to a wide array of smells, but the vomeronasal combined by a quite diferent system to make phrases, clauses,
organ is sensitive primarily to the pheremones that play a major and sentences (see syntax). hese properties (discrete com-
role in communication and social organization. In this case, as in bination, recursive hierarchical organization, and duality of pat-
many, many others, the perceptual system is matched to produc- terning) are not simply idiosyncratic ornaments that could in
tion in ways that optimize the organisms sensitivity to signals principle be omitted without afecting the overall communicative
that play a crucial ecological role in the life of the animal. capacity of the system. Rather, they are what make large vocabu-
he essential connection between a species system of com- laries practical and unbounded free expression possible. Contrast
munication and its biology is also manifested in the fact that the unlimited range of potentially novel utterances that any (nor-
nearly all such systems are innately speciied. hat is, the ability mal) speaker of a language can produce, and another speaker of
to produce and interpret relevant signals emerges in the individ- the same language comprehend, with the strictly limited range of
ual without any necessary role of experience. Animal commu- meaningful signals available to other organisms. No other form of
nication is not learned (or taught) but, rather, develops (in the communication found in nature has these properties. Although
absence of speciic pathology, such as deafness) as part of the song in some species of birds does display a limited amount of
normal course of maturation. Animals raised under conditions phonological combinatoriality, there is no analog even here to
in which they are deprived of exposure to normal conspeciic meaningful syntax. Human language, and especially its syntactic
behavior will nonetheless communicate in the fashion normal to organization, is quite unique in the animal world.
their species when given a chance. Furthermore, eforts to teach systems with these essential
Exceptions to this generalization are extremely rare, apart properties to other animals have not succeeded. Despite wide-
from human language. Vocal learning, in particular, has been spread claims to the contrary in the popular literature, there is
demonstrated only to a limited extent in cetaceans and some bats no evidence that any nonhuman animal is capable of acquiring
and, more extensively, in 3 of the 27 orders of birds. he study of and using such a system. his should not be seen as particularly
birds, especially oscine songbirds, is particularly instructive in surprising. If language is indeed embedded in human biology,
102
Aphasia
there is no reason to expect it to be accessible to organisms with emission tomography (PET), and functional MRI (fMRI) have
a diferent biological endowment, anymore than humans are helped deine functions of core speech and language areas.
capable of acquiring, say, the echolocation capacities of bats, a
system that is equally grounded in the speciic biology of those Aphasia Syndromes
animals. Aphasia has traditionally been categorized into seven subtypes,
including Brocas, Wernickes, global, anomic, conduction,
Conclusion transcortical sensory, and transcortical motor. hese aphasia
Human language is often considered as simply one more instan- variants are characterized by diferent patterns of speech luency,
tiation of the general class of animal communication systems. auditory comprehension, repetition, and naming. Although
Indeed, like others it appears to be highly species speciic. patients may be classiied as having one type of aphasia in the
Although relevant experience is required to develop the system early period after a brain injury, this classiication may change as
of any particular language, the overall class of languages acces- language problems resolve with time and treatment.
sible to the human learner is apparently highly constrained, and Brocas aphasia is characterized by a constellation of symp-
the process of language learning is more like genetically gov- toms, including slow, halting speech with impaired grammar;
erned maturation than like learning in general. he structural disturbed auditory comprehension for grammatically complex
characteristics of human language are quite diferent from those phrases and sentences; and poor repetition. Word-inding prob-
of other communication systems, and it is the freedom of expres- lems and diiculty with reading and writing are common. Motor
sion subserved by those distinctive properties that gives language speech disorders, such as apraxia of speech, a disorder of articu-
the role it has in human life. (See also primate vocalizations latory planning or coordination, and dysarthria, an impairment
and grooming, gossip, and language). in muscle strength, tone, or coordination, very often co-occur.
Patients with Brocas aphasia often talk in a series of nouns and
Stephen R. Anderson
verbs, as is the case in the following sample from a patient describ-
ing the picnic scene from the Western Aphasia Battery (WAB):
I know it tree house car man with uh woman kid
Anderson, Stephen R. 2004. Doctor Dolittles Delusion: Animals and the over here with lag i can know it nice sun shiny day
Uniqueness of Human Language. New Haven, CT: Yale University
[unintelligible word]
Press.
Bradbury, J. W., and Sandra Vehrenkamp. 1998. Principles of Animal Patients with Brocas aphasia can participate relatively well
Communication. Sunderland, MA: Sinauer. in everyday conversation by using single words or phrases,
Hauser, Marc D. 1996. he Evolution of Communication. Cambridge, often combined with meaningful gestures and facial expres-
MA: MIT Press. sions. Some patients use writing and drawing to compensate for
restricted verbal output. As is true with all aphasia syndromes,
there is a wide range of symptom severity. hose with severe
APHASIA
Brocas aphasia may present with such profound verbal deicits
Aphasia is a language impairment caused by brain injury that that their speech is limited to a single recurrent utterance (e.g.,
afects speech content, auditory comprehension, reading, and yeah, yeah). In these patients, comprehension is usually pre-
writing to varying degrees. Mild aphasia may result in occasional served for short, simple phrases, but is signiicantly impaired for
word-inding problems, while more severe forms can cause pro- more complex information.
found deicits in all language domains. Aphasia difers from Classic aphasia models assume that lesions to brocas area
developmental disorders in that it occurs after a brain injury to a (See Figure 1) result in Brocas aphasia, but research has indi-
person with otherwise normal language skills. cated that this is not always the case. Reports as early as 1870
Typically, aphasia results from damage to the left hemi- documented cases that did not support this linear relationship
sphere of the brain due to stroke, traumatic brain injury, tumor, (e.g., Bateman 1870; Marie 1906; Moutier 1908; Dronkers et al.
or degenerative neurological disease. Nearly all right-handed 2007). Modern research has found that chronic Brocas aphasia
individuals and most left-handers are thought to have core lin- typically results from large lesions that encompass left frontal
guistic functions semantics, phonology, syntax, and brain regions, underlying white matter, the insula, and the ante-
morphology lateralized to the left hemisphere, while other rior parietal lobe. Lesions restricted to Brocas area tend to
aspects of language, speciically prosody and pragmatics, are cause transient mutism that spontaneously resolves within days
associated with the right hemisphere. Approximately one or weeks (Mohr 1976). In some cases, Brocas aphasia can occur
million people in the United States are alicted with aphasia, without damage to Brocas area (e.g., Basso et al. 1985; Mazzocchi
which is a prevalence rate similar to that of Parkinsons disease. and Vignolo 1979).
Roughly 80,000 more acquire aphasia annually. Patients with Wernickes aphasia present a reverse pattern of
Recent advances in the study of language have provided greater symptoms when compared to those with Brocas aphasia: While
insight into aphasia syndromes. Modern neuroimaging tech- speech is luent, comprehension is impaired. Patients speak in
nology has helped reine classic models of language localization, a normal or rapid rate. However, they often use meaningless
as well as our understanding of aphasia treatment and recovery. words, jargon, or semantic paraphasias (e.g., using a related
In particular, brain-imaging techniques such as magnetic reso- word, bike for a target word, car). Reading and writing may
nance imaging (MRI), computerized tomography (CT), positron be similarly disrupted. he following exempliies the speech
103
Aphasia
Figure 1. Several of the key brain regions affected

in aphasia. Areas depicted as typical lesions are
derived from patient data obtained at the Center
for Aphasia and Related Disorders.
content of a patient with Wernickes aphasia describing the Initial reports that conduction aphasia arose from lesions
WAB picnic scene: to the arcuate fasciculus (the white matter tract connecting
Wernickes and Brocas areas; see Figure 1) have been reined
And the man and hers and Ill say I dont think shes working. over the years. Modern studies have shown that conduction
heyre not doing the thing. hen the ladder then the tree aphasia results most often from lesions to the posterior supe-
and the /let/ [points to kite] and lady here [points to boy] rior temporal gyrus (Dronkers et al. 1998), the auditory cortex
have to clean that. (Damasio and Damasio 1980), or periventricular white matter
underlying the supramarginal gyrus (Sakurai et al. 1998).
In contrast to patients with Brocas aphasia, those with
Global aphasia, the most severe syndrome, is characterized
Wernickes aphasia may understand very little in conversation
by profound impairments in all language modalities. Speech,
because of their impaired comprehension of single words. In
auditory comprehension, naming, repetition, reading, and writ-
addition, successful communication is made challenging by ver-
ing are all afected, leaving the patient with very little functional
bal output that is empty, coupled with an inability to monitor
communication. Speech may be limited to single stereotyped
speech content. Using visual information to compensate for com-
or automatic words and phrases (e.g., yes, no, I dont know).
prehension deicits is often beneicial (e.g., providing pictures,
Auditory comprehension may be impaired for even simple yes/
drawing, or writing key words during conversational exchanges).
no questions. Such a severe loss of language typically results
Persisting cases of Wernickes aphasia are not caused by injury
from a large cortical lesion, encompassing the frontal, temporal,
to wernickes area alone but, rather, by much larger lesions
and parietal lobes. Patients often rely on preserved nonverbal
afecting most of the middle temporal gyrus and underlying white
skills to aid in communication (e.g., the recognition of pictures
matter (Dronkers, Redfern, and Ludy 1995; see Figure 1). Such
and gestures to support auditory comprehension and the ability
damage amounts to a poorer prognosis for recovery. Patients
to draw or gesture to aid in expression).
with lesions conined to Wernickes area tend to have symptoms
Anomic aphasia, the mildest of the syndromes, results in
of Wernickes aphasia that resolve, resulting in milder forms of
word-inding deicits (anomia), while other language skills are
aphasia, most often conduction aphasia or anomic aphasia, if
typically well preserved. When attempting to ind a target word,
the lesion spares the middle temporal gyrus.
patients with anomic aphasia may describe its function or use a
Conduction aphasia is a luent aphasia characterized by an
synonym. Speech may be slow and halting, due to anomia, but
inability to repeat. Auditory comprehension is relatively pre-
grammar is unafected. Anomic aphasia can result from lesions
served, and patients use speech that is largely understandable
anywhere within the perisylvian region.
but may be rife with phonemic paraphasias (substituting sounds
he transcortical aphasias are rare and characterized by a
in words, e.g., netter for letter). While high-frequency words
preserved ability to repeat, despite impairments in other lan-
and short phrases may be repeated accurately (e.g., the tele-
guage domains. Transcortical motor aphasia (TCMA) is simi-
phone is ringing), low-frequency items are more diicult (e.g.,
lar to Brocas aphasia, in that patients present with nonluent
irst British ield artillery). Patients may retain the meaning
speech and relatively intact comprehension, but repetition skills
of such phrases, owing to their preserved comprehension, but
are markedly well preserved. Lesions typically spare core lan-
the phonological trace is disrupted, thereby disturbing verba-
guage areas, are smaller than those that cause Brocas aphasia,
tim repetition. he following typiies the speech of a patient with
and are restricted to anterior and superior frontal lobe regions.
conduction aphasia, again describing the WAB picnic scene:
Although patients may be mute initially, their symptoms tend
Well theres a house near a clearing, evidently its on the water. to resolve quickly, resulting in anomic aphasia. Patients with
Further, theres a stick with a banner in the foreground transcortical sensory aphasia (TCSA) present much like patients
[referring to the lag]. I dont know what thats called a pier with Wernickes aphasia, with empty, luent speech and poor
a tier? heres a bucket and a /kovel/. It looks like theres comprehension, but they too retain a striking ability to repeat.
someone playing in the water. Lesions typically involve portions of the posterior temporal
104
Aphasia Areal Distinctness and Literature
and parietal regions, but tend to be much smaller than those of Dronkers, N. F., O. Plaisant, M. T. Iba-Zizen, and E. A. Cabanis. 2007.
Wernickes aphasia. Acute symptoms usually resolve to produce Paul Brocas historic cases: High resolution MR imaging of the brains
an anomic aphasia. of Leborgne and Lelong. Brain 130: 143241.
While aphasia most often occurs suddenly, as the result Dronkers N. F., B. B. Redfern, C. Ludy, and J. Baldo. 1998. Brain regions
associated with conduction aphasia and echoic rehearsal. Journal of
of injury, a degenerative form of aphasia was irst described
the International Neuropsychological Society 4: 234.
over a century ago by Arnold Pick, a Czech neurologist, and
Gorno-Tempini, M. L., N. F. Dronkers, K. P. Rankin, et al. 2004. Cognition
later expanded upon by Marsel Mesulam in a landmark paper and anatomy in three variants of primary progressive aphasia. Annals
in which he described six patients who presented with lan- of Neurology 55: 33546.
guage deicits, in the absence of other behavioral abnormalities Holland, A. L., D. S. Fromm, F. DeRuyter, M. Stein. 1996. Treatment ei-
(Mesulam 1982). Speech or language deicits remained the only cacy. Journal of Speech and Hearing Research 39.5: S2736.
impairment for the irst two years in these patients, but as the Marie P. 1906. Revision de la question de laphasie: La troisieme circon-
disease progressed, more generalized dementia emerged. his volution frontale gauche ne joue aucun role special dans la fonction du
progressive disorder was distinct from other dementias, such as langage. Semaine Medicale 26: 2417.
Alzheimers disease, because language problems, rather than Mazzocchi, F., and L. A. Vignolo. 1979. Localization of lesions in
memory complaints, were the most salient symptoms. aphasia: Clinical CT-scan correlations in stroke patients. Cortex
15: 62754.
Since then, numerous cases of what is now termed pri-
Mesulam, M. M. 1982. Slowly progressive aphasia without generalized
mary progressive aphasia (PPA) have been described, in which
dementia. Annals of Neurology 11: 5928.
patients present with both luent and nonluent variants of the Mohr, J. P. 1976. Brocas area and Brocas aphasia. In Studies in
disorder (Snowden, et al. 1992; Gorno-Tempini et al. 2004). Neurolinguistics. Vol. 1. Ed. H. Whitaker and H. Whitaker, 20133. New
Neuroimaging typically shows left perisylvian atrophy, encom- York: Academic Press.
passing frontal regions in progressive nonluent aphasia and Moutier, F. 1908. LAphasie de Broca. Paris: Steinheil.
anterior temporal and temporo-parietal regions in the more lu- Sakurai, Y., S. Takeuchi, E. Kojima, et. al. 1998. Mechanism of short-
ent semantic dementia and logopenic variants. here are many term memory and repetition in conduction aphasia and related cogni-
underlying pathologies that cause the clinical syndrome of PPA, tive disorders: a neuropsychological, audiological and neuroimaging
including Picks disease, progressive supranuclear palsy, corti- study. Journal of Neurological Sciences 154.2: 18293.
Snowden, J. S., D. Neary, D. M. Mann, et al. 1992. Progressive language
cobasal degeneration, dementia lacking distinctive pathology,
disorder due to lobar atrophy. Annals of Neurology 31: 17483.
and Alzheimers disease.
Treatment for Aphasia

AREAL DISTINCTNESS AND LITERATURE
Critical reviews of aphasia treatment studies (e.g., Bhogal,
Teasell, and Speechley 2003; Holland et al. 1996) have shown here are two criteria for determining whether a linguistic prop-
that treatment can be efective in improving language skills past erty is a universal. First, it must occur across languages with a
the point that might be expected from spontaneous recovery frequency greater than chance. Second, the presence of the prop-
alone. Although it remains diicult to predict the treatment that erty in some of these languages should not have been caused by
will result in the greatest amount of change for an individual, its presence in other languages. In linguistics, the causal crite-
there are many options from which to choose. rion is often operationally speciied into two subcriteria genetic
Patients with aphasia are typically referred to speech lan- and areal distinctness, which is to say, distinctness in origin and
guage pathologists for diagnostic testing aimed at developing in cross-language interaction.
treatment goals. herapy may focus on improving impaired skills Researchers in literary universals also adopt the pre-
or developing compensatory strategies to overcome obstacles to ceding criteria. However, literature is diferent from language
successful communication. Patient-speciic factors (e.g., aphasia in being more readily open to inluence. Speciically, the opera-
severity, cognitive ability, general health, and motivation) also tional criterion of areal distinctness becomes much more diicult
inluence treatment decisions. Research is inconclusive, how- to satisfy in the case of literature. Even a single work, transported
ever, as to the prognostic weight that these variables contribute across continents, may produce signiicant changes in the recipi-
to recovery and treatment planning for an individual. ent literature.
here are three ways of responding to this problem. he irst is
Nina F. Dronkers, Jennifer Ogar
to focus on literary works produced before the period of extensive
global interaction. Research of this sort must form the primary basis
for any serious study of literary universals. Moreover, such research
Basso, A., A. R. Lecours, S. Moraschini, and M. Vanier. 1985. Anatomo- indicates that there are some signiicant universals, for example the
clinical correlations of the aphasias as deined through computerized narrative universals of heroic, romantic, and sacriicial tragi-
tomography: On exceptions. Brain and Language 26: 20129. comedy. However, this approach to areal distinctness cannot be as
Bateman, F. 1870. On Aphasia. London: Churchill.
rigorous as one might like. Global interaction extends back through
Bhogal, S. K., R. Teasell, and M. Speechley. 2003. Intensity of aphasia
the formation of all the major literary traditions.
therapy, impact on recovery. Stroke 34.4: 98793.
Damasio, H., and A. R. Damasio. 1980. he anatomical basis of conduc-
he second response involves a more nuanced approach for
tion aphasia. Brain 103: 33750. isolating inluence from a source tradition to a recipient tradi-
Dronkers, N. F., B. B. Redfern, and C. A. Ludy. 1995. Lesion localization tion. Here, we may distinguish between self-conscious and
in chronic Wernickes aphasia. Brain and Language 51: 6265. implicit learning. Self-conscious learning can occur with a single
105
Art, Languages of
exposure to salient features of a literary work. Implicit learning, made from symbol systems that have one or more of the symp-
however, is likely to require many exposures, commonly while toms of the aesthetic: syntactic density, semantic density, syntac-
immersed in the culture and language of the source tradition. In tic repletenesss, and exempliication. hese notions are deined
isolating literary universals, then, we may take into account the later in this entry.
degree to which a particular property is likely to have been trans- According to Goodman, A symbol system consists of a sym-
ported from one tradition to another by learning of either sort, bol scheme correlated with a ield of reference ([1968] 1976,143).
given the degree of contact between the traditions. For example, Goodmans primary interest in deining a symbol system is to
the practice of dramatic performance may be transmitted from diferentiate the notational from the non-notational schemes,
one tradition to another through limited interaction, as this may where notation is a technical notion to which his Chapter 4
be learned through a single exposure. he same point does not is devoted. His concern about notations follows from a concern
hold for background imagery. about forgeries and fakes and with the fact that some types of
Finally, we may wish to expand our study of cross-cultural art (such as painting) can be faked while others (such as per-
patterns to actual borrowings. Here, too, it is crucial to distin- formance of a speciic piece of music) cannot. Where a work is
guish diferent types of inluence. We may roughly divide inlu- deined by compliance to a score (i.e., it has a notation), it cannot
ence into two categories hegemonic and nonhegemonic. be faked; such works are called allographic. Where a work is not
Hegemonic inluence occurs when the source tradition has deined by compliance to a score, as in the case of a painting, its
greater economic power (e.g., in the publication and distribu- authenticity can be established only by tracing the history of its
tion of literary works), more pervasive control of government production back to its origin, and this permits faking; such works
or education, or a higher level of prestige (due, for example, to are called autographic.
military strength), or when it is otherwise in a position of cul- A symbol system is built on a symbol scheme, which con-
tural domination over the recipient society. Obvious cases are sists of characters (and usually modes of combination for these
to be found in colonialism. Common properties that result characters). For example, for a natural language, a character is a
from non-hegemonic inluences are not universals themselves. class of marks, where marks might include anything from single
However, they may tell us something about cross-cultural aes- sounds or letters up to whole spoken or written texts, as in the let-
thetic or related propensities. Common properties that result ter P, a character that is the class whose members are all the writ-
from hegemonic inluences, in contrast, may simply relect the ings-down of the letter P. Symbol systems are either notations or
efects of power. not notations. If the symbol system is a notation, the characters
of which its scheme is comprised must meet two conditions, as
Patrick Colm Hogan
follows:
WORK CITED AND SUGGESTIONS FOR FURTHER READING (1) For a character in a notation, the members can be
interchanged, where diferent characters can be true cop-
Comrie, Bernard. 1981. Language Universals and Linguistic Typology.
ies of one another; this is called the condition of character-
Chicago: University of Chicago Press.
Hogan, Patrick Colm. 2005. Literary universals and their cultural tra- indiference and is true, for example, of letters of the English
ditions: he case of poetic imagery. Consciousness, Literature, and alphabet.
the Arts 6.2. Available online at: http://www.aber.ac.uk/cla/archive/ (2) Characters in a notation must be initely diferentiated
hogan.html. or articulate; for a mark that does not belong to two charac-
ters, it must be theoretically possible to determine that it does
ART, LANGUAGES OF not belong to at least one of them (this is explained further
shortly).
Languages of Art, a book by Nelson Goodman (190698), was
(1, 2) are the two syntactic requirements that deine a sym-
irst published in 1968, with a second edition in 1976. he pres-
bol system as notational.
ent entry focuses solely on this book, which raises interesting
questions about language in a general sense and its role in Characters in a scheme are correlated with things outside
aesthetic experience. his entry does not attempt to contextu- the scheme. For example, the marks that make up a score are
alize Goodmans book relative to his philosophy, for which see correlated with elements in the performance of the score; the
Daniel Cohnitz and Marcus Rosenberg (2006) and Catherine Z. mark that is a written word is correlated with a pronunciation of
Elgin (1992); Goodmans later and related Ways of Worldmaking that word; and the mark that is a written word is (also and inde-
is also recommended (Goodman and Elgin 1978). pendently) correlated with that words referent. Goodman uses
By languages (of art), Goodman means more generally sym- the term complies and says that the performance complies
bol systems; natural language is one of the symbol systems, which with the score, or the referent, or pronunciation, complies with
include, for example, musical notation or the symbol system of the written word. he set of things that comply with an inscrip-
cubist painting. Certain characteristics of symbol systems, when tion (e.g., the set of things named that can be denoted by a
used in an artwork, place cognitive demands on its audience, name) is called the compliance class of the inscription. For the
which make the artwork good to think (to borrow Claude Lvi- symbol system to be a notation, it must irst include a symbol
Strausss term). he symbol systems from which artworks are scheme that is notational (i.e., that satisies the two syntactic
composed enable us to be exploratory, drawing on our cognitive conditions), and it must also satisfy three semantic conditions,
(including emotional) resources. his is because artworks are as follows.
106
Art, Languages of
(3) Notational systems must be unambiguous; it must be of the outline of Mount Fuji. What makes the electrocardiogram
clear which object complies with each unique element of the a diagram is that not every aspect of its form is relevant; the line
scheme. can vary in thickness or color without constituting a diferent
(4) In a notation, compliance classes must be disjoint; for character. In contrast, every aspect of the form of the picture is
example, a performance cannot comply with two diferent relevant; pictures have much fewer contingent features than dia-
scores. grams, and pictures are thus said to be relatively (syntactically)
replete. he diference between diagram and picture is a matter
(5) A notational system must be semantically initely difer-
of degree; repleteness is a relative characteristic.
entiated; for an object that does not comply with two char-
Goodman concludes his book by using these notions to
acters, it must be theoretically possible to determine that the
develop four symptoms of the aesthetic. Objects have aes-
object does not comply with at least one of them.
thetic symptoms when they use symbol systems that are syn-
he notion of inite diferentiation is important both in the tactically dense, semantically dense, and syntactically replete.
syntax and semantics of notational systems; inite diferentiation he fourth symptom of the aesthetic is that aesthetic objects
is articulation, and its lack constitutes density. As we will see, exemplify. (In Ways of Worldmaking, Goodman explores the
though articulation is important for a notational system, density notion of style and proposes that the style of an artwork is one
is more generally important in deining works as aesthetic. Finite of the referents exempliied by its symbols, where style consists
diferentiation requires gaps between elements in the system of those features of the symbolic functioning of a work that are
(between characters, or between compliants); if between two characteristic of author, period, place or school [1978, 35]. In the
adjacent elements a third can always be inserted, the scheme same book, he introduces a ifth symptom of the aesthetic, which
lacks inite diferentiation. For example, a scheme lacks inite is multiple and complex reference.) Note that the irst three of
diferentiation if it has two characters, where all marks not lon- these symptoms are characteristic of non-notational symbol sys-
ger than one inch belong to one character and all longer marks tems; all three are associated with density in some more gen-
belong to the other, and where marks can be of any length. eral sense, which, Goodman says arises out of, and sustains, the
Between a mark belonging to the character of marks not longer unsatisiable demand for absolute precision, thus engaging our
than one inch and a mark belonging to the character of longer interest in aesthetic works ([1968] 1976, 253).
marks, it is always (theoretically) possible to have a third that Goodman concludes his discussion by asking what gives an
falls between them (this ever-diminishing between-space is a aesthetic object its value, both relative to other aesthetic objects
kind of Derridean mise-en-abme). and, more generally, to us: What makes us want to know it? He
A symbol system is called a notation if it meets the ive condi- argues that aesthetic objects invite our interest by asking us to
tions. Goodman asks whether various types of symbol systems understand what they are, including how their symbol systems
that have been developed in the arts are notations. (A type of operate, and what they exemplify; these tasks are made particu-
artistic practice may be non-notational just because no notation larly diicult by the four symptoms of the aesthetic, which thus
has been developed for it; in principle, notations might be devel- particularly stimulate our interest in aesthetic objects. He sum-
oped for all of them, but in practice they have not been.) A tra- marizes three criteria, drawn from general ideas about aesthet-
ditional musical score is a character in a notational system. he ics: Engagement with artworks improves our itness to cope with
compliants of the score are the performances, which collectively the world, just manifests our playfulness (i.e., homo ludens),
constitute the work of music. Similar comments are made for or communicates special kinds of knowledge to us. hese are
Labanotation, a scoring system developed for dance. A literary partial insights into the primary purpose of our engagement with
work is a character in a notational scheme (but not a character aesthetic objects: he primary purpose is cognition in and for
in a notational system): Like the language from which it is com- itself; the practicality, pleasure, compulsion, and communica-
posed, it meets the syntactic requirements for a notation, but not tive utility all depend on this ([1968] 1976, 258). he symbol
the semantic requirements. A painting is a work that is in a sym- systems or languages of art serve this purpose, allowing
bol system that is not notational. for the possibility of producing symbolic objects that engage us.
Having developed these notions, Goodman uses them as Furthermore, the characteristic density of the symbolic systems
a way of deining a representation, a problem raised and not used in artworks, and the characteristic unparaphrasability of
solved in the irst part of the book, where, for example, he argues what they express both permit a person to reenter the same art-
that we cannot distinguish a representation by criteria such work and repeatedly to discover new things in it.
as resemblance to the represented object. Representation for
Nigel Fabb
Goodman is distinct from description (i.e., the term representa-
tion does not correspond to current cognitive science or linguistic WORKS CITED AND SUGGESTIONS FOR FURTHER READING
uses, in which propositions or tree structures are representa-
Cohnitz, Daniel, and Marcus Rosenberg. 2006. Nelson Goodman.
tions). A description uses a symbol scheme that is (syntactically)
Montreal: McGill-Queens University Press.
articulate, whereas a representation uses a symbol system that is
Elgin, Catherine Z. 1992 . Depiction. In A Companion to Aesthetics, ed.
dense (or uses symbols from a dense part of a symbol scheme).
David Cooper, 11316. Oxford: Blackwell.
He distinguishes between two types of dense (representational) . 1992. Nelson Goodman. In A Companion to Aesthetics, ed. David
schemes, diferentiating a diagram from a picture. His example is Cooper, 1757. Oxford: Blackwell.
a pair of representations that are visually identical, consisting of a Goodman, Nelson. [1968] 1976. Languages of Art. 2d ed.
peaking line, one an electrocardiogram and the other a picture Indianapolis: Hackett.
107
Articulatory Phonetics
. 1988. Reconceptions in Philosophy and Other Arts and Sciences. Sound is short-term variations or disturbances in ambient
London: Routledge. air pressure. hese pressure disturbances are created when air
Goodman, Nelson, and Catherine Z. Elgin. 1978. Ways of Worldmaking. moves from a region of high pressure to a region of low pressure.
Indianapolis: Hackett. here are three piston-like articulatory movements that can
create such pressure diferentials with respect to atmospheric
ARTICULATORY PHONETICS pressure. hese, which J. C. Catford calls the initiation mecha-
nisms, are pulmonic, glottalic, and velaric. hese mechanisms
Articulatory phonetics is that part of phonetics that studies how can either create a positive pressure vis--vis atmospheric pres-
speech is produced by the lips, tongue, velum (soft palate), lar- sure, in which case they are called egressive, or a negative pres-
ynx, and lungs to alter the air pressures and airlows and turbulent sure, and then they are called ingressive.
noises in the vocal tract and to create the air spaces that yield the Pulmonic egressive initiation is by far the most common. All
resonances that diferentiate speech sounds. he basic vocabulary languages use it and most use it exclusively. he chest cavity, by
of articulatory phonetics is used for the taxonomic, that is, clas- virtue of decreasing its volume as in normal respiratory expira-
siicatory, description of speech sounds. For example, the initial tion, compresses the air in the lungs, thus raising lung pressure
sound in the French word pre, father, would be described as above that of the atmospheric pressure. Since speech neces-
a voiceless bilabial stop, symbolized using the IPA (International sarily involves valves that impede the exiting airlow (e.g., the
Phonetic Alphabet) symbol [p]. he initial sound in the English adducted vocal cords and/or whatever articulations are made in
word pear would be described as a voiceless aspirated bilabial the oral cavity), the pulmonic or subglottal pressures developed
stop and symbolized using the IPA symbol [ph]. However, besides in speech are much larger than those seen in quiet respiratory
this essential taxonomic function, articulatory phonetics stud- expiration. Because such initiation is so common, it is normally
ies the mechanisms (e.g., muscular, aerodynamic) that produce not included in the usual phonetic descriptions; for example,
speech and especially the how and why of variability in speech the [p] in French pre, which would otherwise be described as
sounds. he following is a brief overview of the subject; for in- pulmonic expiratory voiceless bilabial stop, is usually designated
depth accounts, the readings listed below should be consulted. simply as voiceless bilabial stop.
Pulmonic ingressive initiation (so-called ingressive voice)
History is possible and is encountered in many cultures, notably in
he study of speech articulation and the development of a Scandinavia and France, where short interjections, ja, oui, non,
descriptive terminology has an impressive history, with the irst can be uttered on ingressive voice (usually with some breathi-
surviving instance being the Adhyy of the Sanskrit gram- ness), but although some sociolinguistic or pragmatic contrast
marian, Pini (ca. 500 b.p.e.), who gave an articulatory account may be associated with this trait, no language documented so
of the relatively large sound inventory of Sanskrit. Other notable far uses pulmonic ingressive initiation to make lexical contrasts.
achievements in the characterization of speech sounds were Ingressive phonation may also be encountered as a (not very
given by many Greek grammarians, notably Dionysius hrax efective) vocal disguise, and it is universally encountered as a
(irst cent. b.p.e.); the Arab and Persian grammarians al Khalil kind of coda to very young babies cries where the vocal cords are
Ibn Ahmad and Sbawaihi of the eighth century, who described still approximated but the respiration has shifted from expiratory
the Arabic of their times; the First Grammarian of Iceland to inspiratory.
(twelfth cent.); and the work commissioned by and credited to If the vocal cords are tightly closed and the larynx as a whole
the Korean King Sejong (ifteenth cent.), which provided not only is raised, acting like a piston, while there is a complete closure in
an articulatory description of Korean as spoken then but also a the oral cavity (and with the velum raised), a positive pressure
transcription, now the oicial orthography for Korean, hangul, may be developed. Such sounds, glottalic egressives or ejectives,
which is partially iconic in its representation of how the sounds are not uncommon, being found in various African languages
are produced. (from diferent language families), in some languages of South
In Europe, the Baroque and modern eras saw dozens of and Central America and the Paciic Northwest (in the Americas),
proposals for the description of speech sounds, for example, and in the Caucasus. For example, Quechua bread is [tanta].
by John Wilkins, Johan Conrad Amman, William Holder, Glottalic ingressives or implosives involve the larynx most
Francis Lodwick, Alexander J. Ellis, Robert Nares, Ernst Brcke, commonly when the vocal cords are in voicing position being
Richard Lepsius, Alexander Melville Bell, Henry Sweet, and Otto lowered during the stop closure, thus creating a negative pres-
Jespersen. Although there is still some variation in the descrip- sure in the oral cavity or at least moderating the buildup of pos-
tive terms, works such as Catford (1977) and Maddieson (1984) itive pressure. Historically, such stops often come from voiced,
have helped to standardize the terminology. especially geminated (long), stops, for example, Sindhi /pauni/
lotus plant fruit < Prakrit *paba. Enlarging the oral cavity helps
The Basics to maintain a positive pressure drop across the glottis, which
Speech articulations enable communication between speakers favors voicing. Although ejective fricatives are attested, there are
and listeners because they create sound; it is the sound trans- no implosive fricatives probably because the noise of a fricative
mitted to listeners and the perception of these sounds that are is generated when the air jet expands after leaving the narrow
the ultimate goal in speaking. Descriptions of articulation are constriction. Such expansion would occur inside the vocal tract
intended to capture the gestures that create these distinctive ele- if made implosively, and the sound would be attenuated by the
ments in the speech code. oral constriction.
108
Articulatory Phonetics
If an air pocket is trapped between the tongue and palate or intense frication noise, because with no resistance to the airlow
the tongue dorsum and lips, and the tongue is lowered, a large at the glottis, the velocity of airlow will be greater at the oral con-
negative pressure can be generated, which, when released, can striction, and that also afects the degree and loudness from the
create quite a loud sound. Such sounds, so-called clicks have air turbulence.
velaric ingressive initiation. hey are common in cultures all over Subcategories of sonorants include laterals, where the con-
the world as interjections, signals to animals, and so on. However, striction is on one side of the palate, the other being open, for
they are used as speech sounds to diferentiate words only in a example, the medial geminate (or long) alveolar lateral in Hindi
few languages of southern and East Africa. hey are very com- palla loose end of a sari used as a head covering [pla]. Nasals
mon in the Khoisan languages and a few neighboring Bantu lan- are consonants made with a complete closure in the oral cav-
guages. For example, Khoi one is [|ui] where the [|] is a dental ity (at any place farther forward of the uvular region) but with a
africated click (the sound often symbolized in Western orthog- lowered velum, for example, Tswana [ku] sheep with an ini-
raphies as tsk or tut). In the Khoisan languages, clicks can be tial velar nasal. Glides and approximants have nonlateral oral
freely combined with pulmonic and/or glottalic egressives either constrictions that are not suicient to generate turbulence, for
simultaneously or in clusters. Velaric egressives, where a posi- example, the labial-velar glide at the beginning of the English
tive pressure is created by upward and forward movement of the word [wd]. Vowels are considered to have the least constriction
tongue, are not found in any languages lexicon but are used in (descriptive terms follow).
some cultures as a kind of exaggerated spitting sound, where the
positive pressure creates a brief bilabial trill. Place
In general, after the mechanism of initiation is speciied for he primary places of articulation of speech sounds, proceeding
speech sounds, there are three main categories of terms to fur- from the farthest back place to the farthest forward: glottal, pha-
ther characterize them: place, manner, qualiiers. For example, ryngeal, uvular, velar, palatal, alveolar, dental, and labial. Some
the Russian [bratj] to take has in word-inal position a voiceless of these places have already been illustrated. Finer place distinc-
palatalized dental stop. he manner is stop, the place is den- tions can easily be made if necessary by appending the preixes
tal, and voiceless and palatalized are qualiiers. pre- and post-, and multiple simultaneous constrictions can be
diferentiated by concatenating these terms as was done with the
Manners labial-velar glide [w]. In most cases, these anatomical landmarks
here are two fundamental categories of manners, obstruent and on the upper side of the vocal tract are suicient; if necessary, an
sonorant, each with subcategories. An obstruent is a sound that indication of the lower (movable) articulator can be speciied, for
substantially impedes the low of air in the vocal tract to a degree example, the voiced labial-dental fricative [v] as in French voir
that turbulent noise is generated either as continuous frication or to see [vw] (as opposed to, say, the voiced bilabial fricative
as a noise burst. Obstruents may be stops or fricatives. (Ejectives, [] in Spanish cerveza beer [sesa]).
implosives, and clicks are inherent stops.) Sonorants, which do
not impede airlow, are subdivided generally into laterals, glides, State of the Glottis
approximants, nasals, and vowels. In most cases, specifying whether the vocal cords are apart and
Stops present a complete blockage of the airlow, for exam- not vibrating (voiceless), lightly approximated and vibrating
ple, the glottal stop in the name of the Hawaiian island Oahu (voiced), or tightly pressed together (glottal stop) is suicient.
[oahu]. A special subclass of stops are africates, which are stops However, voicing itself occasionally needs to be further diferen-
with a fricative release, as in the initial and inal consonants of tiated as breathy (a more lax type of voicing lacking energy in the
English judge []. Another subclass, often categorized in higher harmonics), tense (rich in higher harmonics), or creaky
other ways, is comprised of trills, for example, the initial sound (irregular staccato type of phonation, also with much energy in
of Spanish roja red [roxa]. the higher frequencies, though since it is irregular, one cannot
Fricatives present a partial blockage, but with noise gener- clearly identify harmonics as such). Many of the Indic languages
ated due to air being forced through a relative narrow constric- employ a distinctive breathy voice associated with voiced stops,
tion, for example, the velar fricative in Dutch groot large [xot]. for example, Hindi bhsh language [b::], and many lan-
Fricatives may be further divided into sibilants (s-like fricatives) guages and many speakers voiced phonation changes to creaky
made by an apical constriction on or near the alveolar ridge. he at a point of low F0 in intonation. (Low F0 is a low rate of vibra-
essential characteristic of this class, as opposed to other nonsibi- tion of the vocal cords due to lesser tension giving rise to low
lant fricatives, is relatively loud high-frequency noise that exploits pitch or note of the voice.) Creaky voice is also a common vari-
the small downstream resonating cavity, for example, English ant of glottal stop.
schist a category of rock [st] which contains two diferent
sibilant fricatives. Nonsibilant fricatives either have no down- Vowels
stream resonating cavity (e.g., the bilabial fricative in Japanese he descriptors for vowels deviate from those for consonants.
Fuji name of the famous mountain [u i]) or, like the velar An imaginary quadrilateral space in the mouth (seen sagittally)
fricative, are made further upstream of the alveolar region and so is posited, and vowels are said to have the high point of tongue
have a longer downstream resonating cavity and, thus, lower fre- at regions in this space whose dimensions vertically are high
quencies. Presence or absence of voicing also afects fricatives mid low and horizontally, front central back. (hese may
loudness: Even if paired voiced and voiceless fricatives have the also have further qualiiers. In French the high front unrounded
same degree of constriction, voiceless fricatives will have more vowel contrasts with a high front rounded vowel [i] and with a
109
Articulatory Phonetics Artiicial Languages
high back rounded vowel [u], e.g., dit (s/he) said [di] vs. du of suix), dempster judge < deem + ster, hompson > hom + son,
the [dy] vs. doux soft [du].) Although ostensibly referring to youngster [jkst] < young [j] + ster. One has to ask where the
anatomical features, it is now accepted that these descriptors medial stop came from in these nasal + fricative clusters, neither
actually correspond to acoustic-auditory features of vowels: he element of which is a stop. he answer emerges when one con-
height dimension correlates inversely with their irst formant, siders that these speech sounds are opposite in the state of the
the front dimension correlates directly with their second for- oral and velic exit valves. he nasal has all oral exit valves closed
mant, and unrounded-rounded correlates roughly with their and the velic valve open whereas the fricative has an oral valve
third formant. It is still technically possible to apply the tradi- open, and the nasal valve closed. If in the transition between the
tional anatomical-physiological descriptors to vowels, in which nasal and fricative the velic valve should close prematurely, then
case [i] would be a close palatal vowel with spread lips, and [u] a all exit valves will be closed and thus a brief epiphenomenal stop
close velar vowel with lip rounding, and [] a pharyngeal vowel will emerge. (For more examples, see Ohala 1997.)
with a widely open mouth shape. Other vowels would just be
John Ohala
variants of these with either less constriction or intermediate
places of constriction. here is merit in applying the anatomical-
physiological labels, for example, to explain the Danish dialectal
variant pronunciations [bi] ~ [bi] bee. he latter variant with Browman, C. P., and L. Goldstein. 1986. Towards an articulatory phonol-
the voiceless palatal fricative can arise simply from the vowel ter- ogy. In Phonology Yearbook 3, ed. C. Ewan and J. Anderson, 21952.
minus being devoiced (since, it will be recalled, the degree of tur-
Catford, J. C. 1977. Fundamental Problems in Phonetics. Bloomington,
bulence is determined not only by the degree of closure but also
IN: Indiana University Press.
by the velocity of the airlow, which, in a voiceless vowel, is high
Goldstein, L., and C. P. Browman. 1986. Representation of voicing con-
to generate turbulence at the point of constriction). trasts using articulatory gestures. Journal of Phonetics 14: 33942.
Hardcastle, W. J., and J. Laver. 1999. he Handbook of Phonetic Sciences.
Secondary Articulations or Modiications Oxford: Blackwells.
here are dozens of ways a speech sound can be qualiied. Hufman, M. K. and R. Krakow, eds. 1993. Nasals, Nasalization and the
Typically, these are additional modiications or lesser constric- Velum: Phonetics and Phonology, Vol. 5. San Diego, CA: Academic
tions that can be done simultaneously with the primary con- Press.
striction, or are in such close temporal proximity to or invariably Ladefoged, P. 1964. A Phonetic Study of West African Languages: An
linked to it that they are considered inherent to it. he label Auditory-Instrumental Survey. Cambridge: Cambridge University
Press.
breathy voiced is an example. Some additional examples (where
Ladefoged, P., and I. Maddieson. 1995. he Sounds of the Worlds
the italicized term is the qualiier): voiceless aspirated velar stop,
Languages. Oxford: Blackwell.
as in English key [khi]; the English phoneme //, often phoneti- MacNeilage, P. F., ed. 1983. he Production of Speech. New York: Springer-
cally a voiceless post-alveolar labialized fricative as in ship [wip]; Verlag.
the nasalized mid-front vowel as in French faim hunger [f]. Maddieson, I. 1984. Patterns of Sounds. Cambridge: Cambridge University
Press.
Prosody, Tone, Intonation Ohala, J. J. 1990. Respiratory activity in speech. In Speech Production and
he terminology describing distinctive uses of voice pitch and Speech Modelling, ed. W. J. Hardcastle A. Marchal, 2353. Dordrecht,
relative timing of sounds (and, perhaps, diferent voice qualities) the Netherlands: Kluwer.
is still relatively nonstandardized except in the case of tones. he . 1997. Emergent stops. Proc. 4th Seoul International Conference
International Phonetic Associations transcription recognizes on Linguistics [SICOL] 1115 Aug 1997: 8491.
Rothenberg, M. 1968. he Breath-Stream Dynamics of Simple-Released-
a variety of possible tone shapes, for example, hai [kha\|] (with
Plosive Production. Basel: Karger.
falling tone) servant versus [kha/|] (with rising tone) leg. Here,
Silverman, D. 2006. A Critical Introduction to Phonology: Of Sound, Mind,
the vertical line is supposed to represent the range of voice pitch and Body. London and New York: Continuum International Publishing
characteristic of the speaker, the sentence context, and so on, Group.
and the attached line the range and direction of the distinctive Sol, M-J. 2002. Aerodynamic characteristics of trills and phonological
pitch modulation. patterning. Journal of Phonetics 30: 65588.
Beyond Taxonomy
ARTIFICIAL LANGUAGES
he conventional descriptions of speech just reviewed form the
basis for scientiic investigations of considerable sophistication An artiicial language can be deined as a language, or language-
and with applications in ields as diverse as medicine (especially like system, that has not evolved in the usual way that natural
speech pathology), man-machine communication, irst (and languages such as English have; that is, its creation is due to
subsequent) language learning, and phonology. hese investiga- conscious human action. However, this deinition leaves open
tions involve study of more than just the anatomical-physiological some questions. For one thing, what do we mean by language
character of speech sounds, but also, as was hinted at in the or language-like system? Among the systems of communica-
preceding discussion, speech aerodynamics, speech acoustics, tion that could be, and have been, called artiicial languages
speech perception, and neurophonetics. Space allows just one are systems of logic, for example, predicate calculus, and com-
example in the area of phonology: Medial stops emerge seemingly puter languages, such as BASIC. However, the functions of these
out of nowhere in words such as glimpse < gleam + s (nominalizing languages are diferent from the function of natural languages,
110
Artiicial Languages Aspect
which is communication among humans. I, therefore, focus on elements from any natural languages, on the surface they may
artiicial languages that have this latter function, for example, seem rather strange, as shown by the following examples from the
Esperanto. Under the heading of artiicial languages, one might language Oz (which, in spite of its name, was a serious project):
also include languages that have been made up in connection
(1) ap if-blEn-vOs
with novels, ilms, television programs, and so on, for example,
he HABITUAL-seldom-study
Klingon (ictional or imaginary languages), or as part of some
he seldom studies(Elam 1932, 20)
other imaginary world, or those that have been created for the
enjoyment of their designer (personal languages). (2) ep ip-Qks ap
Some languages (philosophical languages) were designed I PAST-see him
to relect the real world better than natural languages. Some of I saw him (ibid.)
the earliest known ideas on artiicial languages, from the sev-
However, one could assert that since even a priori languages
enteenth century, involve this type. he terms constructed lan-
are human creations, they cannot be that diferent from natural
guage (or conlang) and planned language are roughly equivalent
languages.
to artiicial language (although one could point out that some
A posteriori languages can draw from several languages or
natural languages have undergone a degree of planning), while
from just one. In the latter case, they are usually or always sim-
(international) auxiliary language covers only those languages
pliications of the language. here have been many such simpli-
intended for international communication (of course, some nat-
ications of Latin, some of English, and some of other languages.
ural languages are also used for this); many, if not most, artiicial
Following is a Latin sentence and its equivalent in SPL (or SIMP-
languages have been created for this purpose.
LATINA), an artiicial language created from Latin.
Another question concerns our notions of artiicial and natu-
ral. On the one hand, many (arguably all) natural languages have (3) Nuntium audiverat antequam domum venit.
been subjected to some human manipulation. Consider, for exam- Fin audit nntium ntequam in venit in domus.
ple, the long line of English prescriptivists who have tried to elimi- He had heard the news before he came home. (Dominicus
nate some constructions of the language, or organizations such as 1982, 21)
the French Academy, which has attempted to keep some English
One might be surprised to learn that there are a thousand
words out of French. Although many of these manipulations have
or more artiicial languages, even excluding ictional ones (but
not completely succeeded, they have had some efect, and there-
including languages that were not fully elaborated). It might also
fore one could argue that English and French are partly artiicial.
be unexpected that people have continued to devise new artii-
On the other hand, many consciously created languages were
cial languages for international communication, given how many
built from elements of one or several natural languages and could
have already been proposed and not achieved their goal. he
thus be considered not entirely artiicial. herefore, the boundary
existence of the Internet may have served as an impetus, since it
between natural and artiicial languages is not entirely clear.
is now easy for language creators to present their languages to a
In fact, a common classiication of artiicial languages is in
wide audience. he number of artiicial languages will probably
terms of whether they are based on natural languages: a poste-
keep increasing, though with none of them achieving the status
riori languages are, while a priori languages are not (the philo-
of a universal second language.
sophical languages belonging to the second group). hat is, a
priori languages are (supposedly) built from scratch, not tak- Alan Reed Libert
ing anything from natural languages. his is a simpliication, as
few, if any, languages are entirely a priori; many contain both a WORKS CITED AND SUGGESTIONS FOR FURTHER READING
posteriori and a priori components. herefore, the distinction
Albani, Paolo, and Berlinghiero Buonarroti. 1994. Aga Magra Difra.
should, rather, be seen as a spectrum, with languages at diferent
Bologna: Zanichelli. An encyclopaedia with many entries for artiicial
points having difering ratios of a priori and a posteriori compo-
languages, including ictional languages, and their creators.
nents. Artiicial languages consisting of substantial proportions Dominicus, Richardius. 1982. SPL. Wisconsin: Dominicus Publishing
of both types are called mixed languages. House.
Esperanto stands far above other artiicial languages in terms Elam, Charles Milton. 1932. he Case for an A Priori Language. Cincinnati,
of success it has vastly more speakers than any others (and OH: he Open Sesame Press.
even some native speakers). It has been claimed to have more Large, Andrew. 1985. he Artiicial Language Movement. Oxford: Basil
than a million speakers, though some would disagree with such Blackwell.
a large number, and of course, the question hinges on how one Pei, Mario. 1968. One Language for the World. New York: Biblo and
deines a speaker. Only a relatively small number of artiicial lan- Tannen.
guages have achieved much of a community of speakers. hese
include Volapk, Interlingua, and Ido, the latter being a modiied
ASPECT
Esperanto. Many artiicial languages were not used by anyone
other than their designer and perhaps several other people. In Situations unfold over time. When we talk about them, we often
fact, a large number of artiicial languages were never fully devel- specify how they unfold over time (or not). here are many ways
oped, with only incomplete descriptions having been published. in which language conveys such temporal information. While
Let us now see some examples of sentences in artiicial lan- tense speciies location of an event in relation to other points
guages. Because the a priori languages do not (intentionally) use in time (e.g., past, present, future, pluperfect), aspect speciies
111
Aspect
internal temporal structure of a situation (e.g., whether it is as French imparfait) and atelic verbs (states and activities), and
ongoing or completed). his is important information to convey between progressive (i.e., dynamic imperfective) marking and
for our linguistic communication to be successful, and many lan- activity verbs (see Li and Shirai 2000 for a review). Psychologists
guages convey it by various means lexical, grammatical, and/ and linguists alike have tried to explain this observation. One
or pragmatic. English grammatically marks tense (-ed, -s, will), important proposal relies on innateness (the language biopro-
while Chinese does not, relying instead on lexical and pragmatic gram hypothesis, Bickerton 1981), while an alternative proposal
means. English also grammatically marks aspect (progressive is based on input frequency (Shirai and Andersen 1995).
be V-ing), while Hebrew does not. he notion of compatibility is crucial when we discuss the
Grammatical marking of aspect, often encoded in auxilia- interaction of lexical and grammatical aspect since some combi-
ries and inlections, is known as grammatical aspect, or view- nations are more natural, prototypical, and frequent. Telic verbs
point aspect. It is called viewpoint aspect because it signiies are more compatible with perfective aspect, while activities
the speakers viewpoint. When one chooses to say He ran a are most naturally associated with progressive marking. his is
mile, one is viewing the situation from outside, disregarding relected in frequency distribution cross-linguistically (Andersen
its internal structure (perfective aspect), while if one says He and Shirai 1996). For example, about 60 percent of past tense
was running a mile, the beginning and end of this situation markers in child-directed speech in English were attached to
are disregarded and one is focusing on the internal structure of achievement verbs, while almost 95 percent of past tense forms
this situation (imperfective aspect) (Comrie 1976; Smith 1997). in childrens speech were used with achievement verbs (e.g.,
he former is often used to push the narrative storyline forward broke, dropped) when children started using them (Shirai and
(foreground), while the latter is associated with background Andersen 1995).
information (Hopper 1979). his frequency efect is not yet well recognized in the area of
Equally important in conveying aspectual information is lex- language processing. Carol J. Madden and Rolf A. Zwaan (2003)
ical aspect also known as inherent (lexical) aspect, situation and Todd Feretti, Marta Kutas, and Ken McRae (2007) found
aspect (or situation type), aktionsart, event type, and so on. his the strong efect of grammatical aspect in their experiments on
is deined by the temporal semantic characteristic of the verb aspectual processing, but they did not manipulate the efect
(and its associated elements) that refers to a particular situation. of lexical aspect. Although Madden and Zwaan (2003) found a
Although there are numerous proposals, the most well known is facilitating efect of perfective aspect on processing but not of
the classiication proposed by Zeno Vendler (1957): imperfective aspect, their experiments used only accomplish-
ment verbs, which are telic and more compatible with perfective
Achievement: that which takes place instantaneously, and is
aspect (i.e., past tense in English). Foong Ha Yap and colleagues
reducible to a single point in time (e.g., recognize, die, reach
(2009) replicated facilitating efects of perfective aspect with
the summit)
accomplishments and, in addition, of imperfective (progressive)
Accomplishment: that which has dynamic duration, but has a aspect with activities in Cantonese. hus, the interaction of lexi-
single clear inherent endpoint (e.g., run a mile, make a chair, cal and grammatical aspect is pervasive and cannot be ignored
walk to the store) in any research involving aspectual phenomena.
Activity: that which has dynamic duration, but with an arbi- Yasuhiro Shirai
trary endpoint, and is homogeneous in its structure (e.g., run,
sing, play, dance) WORKS CITED AND SUGGESTIONS FOR FURTHER READING
State: that which has no dynamics, and continues without Andersen, Roger W., and Yasuhiro Shirai. 1996. Primacy of aspect in
additional efort or energy being applied (e.g., see, love, hate, irst and second language acquisition: he Pidgin/Creole connection.
want) In Handbook of Second Language Acquisition, ed. W. Ritchie and T.
Bhatia, 52770. San Diego, CA: Academic Press.
Lexical aspect has proved to be important in linguistic analy-
Bickerton, Derek. 1981. Roots of Language. Ann Arbor, MI: Karoma.
sis, acquisition, and processing of aspect. In linguistic analysis, Comrie, Bernard. 1976. Aspect. Cambridge: Cambridge University Press.
Carlota S. Smith (1997) proposed the two-component theory, a Ferretti, Todd R., Marta Kutas, and Ken McRae. 2007. Verb aspect
system in which the aspectual meaning of a sentence is deter- and the activation of event knowledge. Journal of Experimental
mined by the interaction between lexical aspect and grammati- Psychology: Learning, Memory, and Cognition 33.1: 18296.
cal aspect. For example, imperfective aspect (e.g., progressive in Hopper, Paul J. 1979. Aspect and foregrounding in discourse. In Syntax
English) takes the internal view, and, therefore, it is compatible and Semantics. Vol. 12: Discourse and Syntax. ed. T. Givon, 21341.
with durative predicates of activity and accomplishment and New York: Academic Press.
yields progressive meaning. In contrast, achievement, since it is Li, Ping, and Yasuhiro Shirai. 2000. he Acquisition of Lexical and
Grammatical Aspect. Berlin: Mouton de Gruyter.
nondurative, is not so compatible with imperfective aspect, and
Madden, Carol. J., and Rolf A. Zwaan. 2003. How does verb aspect con-
such pairing is often anomalous (e.g., *He is noticing the error) or
strain event representation? Memory & Cognition 31: 66372.
results in preliminary stages meaning (e.g., He is dying).
Shirai, Yasuhiro, and Roger W. Andersen. 1995. he acquisition of tense-
In acquisition, this interaction of lexical and grammatical aspect morphology: A prototype account. Language 71: 74362.
aspect has been observed since the 1970s. Cross-linguistically, Smith, Carlota S. 1997. he Parameter of Aspect. 2d ed. Dordrecht, the
when children acquire (perfective) past tense marking, they Netherlands: Kluwer.
show strong association between telic verbs (achievements and Vendler, Zeno. 1957. Verbs and times. Philosophical Review
accomplishments), between general imperfective marking (such 66: 14360.
112
Auditory Processing
Yap, Foong Ha, Patrick Chun Kau Chu, Emily Sze Man Yiu, Stella Fay such as syllables, words, and formulaic chunks, inluencing their
Wong, Stella Wing Man Kwan, Stephen Matthews, Li Hai Tan, Ping Li, decisions as to what has been heard. Decoding is also assisted
and Yasuhiro Shirai. 2009. Aspectual asymmetries in the mental rep- by context, in the form of world knowledge, knowledge of the
resentation of events: Role of lexical and grammatical aspect. Memory speaker, and recall of what the speaker has said so far.
& Cognition 37: 58795.
here has been much discussion as to whether these pieces of
information combine in the mind of the listener during decoding
AUDITORY PROCESSING or whether they are handled separately. he argument behind
the irst (interactive) view is that all the evidence can be con-
Auditory processing refers to the cognitive processes that enable sidered simultaneously; the argument behind the second view
a listener to extract a message from the raw material of a speech (modularity) is that the processor operates more rapidly if it
signal. he study of auditory processing draws upon a range of employs localized criteria speciic to the phoneme, the word, or
sources within the linguistic sciences: most notably cognitive the context.
psychology, discourse analysis, phonetics, phonology Decoding can be discussed at three levels.
and neurolinguistics (see brain and language). It is distinct he irst is phoneme recognition (see speech perception).
from theories of empathetic listening (what makes a good lis- Translating acoustic evidence into the sounds of the target lan-
tener) in areas such as counseling. guage does not involve simple one-to-one matching. here
It was not until the 1960s that a signiicant body of listen- is, irstly, an issue of noninvariance: researchers have not suc-
ing research developed with the advent of more sophisticated ceeded in inding clusters of cues that uniquely identify individ-
recording equipment and the increased availability of spec- ual phonemes. Indeed, they have discovered that the same set of
trograms to display the physical characteristics of the speech cues may be interpreted diferently according to the phonemes
signal (see acoustic phonetics). he many advances in our that precede and follow them. here is also an issue of nonline-
understanding of the skill since then include early work on arity: he phonemes within a word are not clearly bounded units
phoneme perception by the Haskins Laboratories; the recog- but blend into each other in a process known as co-articulation.
nition that processing occurs on line rather than waiting for A further complication is speaker variation: A listener has to
an utterance to be completed; and insights into how listeners adjust to diferences between individual speakers in pitch of
identify word boundaries in connected speech. Evidence of voice, accent, speech rate, and so on.
the extent to which listeners have to build a message for them- One solution holds that listeners employ a more reliable
selves on the basis of inference has resulted in a sharp move unit of analysis than the phoneme. hey might map direct from
away from a view of listening as a passive skill and the recog- acoustic stimuli to words stored in their minds, or they might use
nition that a listener actively engages in a process of meaning the syllable as their principal perceptual unit. Another solution
construction. views phoneme recognition as the outcome of cue trading, where
Auditory processing falls into two closely linked operations: the listener weighs competing evidence until a particular candi-
Decoding, where acoustic stimuli are translated into linguistic date emerges as the most likely.
units; hese accounts tend to assume that we have in our minds
a set of idealized templates for the sounds of a language and
Meaning construction, which embellishes the bare meaning
match imperfect real-life examples to them by normalization
of the utterance by reference to knowledge sources outside
by editing out features that are nonstandard or irrelevant. An
the signal. It also requires listener decisions as to the impor-
alternative approach shifts the focus away from processing and
tance of what has been said and how it is linked to the dis-
onto how the listener represents the sounds. A variant of the
course that preceded it.
template notion suggests that a phoneme may be represented
he listener thus draws upon four information sources. he irst in an underspeciied way that stores only a few features essen-
is perceptual, based on the signal reaching the listeners ear. he tial to its recognition. A second possibility is that phonemes
second is linguistic, consisting of the listeners stored knowledge are stored as prototypes with a range of permissible varia-
of the phonology, lexis, and syntax of the language being tion associated with each. But there is now increasing support
spoken. he third is external: drawing upon the listeners knowl- for a view that we do not construct a central representation of a
edge of the world, the speaker, the topic, and the type of situa- phoneme but instead store multiple examples of the words we
tion. A inal component is the listeners ongoing model of what hear, uttered in their variant forms. his exemplar account
has been said so far. accords with evidence that the human mind is better at storing
massive amounts of information than was previously supposed.
Decoding It explains how we are able to adjust to unfamiliar accents in
Decoding is principally a matching operation in which evidence our native language and to recognize the same word uttered in a
from the signal is mapped onto stored representations in the range of diferent voices.
listeners mind of the phonemes, words, and recurrent chunks he second level is word recognition. he listening pro-
of a language (see mapping). he process was once represented cess takes place on line, with a listener able to shadow (repeat)
as taking place in a linear and bottom-up way, with phonemes what a speaker says at a delay of about a quarter of a second.
shaped into syllables, syllables into words, and words into Cohort theory (Marslen-Wilson 1987) postulates that a listener
clauses. In fact, listeners appear to draw upon several levels of retrieves a bank of possible word matches when the initial pho-
representation at once, with their knowledge of higher-level units, nemes of a word are uttered, then gradually narrows them down
113
Auditory Processing
as more of the word is heard. he correct item is identiied when Decoding in ones irst language (L1) is highly automatic
the words uniqueness point is reached and the cohort is reduced which is why decoding skills in a second language are diicult
to one possible match. to acquire. Studying vocabulary and syntax is not suicient; a lis-
However, many words do not have a uniqueness point (the tener needs to recognize the relevant linguistic forms as they occur
sequence man might be a complete word or the irst syllable of in connected speech. he listener is likely to perceive the sounds
manner or manager), and there are no consistent gaps between of the second language by reference to the phoneme categories of
words in connected speech to mark where boundaries fall. the irst and may also transfer processing routines, such as the L1s
Locating boundaries (lexical segmentation) is unproblematic lexical segmentation strategy or the relative importance it accords
when one is listening to languages that bear ixed stress on the to word order, inlection, and animacy. Second language lis-
irst, penultimate, or last syllable of the word but becomes an teners often ind themselves heavily dependent upon contextual
issue when processing languages with variable stress. Research cues in order to compensate strategically for failures of decoding.
suggests that listeners exploit certain prosodic features of these
languages in order to establish the most likely points for words to Meaning Construction
begin or end. In English, they take advantage of the fact that the he outcome of decoding is an abstract proposition, which
majority of content words in running speech are monosyllabic or represents the literal meaning of the utterance independently of
begin with a strong syllable (Cutler 1990). the context. A listener has to build a more complex meaning rep-
A further problem for lexical recognition is that many words resentation (or mental model), which
in connected speech (particularly function words) occur in
(a) adds to and contextualizes the proposition;
a reduced form. hey might be brief, of low saliency, and very
diferent from their citation forms. Using the gating method, (b) links it conceptually to what has gone before. his opera-
which presents connected speech in gradually increasing seg- tion takes place locally as well as at a discourse level. Listeners
ments, researchers have demonstrated that word identiication need to make local connections between ideas, associating
in listening is sometimes a retrospective process, with listeners pronouns with their referents and recognizing logical connec-
unable to identify a word correctly and conidently until two or tives (in addition, however). But they also have to carry forward
more syllables after its ofset. a developing representation of the whole discourse so far.
here have also been attempts to model lexical recognition Meaning construction embraces several diferent processes.
using connectionist computer programs that analyze spoken Many of them are more cognitively demanding in listening than
input in brief time slices rather than syllables. Early matches in reading because the listener is entirely dependent upon the
are continually revised as evidence accumulates across slices mental representation that he/she has built up and cannot look
(see connectionist models, language structure, and back to check understanding.
representation). he most well known of these programs is
TRACE (McClelland and Elman 1986). ENRICHMENT. he listener adds depth and relevance to the
he speed with which a word is identiied by a listener is proposition by drawing upon external information: knowledge
subject to variation. High-frequency words are more rapidly of the world, the topic, the speaker, and the current situation.
retrieved than low-frequency ones and are said to be more eas- Understanding is also deepened by relating the proposition to
ily activated. Recognition is also faster when the listener has the current topic and to the points made so far by the speaker.
recently heard a word that is closely associated with the target.
hus, encountering a word like doctor facilitates (or primes; see INFERENCE. Listeners supply connections that the speaker does
priming, semantic ) later recognition of nurse, patient, and not make explicitly. hey might employ scripts to provide
hospital. his process, known as spreading activation, default components for common activities. If a speaker men-
is highly automatic and is distinct from the normal efects of tions going to a restaurant, the listener takes for granted a waiter,
context. a menu, and a conventional procedure for ordering.
Syntactic parsing is the third level. hough speech is received
SELECTION. Listeners do not simply record facts; they select
linearly, syllable by syllable, a listener needs to build larger-scale
some, they omit some, and they store some in reduced form. he
syntactic structures from it. Listeners appear to retain a verbatim
same utterance may result in diferently constituted messages
record of the words heard until a major syntactic boundary. A
in the minds of diferent listeners. One important factor is the
wrap-up process then turns the words into an abstract propo-
listeners perception of the intentions of the speaker. Another is
sition, and they cease to be available to report. Intonation con-
the listeners own purpose for listening. A further consideration
tours frequently coincide with syntactic units and assist listeners
is redundancy: Spoken discourse is often repetitive, with the
in locating clause and sentence boundaries.
speaker reiterating, rephrasing, or revisiting information that has
Where there is ambiguity early in an utterance, a listener
already been expressed.
has to carry forward parallel hypotheses about how the utter-
ance will end, with one prioritized and the others held in reserve. INTEGRATION. Listeners integrate the incoming information into
Researchers employ garden path sentences (example: he law- what has been heard so far. Heed is paid to whether it extends an
yer questioned by the judge apologized) to establish the crite- established topic or whether it initiates a new one.
ria that inluence the preferred interpretation. Listeners appear
to be swayed by multiple factors, including syntactic simplicity, SELF-MONITORING. Listeners check to see whether incoming
semantic probability, and argument structure. information is consistent with the meaning representation built
114
Autism and Language
up so far. If it is not, one or the other needs to be adjusted, or a difers remarkably from that of age-matched peers. Speech char-
comprehension check needs to be made. acteristics typical of autism include pronoun reversal (referring
to self as you); unvaried or atypical intonation; neologisms
DISCOURSE STRUCTURE. Listeners impose an argument struc- (Volden and Lord 1991); the use of stereotyped, repetitive, and
ture upon the meaning representation, with major points dis- idiosyncratic language; and echolalia. Barry Prizant and Judith
tinguished from minor. Here, they may be assisted by analogy Duchan (1981, 246) suggest that echolalia may serve important
with previous speech events. communicative and cognitive functions, such as turn-taking for
John Field people with autism.
Signiicantly, social communication in ASD often fails even
WORKS CITED AND SUGGESTIONS FOR FURTHER READING in the presence of apparently intact grammatical skills. his
can be seen in Asperger syndrome, where language skills can
Brown, Gillian. 1995. Speakers, Listeners and Communication. be advanced, vocabulary extensive, and syntax formally cor-
Cambridge: Cambridge University Press. Discourse-based account of
rect, even bookish. he speech of individuals with Asperger
meaning construction in listening.
syndrome is often pedantic, exhibiting unvaried, stereotyped
Cutler, Anne. 1990. Exploiting prosodic possibilities in speech segmen-
phrases and expressions associated with contexts or registers
tation. In Cognitive Models of Speech Processing, ed. G. Altmann,
10521. Cambridge, MA: MIT Press. not presupposed by the immediate situation of talk. he speech
Garrod, Simon, and Martin Pickering, eds. 1999. Language Processing. patterns associated with ASD are part of the broader spectrum of
New York: Psychology Press. Papers on lexical, syntactic, and dis- impaired reciprocal social interaction.
course processing. Conversation may be the most diicult area of communica-
Marslen-Wilson, William. 1987. Functional parallelism in spoken word- tion for people with ASD. Conventional rules of turn-taking are
recognition. Cognition 25: 71102. often ignored. Speakers may fail to sustain conversation beyond
McClelland, J. L., and J. L. Elman. 1986. he TRACE model of speech per- yes/no answers or speak at length on circumscribed interests,
ception. Cognitive Psychology 18: 186. and they may resist attempts to shift topic. Speakers may also
Miller, Joanne L., and Peter D. Eimas, eds. 1995. Speech, Language and
fail to attend to the conversational needs of listeners and may
Communication. San Diego, CA: Academic Press. Papers by L. Nygaard
have diiculty applying contextual and cultural knowledge in
et al. and Anne Cutler review issues in perception.
Pisoni, David B., and Robert E. Remez. 2005. he Handbook of Speech
conversation. hey may thus encounter problems interpreting
Perception. Oxford: Blackwell. Papers covering most major issues in deictic references, as the following example illustrates: Speaker
decoding. 1: What did you do on the weekend? Speaker 2: What weekend?
Here, the conventional response to the question posed would be
to interpret the weekend as the one that had just passed. Such
AUTISM AND LANGUAGE problems with relevance appear to be related to the tendency
Autism is a neurodevelopmental disorder that is among the most in ASD toward an overliteral understanding of communication,
prominent of disorders afecting language. While its causes are including diiculties interpreting indirect requests and meta-
unknown, research has focused on cognitive, neurological, and phor (Happ 1993).
genetic explanations. Autism afects more than one domain of A number of cognitive theories are currently being explored
functioning, with language and communication as primary to explain the core features of ASD. Executive dysfunction is one
deicits. widely accepted cognitive explanation for some behavior dif-
Since Leo Kanner published the irst account of children with iculties in ASD. his refers to decision-making processes that
autism in 1943, widening diagnostic criteria have increased the are necessary for performing goal-directed activities, which
identiication of cases. here have also been dramatic changes in are thought to originate in the frontal lobes (Russell 1997).
classiication: Autism is no longer regarded as an isolated disor- Weak central coherence theory posits a detail-oriented process-
der but includes Asperger syndrome and atypical autism under ing style at the expense of global and contextual information
the rubric autism spectrum disorder (ASD) in order to relect and alludes to poor connectivity between brain regions (Happ
that variability in expression. Diagnoses along the spectrum and Frith 2006). Intriguingly, this information-processing style
are characterized by a common set of features: impairments in can often lead to superior performance on certain tasks, such as
social communication, restricted interests, and repetitive activi- the Embedded Figures Task (Witkin et al. 1971), underscoring
ties, with behaviors varying at diferent ages as well as diferent the fact that ASD is not merely a set of impairments but involves
levels of functioning (DSM-IV, American Psychiatric Association unique ways of processing information. he theory most fre-
1994). Autism occurs in at least 0.2 percent of the population, quently cited to explain communication diiculties in ASD is
afecting three times more males than females, while the other theory of mind (ToM) (Baron-Cohen 1995). ToM explains
disorders on the spectrum are estimated to afect another 0.4% these diiculties in terms of a cognitive mechanism underly-
(Fombonne et al. 2006). ing the ability to recognize others mental states. Many of the
he social communication problems in ASD vary widely. pragmatic impairments that are known to occur in ASD can
Parents of young children later diagnosed with ASD often be linked to a lack of intuitive mentalizing ability, for example,
observe an absence of simple communicative behaviors, such diiculties understanding pretense, irony, deception, and
as shared attention (e.g., pointing to something to share inter- nonliteral language. he ToM hypothesis does not preclude
est) and make-believe play. Although many children with autism the presence of assets and islets of ability as suggested by weak
never acquire functional speech, others develop speech that central coherence theory. Cognitive theories and hypothesized
115
Autonomy of Syntax
neural correlates with respect to facial and emotion informa- A central idea of much of structural linguistics was that the formal
tion processing in the amygdala have so far provided the most devices of language should be studied independently of their use.
compelling explanations for the communication impairments he earliest work in transformational-generative grammar took
seen in ASD. Research into genetic causes appears promis- over a version of this thesis, as a working hypothesis. I think it has
ing, since some of the strongest genetic efects in autism seem been a fruitful hypothesis. It seems that grammars contain a sub-
related to language abilities. structure of perfectly formal rules operating on phrase-markers
in narrowly circumscribed ways. Not only are these rules inde-
Jessica de Villiers
pendent of meaning or sound in their function
WORKS CITED AND SUGGESTIONS FOR FURTHER READING his passage is very pertinent in seeking to understand the
American Psychiatric Association. 1994. Diagnostic and Statistical Manual assumption and its status and origins (and cf. Chomsky 1975,
of Mental Disorders (DSM-IV). 4th ed. Washington, DC: American 1822; 1977, 3858). But let us observe some things about origins
Psychiatric Association. that it doesnt entirely convey.
Baron-Cohen, Simon. 1995. Mindblindness: An Essay on Autism and Firstly, Chomskys much of structural linguistics should
heory of Mind. Cambridge, MA: MIT Press. not be taken to include most of the work done in early structural
Fombonne, Eric, Rita Zakarian, Andrew Bennett, Linyan Meng, and linguistics in Europe, even by self-declared autonomists (see
Diane McLean-Heywood. 2006. Pervasive developmental disorders
Anderson 2005). It is notably the followers of Leonard Bloomield
in Montreal, Quebec, Canada: Prevalence and links with immuniza-
(1926) and proponents of transformational grammar who
tions. Pediatrics 118.1: 13950.
insist on the autonomy of syntax from meaning. And, even for
Frith, Uta. 2003. Autism: Explaining the Enigma. 2d ed. Oxford:
Blackwell. the post-Bloomieldians, syntax is far from autonomous from
Happ, Francesca. 1993. Communicative competence and theory of phonology.
mind in autism: A test of relevance theory. Cognition 48.2: 10119. Perhaps more signiicantly, we should fully register the extent
Happ, Francesca, and Uta Frith. 2006. he weak coherence to which the autonomy assumption is an innovation (Anderson
account: Detail-focused cognitive style in autism spectrum disorders. 2005). Gram mar or syntax before structuralism was not
Journal of Autism and Developmental Disorders 36.1: 525. conceived of as autonomous; syntactic rules and principles can
Kana, Rajesh K, Timothy A. Keller, Vladimir L. Cherkassky, Nancy J. refer to semantically and/or phonologically deined categories.
Minshew, and Marcel Adam Just. 2006. Sentence comprehension in Consider as an example of this earlier tradition Otto Jespersens
autism: hinking in pictures with decreased functional connectivity.
description of the syntax of the SPEECH-ACT category of ques-
Brain 129: 248493.
tion: [T]he formal means by which questions are expressed,
Kanner, Leo. 1943. Autistic disturbances of afective contact. Nervous
Child 2: 21750.
are (1) tone; (2) separate interrogative words ; (3) word-order
Prizant, Barry, and Judith Duchan. 1981. he functions of immediate (1924, 305). Syntax (and intonation) is expressive of meaning.
echolalia in autistic children. Journal of Speech and Hearing Disorders However, autonomy requires that elements that participate
46: 2419. in purely syntactic regularities be syntactic themselves. hus, the
Russell, James, ed. 1997. Autism as an Executive Disorder. Oxford: Oxford feature Q for interrogative clauses of Chomsky (1995, 4.5.4)
University Press. is part of syntax. It is interpretable, but its interpretation is
Volden, Joanne, and Catherine Lord. 1991. Neologisms and idiosyncratic not pertinent to syntax (see illocutionary force and sen-
language in autistic speakers. Journal of Autism and Developmental tence types). But from a traditional point of view, it is the cat-
Disorders 21.2: 10930. egorial meaning of Q that, as with other syntactic elements,
Witkin, H., P. Oltman, E. Raskin, and S. Karp. 1971. A Manual for the
drives its syntax. Its status as (prototypically) a request for infor-
Embedded Figures Test. Palo Alto, CA: Consulting Psychologists Press.
mation is what demands, for instance, the presence in the sen-
tence of an open element, marked, for example, by wh-, or by
AUTONOMY OF SYNTAX
intonation or some such indication of openness of the truth of the
Autonomy of syntax refers to what in recent times has been the sentence itself.
dominant assumption concerning the formulation of syntactic he autonomy hypothesis is falsiiable only if there is an inde-
regularities: syntax is determined independently of phonolog- pendent notion of what constitutes syntax; otherwise, any appar-
ical realization or semantic interpretation. he formal prop- ent counterexample can be relegated to interaction between
erties of syntax are manipulated purely formally. syntax and some other module (see modularity). An unfalsii-
Such an assumption is familiar to modern students of lin- able assumption of autonomy deines a research program, rather
guistics from numerous textbook presentations, such as Andrew than constituting an empirical hypothesis: It is methodological
Radfords (1988, 31): rather than ontological. he program, as well as the hypothesis,
is based on the premise that it is fruitful to operate as if syntax
autonomous syntax principle. No syntactic rule can make ref-
is autonomous, in contrast with the more traditional view that
erence to pragmatic, phonological, or semantic information.
nonreference by a syntactic regularity to interpretation is excep-
Some such assumption is already in place in Noam Chomsky tional, involving demotivation (grammaticalization) within
(1957, 17): I think that we are forced to conclude that grammar a syntax whose central concern is with the role of sound and
is autonomous and independent of meaning. And in a later structure as expressive of meaning.
espousing of the assumption, Chomsky traces the idea back to Opponents of the autonomy assumption, whatever its sta-
what he refers to as structural linguistics (1972, 119): tus, tend to interpret it in the absolute form described here (as
116
Babbling
Langacker 1987,155). Chomsky, however, envisages autonomy vocal forms include both a complete or nearly complete supra-
theses of varying degrees of strength (1977, 43), whereby syn- glottal closure and a transition to a recognizable vocalic nucleus,
tax is not necessarily exhausted by the substructure of perfectly for example, [dada], [babababa], [aaa]. Prior to that, the
formal rules (1972, 119) of his formulation. hus, the signiicant child vocalizes in more primitive ways that are not thought to be
question with regard to the autonomy thesis may not be a ques- directly related to language.
tion of yes or no, but rather of more or less, or more correctly, he lack of an adult model is a question of interpretation
where and how much (Chomsky 1977, 42). Certainly, provided since the irst word forms are highly similar in form to concur-
again that we have an independent characterization of syntax, rent babbling. In fact, there is in most cases a gradual shift from
the extent of autonomy and its accommodation are in themselves a predominance of unidentiiable vocalizations, beginning with
interesting empirical questions, with consequences for modular- the emergence of canonical babbling (at 6 to 10 months), to a
ity, universal grammar, and the autonomy of language itself. predominance of word use identiiable in the situational con-
And seeking to answer them might be more comprehensible to text (16 to 22 months). he extent to which the shift is relatively
opponents of a strong interpretation of autonomy. abrupt or gradual is highly individual.
Work within the autonomist program(s), whatever the status Finally, there are vocal forms that do not appear to be
of the assumption, has undoubtedly had important results, but based on an adult model but that are nevertheless used with
there is room for debate as to how fruitful has been the pursuit of consistent broad communicative meaning such as request,
the autonomy assumption as such. And in addition to the ques- rejection, or interest; these transitional forms or protowords
tion of how it relates to independent notions of what syntax is, a (Vihman and Miller 1988) should thus be distinguished from
major diiculty in evaluating the assumption, and its contribu- babbling, which lacks any apparent communicative goal.
tion to these results, is the changing nature of the grammatical Babbling is a largely self-directed process of exploration
enterprise(s) in which autonomy has been invoked, as well as the (Elbers 1982, 45).
varying degrees of emphasis with which it has been put forward
or denied. Brief Modern History
THE CONTINUITY ISSUE: THE RELATIONSHIP OF BABBLE TO
John M. Anderson
WORDS. Roman Jakobson ([1941] 1968) was the irst linguist to
pay serious theoretical attention to babbling if only to deny its
WORKS CITED AND SUGGESTIONS FOR FURTHER READING relevance to language learning. On the basis of the diary accounts
Anderson, John M. 2005. Structuralism and autonomy: From Saussure available to him, Jakobson developed the (discontinuity) view
to Chomsky. Historiographia Linguistica 32: 11748. that babbling was merely random sound production, express-
Bloomield, Leonard. 1926. A set of postulates for the science of lan- ing the full range of human phonetic possibility but unrelated to
guage. Language 2: 15364. the more austere or constrained repertoire of the irst words.
Chomsky, Noam. 1957. Syntactic Structures. he Hague: Mouton. Jakobson saw the latter as relecting a well-ordered universal
. 1972. Some empirical issues in the theory of transformational
scheme for the emergence of phonological oppositions, such
grammar. In Goals of Linguistic heory, ed. Stanley Peters, 63130.
that the low vowel /a/ is primary, with contrast with /i/ following,
Englewood Clifs, NJ: Prentice-Hall.
while anterior stops are the irst consonants produced (labial /b/
. 1975. he Logical Structure of Linguistic heory. New York: Plenum
Press. or dental /d/), followed by nasals and only later by other places
. 1977. Essays on Form and Interpretation. Amsterdam: North- and manners of articulation.
Holland. his impressively articulated universalist theory held sway
. 1995. he Minimalist Program. Cambridge, MA: MIT Press. for many years but was challenged in the 1970s when diary data
Jespersen, Otto. 1924. he Philosophy of Grammar. London: George Allen began to be supplemented by planned recordings of infants (typi-
and Unwin. cally in free interaction with an adult). Charles A. Ferguson and
Langacker, Ronald W. 1987. Foundations of Cognitive Grammar, Olga K. Garnica (1975) and Paul Kiparsky and Lise Menn (1977)
I: heoretical Prerequisites. Stanford, CA: Stanford University Press. were among the irst to raise objections to Jakobsons ideas of
Radford, Andrew. 1988. Transformational Grammar: A First Course.
gradual phonemic diferentiation, which disregarded the efect
of word position on order of segment acquisition and which would
be diicult to defend on the basis of the very few words produced
in a childs earliest lexical period. On the other hand, Jakobsons
B claims regarding the phones that occur in the irst words were, on
the whole, quite accurate, based as they were on decades of diary
records provided by linguists and psychologists.
What was not supported by later studies was the strong
BABBLING
separation required by Jakobsons theory between words (or
Babbling can be deined as infant vocal production that is broadly phonology) and babble (or phonetic production). Far from
adultlike in phonetic shape but that lacks any identiiable babbling being unrelated to word production, later studies have
adult model or intended meaning. he formal criterion broadly established that the irst words draw their phonetic resources
adultlike limits study to the period that follows the childs irst from the particular inventory of sound patterns developed by
temporally regular or rhythmic C-V (Consonant-Vowel)-syllable the individual child through babbling (Vihman et al. 1985; con-
production, also known as canonical babbling (Oller 1980); these tinuity has also been reported for babbled gesture and irst
117
Babbling
signs: Cheek et al. 2001). For example, a French child whose Chen and Kent (2005) report an association of labials with back
prelinguistic babbling made considerable use of liquids (mainly vowels in their extensive Mandarin data, both child and adult.
[l]) was found to develop several irst words with [l(j)], which is he balance of ambient language (perceptual) inluence versus
uncommon in early phonology: allo hello (on the telephone) universal (physiological or motoric) tendencies thus remains
[ailo], [hailo], [haljo], [alo]; lolo bottle (babytalk term) [ljoljo]; controversial. Any early C-V associations can be expected to
donne (le) give (it) [d], [dl], [ld], [heldo] (Vihman 1993). fade with lexical growth as infants follow their individual paths
toward segmental independence (freeing the content from the
BABBLING DRIFT: THE EFFECT OF PERCEPTION ON PRODUCTION. A frame).
second issue that has aroused interest for half a century is that
of possible drift in babbling toward the sounds of the native lan- VOCAL MOTOR SCHEMES AND THE EFFECT OF PRODUCTION ON
guage (Brown 1958). he issue has generated considerable heat PERCEPTION. Lorraine McCune and Marilyn M. Vihman (2001)
and is important since it concerns the extent to which infants can introduced the concept of vocal motor schemes (VMS), or gen-
be taken to be capable of translating their perceptual experience eralized action patterns that yield consistent phonetic forms (p.
of the sound patterns of the ambient language into their limited 673), identiied on the basis of repeated high-frequency produc-
production repertoire. hat is, any identiiable ambient language tion of one or more consonants over the course of several record-
inluence on prelinguistic vocalizations means that infants have ings. VMS index emergent stability in consonant production, a
both perceived the typical sounds of their language and adjusted reliable predictor of lexical advance.
their vocal production accordingly. Vihmans articulatory ilter model (1993) posits that an
Many studies, from Atkinson, MacWhinney, and Stoel infants babbling patterns will efectively highlight related forms
(1968) to Engstrand, Williams, and Lacerda (2003), have used in the input. Once one or more VMS are established, it is possible
adult perceptual judgments of recorded vocalizations to deter- to test the model by measuring infants attentional response to
mine whether infants language of exposure can be identiied, a series of short sentences featuring nonwords that do or do not
as that would provide evidence of drift; the indings remain include that childs VMS. Capitalizing on wide infant variability
inconclusive, however. Meanwhile, Bndicte de Boysson- in the timing and nature of irst vocal forms (within the limits of
Bardies and colleagues, using acoustic analyses of vowels the strong universal contraints), Rory A. DePaolis (2006) estab-
(1989) and tallies of transcribed consonant types (Boysson- lished an efect of infant production on the perception of speech.
Bardies and Vihman 1991), established signiicant prelinguis- His indings support the idea that the irst words, typically pro-
tic adult language inluence, although the mechanism for such duced in priming situations (context-limited words: McCune
an efect remained unclear. More recent work demonstrating and Vihman 2001), are based on infant experience of a rough
the extent of early implicit or distributional learning (Safran, match between vocal forms established through babbling prac-
Aslin, and Newport 1996) suggests that infants are capable of tice and words heard frequently in input speech (Vihman and
registering dominant patterns of their language within the irst Kunnari 2006). Such selection of words to attempt based on
year. hus, the mechanism needed to account for drift may be the vocal forms available for matching would account for the rel-
the efect of implicit perceptual learning on production: hose ative accuracy of irst words (Ferguson and Farwell 1975), their
vocalizations that, as the producing child perceives them, acti- constrained shapes (e.g., one or two syllables in length, with little
vate perceptual responses already familiar from input pattern- variegation across word position or syllables), and their strong
ing would strengthen perceptuomotor connections, leading to rootedness in the biomechanical basis of babbling as established
their repeated use. by Davis and MacNeilage. It also explains the diiculty of distin-
guishing words from babble (continuity) and the subtlety of the
Theoretical approaches ambient language efect on babbling and early words (drift).
FRAME AND CONTENT: THE ARTICULATORY BASIS OF BABBLING. he
Marilyn Vihman
most widely accepted current model of babbling is that of
Peter F. MacNeilage and Barbara L. Davis (1990 and Davis and
MacNeilage 1990, 1995 (for a review of competing ideas, see Chen WORKS CITED AND SUGGESTIONS FOR FURTHER READING
and Kent 2005). he articulatory basis of babbling is claimed to Atkinson, Kay, Brian MacWhinney, and Carol Stoel. 1968. An experi-
be frame dominance, meaning that the patterns produced largely ment on the recognition of babbling. Language Behavior Research
relect mandibular oscillation without independent control of Laboratory Working Paper 14. University of California, Berkeley.
lip and tongue movement. he result is strong C-V associations, Boysson-Bardies, Bndicte de, Pierre Hall, Laurent Sagart, and
such that alveolars are followed by front vowels, labials by central Catherine Durand. 1989. A crosslinguistic investigation of vowel for-
vowels (the pure frames, requiring no particular tongue setting), mants in babbling. Journal of Child Language 16: 117.
Boysson-Bardies, Bndicte de, and Marilyn M. Vihman. 1991.
and velars by back vowels. Furthermore, the model predicts that
Adaptation to language. Language 67: 297319.
changes in the mandibular cycle will result in height changes for
Brown, Roger. 1958. Words and hings. Glencoe, IL: Free Press.
vowels and manner changes for consonants in variegated bab-
Cheek, Adrianne, Kearsy Cormier, Ann Repp, and Richard P. Meier. 2001.
bling sequences. he work of this team and collaborators investi- Prelinguistic gesture predicts mastery and error in the production of
gating a range of other languages (e.g., Dutch, French, Romanian, early signs. Language 77: 292323.
Turkish: Davis et al. 2005) have largely supported the predictions Chen, L. M., and Raymond D. Kent. 2005. Consonant-vowel co-
and have demonstrated a tendency for adult languages to show occurrence patterns in Mandarin-learning infants. Journal of Child
the C-V associations as well (MacNeilage and Davis 2000) but Language 32: 50734.
118
Babbling Basal Ganglia
Davis, Barbara L., and Peter F. MacNeilage. 1990. Acquisition of correct BASAL GANGLIA
vowel production. Journal of Speech and Hearing Research 33: 1627.
. 1995. he articulatory basis of babbling. Journal of Speech and
Nothing in biology makes sense except in the light of evolution.
Hearing Research 38: 11991211.
Dobzhansky 1973
Davis, Barbara L., Sophie Kern, Dilara Koba, and Inge Zink. 2005.
Vocalizations in canonical babbling. Paper presented at Symposium, he basal ganglia are subcortical structures that can be traced
10th International Congress of the Association for the Study of Child back to frogs and are traditionally associated with motor control.
Language, Berlin. However, current studies show that complex behaviors generally
DePaolis, Rory A. 2006. he inluence of production on the perception
are regulated by neural circuits that link local processes in difer-
of speech. In Proceedings of the 30th Boston University Conference on
ent parts of the brain. In humans, the basal ganglia play a critical
Language Development, ed. D. Bamman, T. Magnitskaia, and C. Zaller,
role in neural circuits regulating cognitive processes, including
14253. Somerville, MA: Cascadilla Press.
Elbers, L. 1982. Operating principles in repetitive babbling. Cognition language, as well as motor control and emotion. he capacities
12: 4563. that diferentiate humans from other species, such as being able
Engstrand, Olle, Karen Williams, and Francisco Lacerda. 2003. Does to talk, forming and comprehending sentences that have com-
babbling sound native? Phonetica 60: 1744. plex syntax, and possessing cognitive lexibility, devolve from
Ferguson, Charles A., and Carol B. Farwell. 1975. Words and sounds in neural circuits that link activity in diferent regions of the cortex
early language acquisition. Language 51: 41939. through the basal ganglia. he neural bases of human language
Ferguson, Charles A., and Olga K. Garnica. 1975. heories of phonologi- thus involve the interplay of processes that regulate motor con-
cal development. In Foundations of Language Development, ed. Eric trol, other aspects of cognition, mood, and personality. Given the
H. Lenneberg and Elizabeth Lenneberg, 15380. New York: Academic
involvement of multiple regions of the brain that are involved in
Press.
many activities, it is diicult to see how any organ of the brain
Jakobson, Roman. [1941] 1968. Child Language, Aphasia, and
could be speciic to language and language alone, such as the
Phonological Universals. he Hague: Mouton. Eng. translation of
Kindersprache, Aphasie und allgemeine Lautgesetze, Uppsala. narrow faculty of language that, according to Marc D. Hauser,
Kiparsky, Paul, and Lise Menn. 1977. On the acquisition of phonology. Noam Chomsky, and W. T. Fitch (2002) yields the recursive prop-
In Language Learning and hought, ed. John Macnamara, 4778. New erties of syntax.
York: Academic Press. Evidence from experiments-in-nature that attempt to link
MacNeilage, Peter F., and Barbara L. Davis. 1990. Acquisition of speech speciic behavioral deicits with damage to a particular part of
production: Frames, then content. In Attention and Performance. Vol. a patients brain led to the traditional Broca-Wernicke theory.
13: Motor Representation and Control. Ed. Marc Jeannerod, 45375. his traditional theory claims that linguistic processes are local-
Hillsdale, NJ: Lawrence Erlbaum. ized in these two regions of the neocortex, the outermost part of
. 2000. On the origin of internal structure of word forms. Science
the brain. However, evidence from brain-imaging techniques,
288: 52731.
such as computer augmented tomography (CT scans), demon-
McCune, Lorraine, and Marilyn M. Vihman. 2001. Early phonetic
and lexical development. Journal of Speech, Language and Hearing
strated that aphasia, permanent loss of language, never occurs
Research 44: 67084. in the absence of subcortical damage (Stuss and Benson 1986).
Oller, D. Kimbrough. 1980. he emergence of the sounds of speech Subsequent indings from techniques such as functional mag-
in infancy. In Child Phonology. Vol. 1: Production. Ed. Grace Yeni- netic resonance imaging (fMRI see neuroimaging) that
komshian, James F. Kavanagh, and Charles A. Ferguson, 93112. New indirectly map neural activity show that although Brocas area
York: Academic Press. and Wernickes area are active when neurologically intact sub-
. 2000. he Emergence of the Speech Capacity. Mahwah, NJ: Lawrence jects perform various linguistic tasks; these areas are elements of
Erlbaum. his book provides a thorough review of babbling studies complex neural circuits that link activity in other cortical regions
conducted with hearing, hearing-impaired, premature, and low SES and subcortical structures (Kotz et al. 2003). Studies of neurode-
(socioeconomic status) infants, as well as provocative ideas about the
generative disorders, such as Parkinsons disease (Lieberman,
evolution of language based on evidence from ontogeny.
Friedman, and Feldman 1990; Grossman et al. 1992), revealed
Safran, Jenny R., Richard N. Aslin, and Elissa L. Newport. 1996. Statistical
learning by 8-month-old infants. Science 274: 19268.
the role of the basal ganglia in regulating speech and language.
Vihman, Marilyn M. 1993. Variable paths to early word production. Speech production and the comprehension of distinctions in
Journal of Phonetics 21: 6182. meaning conveyed by syntax deteriorated when basal ganglia
. 1996. Phonological Development. Oxford: Blackwell. his book function was impaired. Basal ganglia dysfunction is implicated
provides an overview of research in infant speech perception and pro- in seemingly unrelated conditions, such as obsessive-compul-
duction and their interactions, as well as of theories of phonological sive disorder, schizophrenia, Parkinsons disease, and verbal
development, early word patterning, and the nature of the transition apraxia a condition in which orofacial, laryngeal, and respira-
into language. tory control during speech is impaired (Lieberman 2006)
Vihman, Marilyn M., and Sari Kunnari. 2006. he sources of phonologi-
cal knowledge. In Recherches Linguistiques de Vincennes 35: 13364.
Vihman, Marilyn M., Marlys A. Macken, Ruth Miller, Hazel Simmons, Neural Circuits
and James Miller. 1985. From babbling to speech: A re-assessment of hese syndromes follow from the basal ganglia activity in difer-
the continuity issue. Language 61: 397445. ent neural circuits. Neural circuits that link activity in diferent
Vihman, Marilyn M., and Ruth Miller. 1988. Words and babble at the parts of the brain appear to be the bases for most, if not all, com-
threshold of lexical acquisition. In he Emergent Lexicon, ed. Michael plex mammalian behaviors. In humans, a class of neural circuits
D. Smith and John L. Locke, 15183. New York: Academic Press. that links activity in diferent regions of the cortex through the
119
Basal Ganglia
basal ganglia and other subcortical structures appears to play a cognitive as well as motor acts. Brain imaging studies reveal
key role in regulating aspects of human linguistic ability, such increased basal ganglia activity in syntactically complex sen-
as talking and comprehending the meaning of a sentence, as tences, as well as at the points where a person must switch from
well as such seemingly unrelated phenomena as decision mak- one criterion to another, as is the case in studies using tests of
ing, walking, attention, and emotional state. To understand the cognition such as the WCST (Monchi et al. 2001).
nature of neural circuits, we must take account of the distinction hus, basal ganglia dysfunction arising from neurodegen-
that exists between local operations that are carried out within erative diseases, lesions, or the efects of oxygen deprivation
some particular part of the brain and an observable behavior (Lieberman et al. 2005) also can result in an inability to com-
that results from many local operations linked in a neural cir- prehend distinctions in meaning conveyed by complex syntax.
cuit. Complex brains, including the human brain, perform local Alicted individuals appear to have diiculty switching the cog-
operations involving tactile, visual, or auditory stimuli in partic- nitive pattern generators that code syntactic operations at clause
ular regions of the brain. Other neural structures perform local boundaries or that in sentences depart from a simple canonical
operations that regulate aspects of motor control or hold infor- form. hese subjects typically have diiculty sequencing motor
mation in short-term (working) memory, and so on. acts, including those involved in speech. heir motor acts are
he basic computational elements of biological brains are slower, resulting in longer vowel durations, and those subjects
neurons. Local operations result from activity in an anatomically have diiculty rapidly sequencing the tongue, lip, and laryngeal
segregated, population (a group) of neurons. A given part of the maneuvers necessary to diferentiate stop consonants, such as
brain many contain many distinct anatomically segregated neu- [b] for [p], or [d] from [t].
ronal populations that each carry out similar local operations.
But these local operations do not constitute observable behav- Motor Control and Syntax
iors. Each anatomically segregated neuronal population projects Linguists have long realized that the syntactic operations (i.e.,
to anatomically distinct neuronal populations in other regions of the rules that they use to describe the structure of a sentence)
the brain, forming a neural circuit. he linked local operations yield hierarchical structures. In describing the syntax of the sen-
performed in the circuit constitute the neural basis of an observ- tence John saw the cat, the words the cat are part of a constituent
able aspect of behavior, such as striking the keys of a computer that includes the verb saw. he rules that can be used to describe
keyboard. seemingly simple motor acts such as walking also yield hierarchi-
cal structures. Both motor control and syntax involve selectional
Basal Ganglia Operations constraints that result in hierarchical structures. For example, the
Research that initially focused on Parkinsons disease, a neu- motor pattern generator for heel strike cannot be activated before
rodegenerative disease that afects the operation of the basal or much after your foot meets the ground. his yields a hierarchi-
ganglia, largely sparing cortex, demonstrated their role in motor cal tree diagram similar to those commonly used to convey the
control, syntax, and cognitive lexibility. In their review article, C. grammatical structure of a sentence. he syntactic tree diagram
D. Marsden and J. A. Obeso (1994) noted that the basal ganglia for a square dance in which swing your partner occurred again
constitute a sequencing engine for both motor and cognitive and again would not difer in principle from that of a sentence
acts. he basal ganglia regulate routine motor acts by activating having embedded relative clauses. (For more on the similarities
and linking motor pattern generators that each constitute an between motor control rules and those of generative syntax, see
instruction set for a submovement to the frontal regions of the Lieberman 2006).
brain involved in motor control. As each submovement reaches
its goal, the pattern generator for the next appropriate submove- Genetic Findings
ment is activated. herefore, motor control deicits characterize Studies of the regulatory gene FOXP2 provide a starting point for
neurodegenerative diseases such as Parkinsons that degrade understanding the evolution of the cortical-striatal-cortical cir-
basal ganglia operations. cuits that confer human linguistic ability (see genes and lan-
he basal ganglia have other motor functions; in changing guage). Other genes undoubtedly are involved and FOXP2 is not
circumstances, they can switch to a set of motor pattern gen- a language gene. FOXP2 governs the embryonic development
erators that constitute a better it to the changed environment of the basal ganglia, other subcortical structures, and lung tissue
constituting adaptive motor control. Basal ganglia operations and other structures. Its discovery resulted from a long-term study
involving cognitive pattern generators (Graybiel 1997) account of an extended family in which many individuals are marked by
for the subcortical dementia associated with Parkinsons disease. a genetic anomaly. A syndrome, a suite of speech and orofacial
Alicted individuals perseverate: hey are unable to switch to a movement disorders, and cognitive and linguistic deicits mark
new train of thought when circumstances change. On cognitive these individuals. hey are not able to protrude their tongues
tests such as the Wisconsin Card Sorting Test (WCST), they have while closing their lips, cannot repeat two word sequences, and
diiculty switching to a new cognitive criterion. For example, a have diiculty comprehending distinctions in meaning conveyed
subject who has been successfully sorting cards by their color by syntax (Vargha-Khadem et al. 1998). On standardized intelli-
will have diiculty switching to sorting them by the number of gence tests, they have signiicantly lower scores than their nonaf-
symbols printed on each card. Neurophysiologic studies that licted siblings. MRI imaging shows that the caudate nucleus (a
trace the linkages between the segregated neuronal populations basal ganglia structure) is abnormal. fMRI imaging, which pro-
of the basal ganglia and cortex conirm circuits that project from vides a measure of neural activity, shows underactivation in the
the basal ganglia to regions of the brain that are implicated in putamen (the principal basal ganglia input structure), Brocas
120
Basal Ganglia Basic Level Concepts
area, and its right homolog (Watkins et al. 2002; Liegeois et al. Lieberman, Philip, A. Morey, J. Hochstadt, M. Larson, and S. Mather S.
2003). hese structures are connected by neural circuits through 2005. Mount Everest: A space analogue for speech monitoring of cog-
the striatum (Lehericy et al. 2004). he behavioral deicits of nitive deicits and stress. Aviation, Space and Environmental Medicine
alicted individuals are similar to those seen in Parkinsons dis- 76: 198207.
Liegeois, F., T. Baldeweg, A. Connelly, D. G. Gadian, M. Mishkin, and F.
ease and oxygen deprivation (cf. Lieberman 2006 for details).
Varhgha-Khadem, 2003. Language fMRI abnormalities associated
he role of FOXP2 during early brain development in humans
with FOXP2 gene mutation. Nature Neuroscience 6: 12307.
and of the mouse version ( foxp2) in mice was established by C. Marsden, C. D., and J. A. Obeso.1994. he functions of the basal ganglia
S. Lai and colleagues (2003). he gene governs the expression of and the paradox of stereotaxic surgery in Parkinsons disease. Brain
other genes during embryonic development. In both the human 117: 87797.
and mouse brain, the gene is active in the interconnected neu- Monchi, O., P. Petrides, V. Petre, K. Worsley, and A. Dagher. 2001.
ral structures that constitute the cortical-striatal-cortical circuits Wisconsin card sorting revisited: Distinct neural circuits participat-
regulating motor control and cognition in humans, including the ing in diferent stages of the task identiied by event-related functional
caudate nucleus and putamen of the basal ganglia, the thala- magnetic resonance imaging. Journal of Neuroscience 21: 773341.
mus, inferior olives, and cerebellum. Despite the high degree Stringer, Christopher B. 1998. Chronological and biogeographic per-
of similarity, the mouse and human versions are separated by spectives on later human evolution. In Neanderthals and Modern
Humans in Western Asia, ed. T. Akazawa, K. Abel, and O. Bar-Yosef,
three mutations. he chimpanzee and human versions are sepa-
2938. New York: Plenum.
rated by two mutations. W. Enard and colleagues (2002), using
Stuss, Donald T., and D. F. Benson. 1986. he Frontal Lobes. New
the techniques of molecular genetics, estimate that the human York: Raven
form appeared somewhere in the last 200,000 years, in the time Vargha-Khadem, Faraneh, K. E. Watkins, C. J. Price, J. Ashburner, K. J.
frame (Stringer 1998) associated with the emergence of anatomi- Alcock, A. Connelly, R. S. Frackowiak, K. J. Friston, M. E. Pembrey, M.
cally modern Homo sapiens. he appearance of human speech Mishkin, D. G. Gadian, and R. E. Passingham. 1998. Neural basis of an
anatomy 50,000 years ago presupposes the prior appearance of inherited speech and language disorder. PNAS USA 95: 12695700.
this neural substrate (see speech anatomy, evolution of). Watkins, Kate, F. Vargha-Khadem, J. Ashburn, R. E. Passingham, A.
In short, the basal ganglia are neural structures that were ini- Connelly, K. J. Friston, R. S. Frackiwiak, M. Miskin, and D. G. Gadian.
tially adapted for one function motor control. In the course of 2002. MRI analysis of an inherited speech and language disor-
der: Structural brain abnormalities. Brain 125: 46578.
evolution, the human basal ganglia were modiied, taking on
additional cognitive and linguistic tasks.
Philip Lieberman BASIC LEVEL CONCEPTS

A concept is a mental representation that allows people to
WORKS CITED AND SUGGESTIONS FOR FURTHER READING pick out a group of equivalent things or a category (see catego-
Dobzhansky, heodosius. 1973. Nothing in biology makes sense except
rization). For example, people use their concept of dog to pick
in the light of evolution. American Biology Teacher 35: 1259. out members of category of things that are called dogs.
Enard, W., M. Przeworski, S. E. Fisher, C. S. Lai, V. Wiebe, T. Kitano, A. P. Concepts are also organized into hierarchical taxonomies,
Monaco, and S. Paabo. 2002. Molecular evolution of FOXP2, a gene or sequences of progressively larger categories, in which each
involved in speech and language. Nature 41: 86972. category includes all the previous ones. For example, an object
Graybiel, Ann M. 1997. he basal ganglia and cognitive pattern genera- driven on a highway with four wheels and a top that folds back
tors. Schizoprenia Bulletin 23: 45969. can be called a convertible, a car, or a vehicle. he category car is
Grossman, Murray, S. Carvell, S. Gollomp, M. B. Stern, G. Vernon, and more general than convertible because it includes other objects
H. I. Hurtig. 1992. Sentence comprehension and praxis deicits in (e.g., station wagons) as well as the members of convertible. he
Parkinsons disease. Neurology 41: 16208.
category vehicle is more general than convertible and car because
Hauser, Marc D., N. Chomsky, and W. T. Fitch. 2002. he faculty of
it contains other objects (e.g., trucks) as well as the members of
language: What is it, who had it, and how did it evolve? Science
298: 156979.
these categories.
Kotz, Sonia A., M. Meyer, K. Alter, M. Besson, D. Y. von Cramon, and A. D. Strong evidence from cognitive psychology (Rosch et al.
Frederica. 2003. On the lateralization of emotional prosody: An fMRI 1976) and anthropology (Berlin 1992) suggests that one level of
investigation. Brain and Language 86: 36676. such hierarchies is cognitively privileged. Eleanor Rosch and
Lai, C. S., D. Gerrelli, A. P. Monaco, S. E. Fisher, and A. J. Copp. 2003. colleagues (1976) used a wide range of converging methods
FOXP2 expression during brain development coincides with adult that singled out the basic level as playing a central role in many
sites of a pathology in a severe speech and language disorder. Brain categorization processes. For example, the category level repre-
126: 245562. sented by chair and dog is typically considered the basic level, in
Lehericy, S. M., M. Ducros, P. F. Van de Moortele, C. Francois, L. hivard, contrast to more general superordinate concepts, such as furni-
C. Poupon, N. Swindale, K. Ugurbil, and D. S. Kim. 2004. Difusion
ture and animal, and more speciic subordinate concepts, such
tensor tracking shows distinct corticostriatal circuits in humans.
as recliner and labrador retriever.
Annals of Neurology 55: 5229.
Lieberman, Philip. 2006. Toward an Evolutionary Biology of Language.
Basic level concepts have advantages over other concepts.
Cambridge: Harvard University Press. Pictures of objects are categorized faster at the basic level than
Lieberman, Philip, J. Friedman, and L. S. Feldman. 1990. Syntactic dei- at other levels (Jolicoeur, Gluck, and Kosslyn 1984). As noted
cits in Parkinsons Disease. Journal of Nervous and Mental Disease by Rosch and her colleagues, people primarily use basic level
178: 3605. names in naming tasks, and the basic level is the highest level
121
Basic Level Concepts Bilingual Education
for which category members have similar overall shape (cf. car Cantor, Nancy, and Walter Mischel. 1979. Prototypes in person percep-
versus vehicle). Children learn basic level concepts sooner than tion. In Advances in Experimental Social Psychology, ed. L. Berkowitz,
other concepts (Brown 1958; Horton and Markman 1980). Basic 452. New York: Academic Press.
level advantages are found in many other domains, including Horton, Marjorie, and Ellen Markman. 1980. Developmental difer-
ences in the acquisition of basic and superordinate categories. Child
environmental scenes (Tversky and Hemenway 1983), social
categories (Cantor and Mischel 1979), and actions (Morris and
Jolicoeur, Pierre, Mark Gluck, and Steven Kosslyn. 1984. Pictures and
Murphy 1990). names: Making the connection. Cognitive Psychology 16: 24375.
One explanation for the advantages of basic level categories Lassaline, Mary, Edward Wisniewski, and Douglas Medin. 1992.
over other categories is that they are more diferentiated (Rosch Basic levels in artiicial and natural categories: Are all basic cat-
et al. 1976; Murphy and Brownell 1985). Members of basic level egories created equal? In Percepts, Concepts, and Categories: he
categories have many features in common. hese features are Representation and Processing of Information, ed. B. Burns, 32880.
also distinct from those of other categories at this level. In con- North Holland: Elsevier.
trast, although members of more speciic, subordinate catego- Morris, Michael, and Gregory Murphy. 1990. Converging operations on
ries (e.g., sports car) have slightly more features in common than a basic level in event taxonomies. Memory and Cognition 18: 40718.
do those of basic level categories, many of these features are not Murphy, Gregory, and Hiram Brownell. 1985. Category diferentiation in
object recognition: Typicality constraints on the basic category advan-
distinctive. hat is, members of a subordinate category share
tage. Journal of Experimental Psychology: Learning, Memory and
their features with other subordinates (e.g., members of sports
Cognition 11: 7084.
car share a number of features with other subcategories of car). Murphy, Gregory, and Mary Lassaline. 1997. Hierarchical structure
In contrast, the members of more general, superordinate catego- in concepts and the basic level of categorization. In Knowledge,
ries (e.g., vehicle) have few common features. Concepts, and Categories, ed. K. Lamberts and D. Shanks, 93131.
Diferentiation explains the basic level advantage because London: Psychology Press
it relects a compromise between two competing functions of Rosch, Eleanor, Carolyn Mervis, Wayne Gray, David Johnson, and Penny
concepts. Categories should be informative so that one can draw Boyes-Braem. 1976. Basic objects in natural categories. Cognitive
inferences about an entity on the basis of its category member- Psychology 8: 382439.
ship. Emphasizing this function leads to the formation of large Tanaka, James, and Marjorie Taylor. 1991. Object categories and exper-
tise: Is the basic level in the eye of the beholder? Cognitive Psychology
numbers of categories with the inest possible discriminations
23: 47282.
between categories (Rosch et al. 1976, 384). However, the for-
Tversky, Barbara, and Kathy Hemenway. 1983. Categories of environ-
mation of categories should only preserve important diferences
mental scenes. Cognitive Psychology 15: 12149.
between them that are practical: It is to the organisms advantage
not to diferentiate one stimulus from others when that diferen-
tiation is irrelevant to the purposes at hand (ibid.). his function BILINGUAL EDUCATION
counteracts the tendency to create large numbers of categories
and relects the principle of cognitive economy (Rosch et al. In principle, bilingual education is just the use of two languages
1976). Overall, basic level categories have an advantage because in instruction in a school setting. However, in practice, it covers
they are relatively general and informative, whereas superordi- a wide array of programs. Bilingual education programs range
nate categories, though general, are not informative and subor- from high-status schools promoting international education
dinate categories, though informative, are not general. through prestige languages, such as English and French, to
he basic level may change with expertise in a way that is highly marginalized schools devoted to the bare-bones school-
consistent with the diferentiation explanation. For example, ing of immigrant children.
James Tanaka and Marjorie Taylor (1991) investigated exper- In its weak form, bilingual education may involve transi-
tise efects on the basic level in expert dog breeders and bird- tional or subtractive bilingualism, leading to monolingualism
watchers. Using a number of tasks, they tested each expert in (e.g., teaching Spanish-speaking children English to ensure their
both the dog and bird domains. For instance, in a speeded cate- assimilation and integration into mainstream America). In its
gorization task, experts in their novice domain were fastest at the strong form, bilingual education aims at maintaining the lan-
basic level and slowest at the subordinate level (as Rosch et al. guage of a minority child in addition to the learning of a major-
1976 found). However, in their area of expertise, categorization ity language, thus leading to additive bilingualism. Heritage
was equally fast at the basic and subordinate levels. For more bilingual schools often practice an ideal version of additive
detailed reviews of this literature, see Lassaline, Wisniewski, bilingualism, which stresses bicultural education in addition to
and Medin 1992; Murphy and Lassaline 1997.) bilingualism.
One of the central concerns of bilingual education is to
Edward Wisniewski address the educational needs/performance of minority chil-
dren by maintaining their mother tongue. he proponents of
WORKS CITED AND SUGGESTIONS FOR FURTHER READING maintaining the mother tongue claim that such maintenance
is critical for linguistic and cognitive growth of the child, school
Berlin, Brent. 1992. Ethnobiological Classiication: Principles of
Categorization of Plants and Animals in Traditional Societies. performance, psychological security, ethnic and cultural iden-
Princeton, NJ: Princeton University Press. tity (see ethnolinguistic identity), self-esteem, and many
Brown, Roger. 1958. How shall a thing be called? Psychological Review other positive personal and intellectual characteristics. he sup-
65: 1421. porters of transitional bilingualism claim that only transitional
122
Bilingualism, Neurobiology of
bilingualism is capable of saving children from poor academic Historical Overview

performance and allowing for assimilation. From a historical standpoint, the irst approach to studying
Bilingual education has been steadily gaining strength around brain organization for bilingualism was the study of bilingual
the globe since the era of decolonization. It is further fueled by aphasics. Several clinical aphasia studies have shown that bilin-
the growth of ethnic awareness and the movement to prevent gual aphasics do not necessarily manifest the same language
the extinction of languages of the world. Many countries, disorders with the same degree of severity in both languages. In
particularly in Europe, that earlier fostered monolithic ideology some cases, L1 is recovered better than a L2. In other cases, the
have begun to recognize their diversity as a source of social and converse obtains. Since the landmark 1895 study of the French
economic capital, thus marking a new era of bilingual/multilin- neurologist Albert Pitres, who was the irst to draw attention to
gual education (e.g., in the United Kingdom, France, and Spain, the relative frequency of diferential language recovery follow-
among others; see language policy). ing aphasia in bilinguals, many diferent recovery patterns have
Multilingual countries of Asia and Africa continue to nurture been described: from selective recovery of a given language
a long tradition of bilingual education. Since 1956, India, for (i.e., one language remains impaired while the other recov-
example, has had as oicial policy a three-language formula in ers); parallel recovery of both languages, successive recovery
education. his formula calls for multilingual education. In addi- (i.e., after the recovery process of one language, the other lan-
tion to learning the two national languages Hindi and English guage recovers); alternating recovery (i.e., the language that
students are expected to learn a third or a fourth language. was irst recovered will be lost again due to the recovery of the
Because of its deep-rooted association with immigrants, bilin- language that was not irst recovered); and alternating antago-
gual education in the United States is particularly notable for its nistic recovery (i.e., on one day the patient is able to speak in
turbulent history. On June 2, 1998, the people of California voted one language while on the next day only in the other); to the
to end a tradition of bilingual education by passing Proposition pathological mixing of two languages (i.e., the elements of the
227, which gives immigrant children just one year to learn English two languages are involuntarily mixed during language pro-
before they enroll in regular classes. Many school systems in duction). he study of bilingual aphasia is important because
other states are waiting either to put in place severe restrictions it indicates the cortical regions necessary for performance of a
on bilingual instruction or to eliminate it completely by passing linguistic task (e.g., speaking in L1).
English only policies (for details, see Genesee 2006). Clinical case reports indicate a set of relevant factors and
While bilingual education is often associated with the edu- have led to theoretical conjectures. However, at present we lack a
cation of minority students (e.g., in the United States), the causal account for the various recovery patterns and cannot pre-
Canadian immersion programs in French devoted to the dict clinical outcomes. Concerning the possible factors involved,
majority Anglophones serve as a model of bilingual education no correlation has been found between the pattern of recovery
for majority students (for details, see Genesee 2006). and neurological, etiological, experiential, or linguistic param-
eters: not site, size or origin of lesion, type or severity of aphasia,
Tej K. Bhatia
type of bilingualism, language structure type, or factors related to
acquisition or habitual use.
heoretical conjectures arising from the study of bilingual
Crawford, James. 2004. Educating English Learners. Los Angeles: Bilingual aphasia developed along two distinct lines, a more traditional
Educational Services. approach and a more dynamic approach. he more traditional
Genesee, Fred. 2006. What do we know about bilingual education for localizationist view argued, for instance, that the speciic loss
majority-language students? In Handbook of Bilingualism, ed. Tej K. of one language would occur because the bilinguals languages
Bhatia and William C. Ritchie, 54776. Oxford: Blackwell.
are represented in diferent brain areas or even in diferent
hemispheres, and hence, a focal brain lesion within a language-
speciic area may alter only that speciic language, leaving the
BILINGUALISM, NEUROBIOLOGY OF
other language intact. In contrast, according to the dynamic view
Neurobiology of bilingualism refers to the study of the cere- selective recovery arises because of compromise to the language
bral organization of multiple languages in the human brain. system, rather than to damage to diferential brain representa-
From early accounts of selective loss and recovery in bilingual tions. A selective loss of a language arises because of increased
aphasia (i.e., loss of language due to a brain lesion) to recent inhibition, that is, of a raised activation threshold for the afected
electrophysiological and functional neuroimaging studies, issues or lost language, or even because of an imbalance in the means
inherent to the bilingual brain have inspired researchers for more to activate the language due to the lesion. It is worth underlining
than a century. Investigations into the neural basis of bilingual- that Pitres himself proposed a dynamic explanation of language
ism focus not only on how two languages (L1 and L2) are repre- recovery in bilingual aphasics: Language recovery could occur
sented in the brain (i.e., the anatomical location) but also on how only if the lesion had not entirely destroyed language areas but
these languages are processed. Indeed, the main assumption is temporarily inhibited them through a sort of pathological iner-
that a weaker L2 may be processed through brain mechanisms tia. In Pitress opinion, the patient generally irst recovered the
that may difer from those underlying L1 processing. After a brief language to which she/he was premorbidly more exposed (not
historical overview, I illustrate indings inherent to the represen- necessarily the native language) because the neural elements
tation of languages, followed by a section focusing on language subserving the more exposed language were more strongly
processing. associated.
123
Bilingualism, Neurobiology of
he dynamic view not only explains the so-called selec- L2 is acquired beyond the critical periods. he dependence of
tive recovery of a language but can also explain many reported grammatical processing upon these age efects was conirmed
recovery patterns in bilingual aphasia. As outlined by M. Paradis by early event-related potentials (ERP) studies (Weber-Fox and
(1998), a parallel recovery would then occur when both lan- Neville 1996) and by recent functional brain imaging studies
guages are inhibited to the same degree. When inhibition afects (Wartenburger et al. 2003). In particular, I. Wartenburger and
only one language for a period of time and then shifts to the other colleagues reported no diferences in brain activations for gram-
language (with disinhibition of the prior inhibited language) a mar in L1 and L2 in very early (from birth) highly proicient bilin-
pattern of alternating antagonistic recovery occurs (see Green guals. On the other hand, late highly proicient bilinguals were
1986). Selective recovery would occur if the lesion permanently in need of additional neural resources in order to achieve a com-
raised the activation threshold for one language, and pathologi- parable nativelike performance in grammatical tasks. he same
cal mixing among languages would occur when languages could did not apply to lexical-semantic processing, for which the only
no longer be selectively inhibited. diference in the pattern of brain activity in bilinguals appeared
In general, the aphasia data have provided a rich source of evi- to depend upon the level of attained proiciency.
dence on the range of language disorders and language recovery As mentioned, the degree of language proiciency seems to
patterns in bilinguals. However, there are limitations to the gen- exert a more pervasive inluence on the lexical-semantic level
eralizability of such data to neurologically healthy individuals. of L2. According to psycholinguistics, during the early stages
Concerns about the lesion-deicit approach include the inability of L2 acquisition there may be a dependency on L1 to mediate
to determine whether speciic language deicits are the result of access to meaning for L2 lexical items. As L2 proiciency grows,
damage to a specialized language component at the lesion site, this dependency disappears. Higher levels of proiciency in L2
or if the damaged area is simply part of a larger neural network produce lexical-semantic mental representations that more
that mediates a given component of language. Likewise, aphasia closely resemble those constructed in L1. According to D. W.
data do not allow one to separate the efects of injury from those Greens convergence hypothesis (2003), any qualitative difer-
of neural plasticity or a reallocation of healthy cortical tissue ences between native and L2 speakers disappear as proiciency
for the mediation of language functions lost as a result of brain increases. he convergence hypothesis claims that the acquisi-
injury. Nevertheless, studying the efects of brain damage on lin- tion of L2 arises in the context of an already speciied or partially
guistic function in bilinguals has led to a number of interesting speciied system and that L2 will receive convergent neural rep-
observations about the nature and course of language impair- resentation within the representations of the language learned
ment and recovery, which in turn has stimulated researchers to as L1.
apply functional neuroimaging techniques to the investigation of Whether word or sentence production and word completion
bilingual language processing. were used as experimental tasks, neuroimaging studies reported
common activations in the left hemisphere when the degree
The Neural Representation of L2 of L2 proiciency was comparable to that of L1. his happened
Since its inception, neuroimaging work on bilinguals has been irrespective of the diferences in orthography, phonology and
motivated by the same localizationist questions that run through syntax among languages. Conversely, bilinguals with low pro-
the bilingual aphasia literature: whether multiple languages are iciency in L2 engaged additional brain activity, mostly in the left
represented in overlapping or separate cerebral systems. In addi- prefrontal cortex. Similar results were found in studies that did
tion, neuroimaging and neurophysiological data on this issue not directly address lexical retrieval, but employed judgment
have often been inluenced by possible biases, such as lack of tasks in the lexical-semantic domain.
information on the age of acquisition and degree of proiciency It is worth underlining that the activity found in the left pre-
in the experimental subjects. Both these variables indeed exert frontal cortex is located anteriorily to the classical language areas
profound inluences on the brain organization of L2. and, thus, not directly linked to language functions but rather
According to psycholinguistic evidence grounded on the linked to other cognitive functions, such as cognitive control
concept of universal grammar, the age of L2 acquisition is and attention. Crucially, the engagement of the left prefrontal
expected to be crucial for grammatical processing. In fact, gram- cortex was reported for bilinguals with a low degree of L2 proi-
matical processing may be particularly deicient when L2 is ciency and/or exposure. One may conclude that the diferences
learned later in life. On the other hand, lexical-semantic process- found between high and low proicient bilinguals are not due to
ing seems to be less afected by age of acquisition than to depend anatomical diferences of L2 brain representations but instead
on the degree of L2 proiciency. relect the cognitive dynamics of processing a weaker L2 as com-
It is likely that other factors, such as usage and exposure to a pared to L1.
given language, can afect brain plasticity mechanisms, leading
to modiications of the neural substrate of language. I consider Neural Aspects of L2 Processing
separately how these variables may inluence L2 processing. One of the most salient aspects and one speciic to bilingual lan-
An ongoing issue in neurobiology concerns the fact that the guage processing is language control. Language control refers to
acquisition of language seems to depend on appropriate input the fact that there may be competition between languages and
during a biologically based critical periods. It has also been that this competition is resolved by actively inhibiting the so-
suggested that L2 learning may be subject to such crucial time- called non-target language. Consider that individuals can perform
locked constraints. However, L2 can be acquired at any time in diferent actions on the same stimulus. For instance, a bilingual
life, although L2 proiciency is rarely comparable to that of L1 if can name a presented word in L1 or translate it into L2. he task
124
Bilingualism, Neurobiology of Bilingualism and Multilingualism
goal must be maintained in the face of conlicting goals, and the Consider as an example the complex process of L2 acquisition.
various actions required to perform the task must be coordinated his process may be considered as a dynamic process, requiring
(e.g., retrieve or compute the words phonology from its spelling additional neural resources in the early stages of L2 acquisition.
or retrieve the meaning of the word and select its translation). hese additional neural resources are mostly found within the
Once a given task is established, however (e.g., speaking in L2), left prefrontal cortex (more anteriorily to the classical language
competition with alternative possible tasks (speaking in L1) may areas), the left basal ganglia, and the anterior cingulated cor-
be resolved more automatically. Where individuals wish to alter tex and seem to be associated with the greater control demand
their goal (for example, to switch from speaking in one language when processing a weaker L2. However, once the L2 learner
to speaking in another), they must disengage from the current gains suicient L2 proiciency, the neural representation of L2
goal and switch to the new goal. Lexical concepts matching the converges to that of L1, at least at the macroanatomical level.
intended language must be selected and produced, while those At this stage, one may suppose that L2 is processed in the same
not matching the intended language must be inhibited through fashion as L1, as psycholinguistic evidence points out (Kroll and
language control mechanisms. For instance, in word production Stewart 1994).
studies, language control would inhibit potential interferences his latter point is an important one because many functional
from the non-target language. Psycholinguistic evidence points neuroimaging studies did not take into consideration linguistic
to the fact that such interference is more common during pro- and psycholinguistic evidence (Paradis 2004). Yet evidence from
duction in a language that is mastered to a lower degree of pro- neuroimaging should be integrated with the psycholinguistic
iciency, for example, a weak L2. In that case, for example, when indings to the mutual advantage of both research traditions.
asked to name a picture in L2, the bilingual speaker has to inhibit Integrating these indings with the psycholinguistic theory may
L1 in order to prevent a prepotent interference from L1. allow us to demonstrate the biological consistency of diferent
Functional neuroimaging studies using experimental tasks models, organize and consolidate existing indings, and gener-
like picture naming, switching, translating, and so on have ele- ate novel insights into the nature of the cerebral organization of
gantly shown that these tasks are paralleled by the activation of bilingualism.
a set of brain areas that are not directly linked to language repre-
Jubin Abutalebi
sentation, such as the brain activity within the left prefrontal cor-
tex, the left caudate nucleus, and the anterior cingulate cortex.
he engagement of these areas is even more relevant when sub- WORKS CITED AND SUGGESTIONS FOR FURTHER READING
jects have to process a weak L2. he functions generally ascribed Abutalebi J., S. F. Cappa and D. Perani. 2005. Functional neuroimaging
to the prefrontal cortex comprise working memory, response of the bilingual brain. In Handbook of Bilingualism: Psycholinguistic
inhibition, response selection, and decision making, while the Approaches, ed. J. F. K Kroll and A. De Groot, 497515. Oxford: Oxford
left caudate was reported to be imported for language selection University Press.
and set switching. he anterior cingulate cortex is related to such Green, D. W. 1986. Control, activation and resource. Brain and
Language 27: 21023.
functions as conlict monitoring, attention, and error detection.
. 2003. he neural basis of the lexicon and the grammar in L2
It becomes clear that the engagement of these structures pro-
acquisition. In he Interface Between Syntax and the Lexicon in Second
vides a cerebral testimony of the cognitive processes inherent to
Language Acquisition, ed. R. van Hout, A. Hulk, F. Kuiken, and R.
bilingual language processing: competition and control between Towell, 197218. Amsterdam: John Benjamins.
languages. Kroll, J. F., and E. Stewart. 1994. Category interference in translation
and picture naming: Evidence for asymmetric connections between
Conclusions bilingual memory representations. Journal of Language and Memory
Extensive reviews focusing on the bilingual brain as studied with 33: 14974.
functional neuroimaging are available in the literature to which Paradis, M. 1998. Language and communication in multilinguals. In
the reader is referred (Abutalebi, Cappa, and Perani 2005; Perani Handbook of Neurolinguistics, ed. B. Stemmer and H. Whitaker. San
and Abutalebi 2005; but see also Paradis (2004) for a critical Diego, CA: Academic Press, 417430.
. 2004. A Neurolinguistic heory of Bilingualism. Amsterdam and
viewpoint). In broad outlines, functional neuroimaging has shed
new light on the neural basis of L2 processing and on its relation-
Perani, D., and J. Abutalebi. 2005. Neural basis of irst and second lan-
ship to native language (L1). First of all, the long-held assump- guage processing. Current Opinion of Neurobiology 15: 2026.
tion that L1 and L2 are necessarily represented in diferent brain Wartenburger, I., H. R. Heekeren, J. Abutalebi, S. F. Cappa, A. Villringer,
regions or even in diferent hemispheres in bilinguals has not and D. Perani. 2003. Early setting of grammatical processing in the
been conirmed. On the contrary, functional neuroimaging has bilingual brain. Neuron 37: 15970.
elegantly outlined that L1 and L2 are processed by the same Weber-Fox, C. M., and H. J. Neville. 1996. Maturational constraints on
neural devices. Indeed, the patterns of brain activation associ- functional specialization for language processing: ERP and behavioral
ated with tasks that engage speciic aspects of linguistic process- evidence in bilingual speakers. Journal of Cognitive Neuroscience
ing are remarkably consistent among diferent languages, which 8: 23156.
share the same brain language system. hese relatively ixed
brain patterns, however, are modulated by a number of factors.
BILINGUALISM AND MULTILINGUALISM
Proiciency, age of acquisition, and exposure can afect the cere-
bral representations of each language, interacting in a complex Growing recognition that bilingualism/multilingualism is
way with the modalities of language performance. not an exception or irregular phenomenon but is, in fact, a
125
Bilingualism and Multilingualism
growing global phenomenon marks a new challenge and a shift conceptual and linguistic levels (see bilingualism, neurobi-
for linguistic research. For instance, the traditional domain of ology of). hese labels and dichotomies demonstrate the com-
psycholinguistic research, which has been the monolingual plex attributes of bilingualism that make the task of deining and
child, is now shifting to the bilingual child and multilingual lan- measuring bilinguals a daunting one. A working deinition of
guage processing (see, e.g., de Bot and Kroll 2002, 133). What is bilingualism is ofered by Leonard Bloomied ([1933] 1984, 53),
bilingualism and who is a bilingual? he questions of identiica- who claimed that a bilingual is one who has native-like control
tion and measurement that are considered irrelevant in the con- over two languages (i.e., balanced bilingual).
text of monolingualism become more pertinent and urgent in
the context of bilingual language acquisition, production, com- Bilinguals Language Organization
prehension, and processing. Bilinguals organization of a verbal repertoire in the brain is also
very diferent from that of monolinguals. When a monolingual
Bilingualism/Multilingualism: Two Conceptual Views decides to speak, his/her brain does not have to make complex
Is a bilingual a composite of two monolinguals? Does the bilin- decisions concerning language choice as does the bilingual. Such
gual brain comprise two monolinguals crowded into a limited a decision-making process for a monolingual is restricted at best
space? For some researchers, the answer to these questions has to the choice of a variety/style (informal vs. formal) selection.
traditionally been airmative. Such a view of bilingualism is It is inconceivable for monolinguals to imagine that a mul-
termed the fractional view. According to this view, monolingual- tilingual person, such as this author, has to make a choice from
ism holds a key to the understanding of bilingualism. However, among four languages and their varieties while communicating
a more balanced and accurate picture of bilingualism emerges within his family in India. he language choice is not a random
from the holistic view of bilingualism. According to this view, nei- one but is unconsciously governed by a set of factors. he author
ther is a bilingual person the mere sum of two monolinguals nor is a speaker of Multani, Punjabi, Hindi, and English. Normally,
is the bilingual brain a composite of two monolingual brains. he he used Multani to talk with his brothers and parents while grow-
reason for this position is that the cooperation, competition, and ing up. He speaks Punjabi with two of his sisters-in-law, Hindi
coexistence of the bilinguals two languages make a bilingual a with his nephews and nieces, and English with his children. In
very complex and colorful individual (for details, see Grosjean short, each language in his brain is associated with a well-de-
1989). ined domain. A violation of such a domain allocation has seri-
ous implications not only for communication mishaps but also
Deining and Measuring Bilingualism: Input Conditions and for interpersonal relationships. In addition to the language-per-
Input Types son domain allocation, other factors such as topics and emotions
Deining and measuring bilingualism is a very complex and determine his language choice. While discussing an academic
uphill task due to the number and types of input conditions. For topic, he switches from Multani to English with his brothers and
instance, while a monolingual child receives input from his or from English to Hindi with his children if the context is emotive.
her parents only in one language in all settings, a bilingual child In short, the determinants of language choice are quite com-
is provided input at least in two separate languages (e.g., one- plex among bilinguals, and this, in turn, presents evidence that
parent one-language input; one-place one-language input) in bilinguals organization of their verbal repertoire is quite difer-
addition to a code-mixed input in a variety of environments. In ent from monolinguals organization.It is interesting to note that
addition, biological (age of acquisition), sociopsychological, and language choice (or language negotiation) is a salient feature of
other nonlinguistic factors lead to a varying degree of bilingual bilingual linguistic competence and performance. he com-
language competencies. herefore, it is natural that no widely plexity of language choice and its unconscious determinants
accepted deinition or measure of bilingualism exists. Instead, a pose a serious challenge for the psycholinguistic theory of bilin-
rich range of scales, dichotomies, and categories are employed gual language production.
to describe bilinguals. A bilingual who can speak and under-
stand two languages is called a productive bilingual, whereas a Individual, Societal, and Political Bilingualism
receptive bilingual is an individual who can understand but can- Bilingualism can be viewed from individual, societal, and politi-
not speak a second language. A child who has acquired two lan- cal perspectives. In a bilingual family, not all members are always
guages before the age of ive at home (natural setting) is called a bilinguals. Parents may be monolingual, while children may be
simultaneous or early bilingual, whereas those who learn a sec- bilinguals or vice versa. Societal factors such as the overt prestige
ond language after the age of ive, either at home or in school set- of a language (or a presence of a majority language) often leads
ting, are described as late or sequential bilinguals. Other labels to individual or family bilingualism. However, individual or fam-
and dichotomies, such as luent versus nonluent, balanced ver- ily bilingualism can persist even without societal support. Such
sus nonbalanced, primary versus secondary, and partial versus bilingualism can be termed covert prestige bilingualism, which is
complete, are based upon diferent types of language proiciency often motivated by the consideration of group identity.
(speaking, writing, listening) or on an asymmetrical relationship In those societies of Asia or Africa where bilingualism exists
between the two languages. as a natural phenomenon as the result of a centuries-long tradi-
Compound versus coordinate bilingualism refers to the difer- tion of bilingualism, an ethnic or local language becomes a Low
ential processing of language in the brain. Compound bilinguals variety, that is, it is acquired at home and/or in an informal set-
process two languages using a common conceptual system, ting outside school (e.g., on a playground), whereas a language of
whereas coordinate bilinguals keep language separation at both wider communication or a prestige language functions as a High
126
Bilingualism and Multilingualism
variety, which is learned formally in schools. In a diglossic Effects of Bilingualism/Multilingualism

society, a single language develops two distinct varieties, the What is the efect of bilingualism/multilingualism on an indi-
L- and the H-variety. vidual, particularly on a child? he research on this question
People become bilingual for a wide variety of reasons: immi- is fundamentally driven by two hypotheses: the linguistic def-
gration, jobs, marriage, or religion, among others. hese factors icit hypothesis and the linguistic augmentation hypothesis.
create a language contact situation but do not always lead to According to the former, bilingual children show serious linguis-
stable bilingualism. For instance, it is well known that immigrant tic and cognitive adverse efects of bilingualism. Exposure to two
communities in the United States often give up their mother languages leads to semilingualism; that is, they become deicient
tongue in favor of English and become monolingual after a brief in both languages, which in turn leads to other disabilities (e.g.,
period of bilingualism. stuttering) and cognitive impairments (low intelligence, mental
he classiication of countries as monolingual, bilingual, or retardation, and even schizophrenia).
multilingual often refers to the language policies of a country, Such a hypothesis has become obsolete in light of the ind-
rather than to the actual incidence of bilingualism or multilin- ings of the research driven by the linguistic augmentation
gualism. Canada is a bilingual country in the sense that its lan- hypothesis. Solid on theoretical and methodological grounds,
guage policies are receptive to bilingualism. It makes provision research by Elizabeth Peal and Wallace E. Lambert (1962) put
for learning French in those provinces that are Anglophone. to rest such negative and frightening efects of bilingualism.
Such a provision is called territorial bilingualism. However, it heir research and the indings of the succeeding research pro-
does not mean that everybody in Canada is a bilingual, nor does vide ample evidence that the negative conclusions of the earlier
it mean that the country guarantees individual bilingualism (per- research were premature and misguided due to the theoretical
sonality bilingualism) outside territorial bilingualism. In multi- and methodological laws. Contrary to the indings of the previ-
lingual countries such as India, where 20 languages are oicially ous research, bilingual children exhibit more cognitive lexibility
recognized, the government language policies are receptive to than do monolinguals and perform better on verbal and nonver-
multilingualism. Indias three-language formula is the oicial bal measures. Peal and Lamberts study, which was conducted in
language policy of the country. In addition to learning Hindi and Montreal, revolutionized research on bilingualism and multilin-
English, the conational languages, schoolchildren can learn a gualism by highlighting a positive conception of bilinguals. heir
third language, spoken outside their state. research has been replicated in many countries, conirming
the augmenting rather than subtracting efect of bilingualism.
Bilingual Verbal Behavior: Language Separation and Beyond this research, the economic, communicative (intergen-
Language Integration erational and cross-cultural), and relational (building relations)
Language separation and language integration are the two most advantages of bilingualism are inarguable.
salient characteristics of bilinguals and thus of the bilingual
brain. Whenever deemed appropriate, bilinguals can turn of one Conclusion
language and turn on the other language. his enables them to In short, bilingualism/mulitilingualism is a global phenomenon
switch from one language to another with the ease of a driver of that continues to gain further momentum in the age of globaliza-
a stick-shift car shifting into diferent gears whenever necessary. tion. It is a by-product of a number of biological, sociopsychologi-
he fractional view of bilingualism can account for such a verbal cal, and linguistic factors. hese factors lead to individuals with
behavior of bilinguals. In addition to keeping the two linguistic varying degree of language competencies. herefore, it is not sur-
systems separate, bilinguals can also integrate the two systems prising that deining and measuring bilingualism/multilingual-
by mixing two languages. Language mixing is a far more complex ism continues to be a challenging task. Bilinguals are complex
cognitive ability than language separation. he holistic view of and colorful in the way they manage and optimize their linguistic
bilingualism can account for these two types of competencies. resources. For that reason, they are not a sum of two monolinguals.
Language mixing comes naturally to bilinguals. herefore, it is Language mixing and shifting are two deining characteristics of
not surprising that such mixed languages as Spanglish, Hinglish, bilinguals. Current socio- and psycholinguistic research attempts
Japlish, and Germlish are emerging around the globe. to account for these two salient properties of the bilingual brain.
Contrary to the claims of earlier research, the grammar of lan-
Tej K. Bhatia
guage mixing is complex yet systematic. he search for explana-
tions of cross-linguistic generalizations about the phenomenon
of code mixing (particularly, code mixing within sentences) in
terms of independently justiied principles of language structure Bhatia, Tej K., and William C. Ritchie, eds. 2006. he Handbook of
and use has taken two distinct forms. One approach is formulated Bilingualism. Oxford: Blackwell. his book presents a multidisciplinary
and comprehensive collection of state-of-the-art research on bilingual-
in terms of the theory of linguistic competence, for example, Jef
ism and multilingualism. Chapters 7 and 8 deal with bilingual produc-
MacSwan (2005). he other approach as best exempliied by
tion models, including the discussion of Garrett 1988 and Levelt 1989.
the Matrix Language Frame (MLF) model (Myers-Scotton and
Bhatia, Tej K., and William C. Ritchie. 1996. Bilingual language mixing,
Jake 1995; see codeswitching) is grounded in the theory of Universal Grammar, and second language acquisition. In Handbook
sentence production, particularly that of M. Garrett (1988) and of Second Language Acquisition, ed. W. C. Ritchie and T. K. Bhatia,
W. Levelt (1989) (see Bhatia and Richie 1996, 6557 for discus- 62782. San Diego, CA: Academic Press.
sion). For further development of these ideas and a critique, see Bloomield, Leonard. [1933] 1984. Language. Chicago: University of
Bhatia and Ritchie (1996) and MacSwan (2005). Chicago Press.
127
Binding
de Bot, Kees, and Judith F. Kroll. 2002. Psycholinguistics. In An logic: he bindee must be contained in the sister constituent
Introduction to Applied Linguistics, ed. Norbert Schmitt, 13349. to the binder, a relation usually called c(onstituent)-command
London: Arnold. (see c-command). For movement relations, this amounts to the
Edwards, John. 2006. Foundations of bilingualism. In Bhatia and Richie ban on downward or sideways movement, the proper binding
2006, 731.
condition, which is pervasive across languages. For quantiier-
Garrett, M. E. 1988. Process in sentence production. In he
pronoun relations, it blocks sideways binding as in (4a) (neither
Cambridge Linguistic Survey. Vol. 3. Ed. F. Newmeyer, 6996.
noun phrase [NP] c-commands the other), and upward binding
Grosjean, Francois. 1989. Neurolinguists, beware! he bilingual is not as in (4b) (the putative binder is c-commanded by the pronoun);
two monolinguals in one person. Brain and Language 36: 315. note that in both examples the pronouns have to be interpreted
Hakuta, Kanji. 1986. Mirror of Language. New York: Basic Books. his as referentially independent of no one/actress:
work ofers an excellent multidisciplinary account of bilingualism in
(4) a. If no one is here, hes elsewhere.
general and bilingualism in the United States in particular. Among
other topics, it presents an excellent account of the linguistic dei-
b. Her calendar showed that no actress had left early.
ciency and linguistic augmentation hypotheses. A systematic class of exceptions to the c-command require-
Levelt, W. 1989. Speaking: From Intention to Articulation. Cambridge, ment is found in so-called indirect binding, for example (5), where
MA: MIT Press.
the object can be bound from within the subject (sideways):
MacSwan, Jef. 2005. Remarks on Jake, Myers-Scotton and Grosss
response: here is no Matrix Language. Language and Cognition (5) Somebody from every city likes its beaches.
8.3: 27784.
Myers-Scotton, Carol and J. Jake. 1995. Matching lemmas in a bilingual Unlike semantic binding, coreference among two NPs does not
language competence and production model: Evidence from intrasen- require c-command:
tential code switching. Linguistics 33: 9811024.
(6) His/Jacquess teacher said that he/Jacques failed.
Peal, Elizabeth, and Wallace E. Lambert. 1962. Relation of bilingualism
to intelligence. Psychological Monographs 76: 123. Yet certain prohibitions against coreference, for example, that
Ritchie, William C., and Tej K. Bhatia. 2007. Psycholinguistics. In nonrelexive pronouns in English cannot corefer with expres-
Handbook of Educational Linguistics, ed. Bernard Spolsky and Francis sions in the same inite clause, only regard NPs that c-commands
Hult, 3852. Oxford: Blackwell.
the pronoun, (7) (similarly for nonpronominal NPs):
BINDING Your mother
In quantiied logic, binding names the relation between a (7) defended you.
quantiier and one or more variables, for example, x and x *You
in x[P(x) Q(x)]. In linguistics, the term has been used in at
Likewise, relexive pronouns in English need an antecedent
least three domains: irst, for the relation between quantiied
that is not just within the same inite clause but also c-commands
expressions and pronouns that referentially depend on them,
them:
(1); second for coreference, the relation between two referring
expressions with the same referent, (2), including hypothesized She
empty pronouns, (2c); and third, in theories that assume trans-
formations, for the relation between a dislocated phrase and its (8) defended herself.
trace, (3) (see government and binding theory): *Her mother
(1) Every cat chased its tail. hese conditions on the distribution of relexive and nonre-
lexive restrict binding by quantiied nominals as well and are
(2) a. Sue hopes that she won.
indiscriminately referred to as binding conditions.
b. Edgar spoke for himself.
While c-command seems relevant in binding conditions
c. Wesley called PRO to apologize.
cross-linguistically, other aspects, such as the number of mor-
(3) a. Which book did Kim read t? phological classes (relexives, nonrelexives, etc.) or the size of
b. Antonia was promoted t. relevant structural domains, vary widely.
Semantically, only (1) and (3) are clear instances of binding Daniel Bring
(the pronouns/traces are interpreted like variables, and their
antecedents are often nonreferring), yet coreference is almost WORKS CITED AND SUGGESTIONS FOR FURTHER READING
universally subsumed under the binding label in linguistics.
Bring, Daniel. 2005. Binding heory. Cambridge Textbooks in Linguistics.
All three binding relations are frequently represented by
coindexing the binder (or antecedent) and the bound element
Chomsky, Noam. 1981. Lectures on Government and Binding. Dordrecht,
(e.g., Every cat6 chased its6 tail), though richer, asymmetrical the Netherlands: Foris Publications.
representations have been proposed and are arguably required Dalrymple, Mary. 1993. he Syntax of Anaphoric Binding. Stanford,
for semantic interpretation. CA: CSLI.
Semantic binding relations are subject to a structural con- Kuno, Susumo. 1987. Functional Syntax Anaphora, Discourse and
straint, to a irst approximation, the same as in quantiied Empathy. Chicago: Chicago University Press.
128
Biolinguistics
BIOLINGUISTICS of Richard Kaynes (2004) and Charles Yangs (2004) work on com-
petitive theories of language acquisition. (From other perspectives,
Biolinguistics is the study of the biology of language. he mod- see universals, nongenetic; absolute and statistical
ern biolinguistic program was initiated by Noam Chomsky in universals, implicational universals, and typological
the 1950s (Chomsky 2006), although it has much earlier histori- universals.)
cal antecedents (see cartesian linguistics). It investigates In addition to comparative grammar (see also morpholog-
the form and function of language, the development (ontogeny) ical typology), universals of language change, syn-
of language, and the evolution of language (phylogeny), among tactic change, semantic change, pidgins, and creoles
other topics. Biolinguists study such questions as the following: provide additional evidence for the nature of universal grammar
(1) What is knowledge of language? and language acquisition.
Moreover, the study of genetic language disorders, as well as
(2) How does knowledge of language develop in the child? familial and twin studies, has been very fruitful for the study of
(3) How does knowledge of language evolve in the species? language acquisition (see genes and language; specific
language impairment; see also the extensive literature on the
To answer the question of what knowledge of language is (1), FOXP2 gene [Marcus and Fisher 2003]). Studies of language-iso-
biolinguists have proposed various generative grammars, lated children provide information about the critical period
that is, explicit models of the faculty of language. for language learning. he study of sign languages has been
he study of generative grammars draws from a variety invaluable for investigating language outside the modality of
of areas, including syntax, semantics, the lexicon, mor- sound (see also sign language, acquisition of; sign lan-
phology, phonology, and articulatory and acoustic guages, neurobiology of). Finally, the study of linguistic
phonetics. savants has been quite useful for delineating the modularity
In addition, the biolinguist investigates the neurologi- of the language faculty as distinct from other cognitive faculties.
cal mechanisms underlying the faculty of language (see To answer the question of how knowledge of language evolves
syntax, neurobiology of; semantics, neurobiology in the species, (3), biolinguists integrate data from a variety of
of; morphology, neurobiology of; phonetics and areas, including comparative ethology (see Hauser, Chomsky,
phonology, neurobiology of). Such studies of brain and and Fitch 2002; see also animal communication and human
language include studies of expressive and receptive apha- language; speech anatomy, evolution of), comparative
sia, split brain patients, neuroimaging, and the electrical neuroanatomy, and comparative genomics. Since evolution of
activity of the brain. language took place in the distant past, mathematical modeling
he biolinguist also studies performance models (language of populations of speaker-hearers has recently attracted much
processing), including parsing, right hemisphere lan- interest in work on dynamical systems (see self-organizing
guage processing, left hemisphere language pro- systems). Such studies have proven useful not only for the study
cessing, and speech perception. of evolution but also for the study of language acquisition and
To answer the question of how knowledge of language devel- change. (For some hypotheses on origins of language, see ori-
ops in the child (2), one may visualize this as the study of the gins of language; grooming, gossip, and language.)
language acquisition device: Questions (1) to (3) might be called the what and how ques-
experience ? language (English, Japanese, etc.) tions of biolinguistics. One can also ask why the principles of
language are what they are, a deeper and more diicult question
where the box represents what the child brings to language learn- to answer. he investigation into why questions is sometimes
ing. We ask how the child maps experience (primary linguistic data) referred to as the minimalist program or minimalism. In addi-
to a particular language. It is posited that the child moves through a tion, there is the related question of how the study of language
number of states from an initial state, corresponding to the childs can be integrated with the other natural sciences, a problem that
genetic endowment, to a inal state, corresponding to a particular Chomsky calls the uniication problem (see Jenkins 2000). All
language. For each subarea discussed, biolinguistics studies the of these questions are certain to continue to fascinate investiga-
development or growth of language, often referred to as language tors of the biology of language for decades to come.
acquisition (e.g., syntax, acquisition of; semantics, acqui- (For more information on other explicit models of the lan-
sition of; and phonology, acquisition of). guage faculty, see transformational grammar; standard
he initial state may be characterized by a universal gram- theory and extended standard theory; categorial
mar, which is a set of general principles with parameters that grammar; head-driven phrase structure grammar;
are set by experience, thus accounting for the variation across lexical-functional grammar; optimality theory; role
languages. For example, there are general principles of word and reference grammar; cognitive grammar; connec-
order that permit some variation; for example, the verb precedes tionism and grammar; construction grammars).
the object (English) or the verb follows the object (Japanese) (see
x-bar theory). Such a theory is referred to as a principles Lyle Jenkins
and parameters theory. (For speciic subareas, see syntax,
universals of; semantics, universals of; morphology, WORKS CITED AND SUGGESTIONS FOR FURTHER READING
universals of; and phonology, universals of.) For some Chomsky, Noam. 2006. Biolinguistics and the human capacity. In
diferent parametric proposals, see the microparametric approach Language and Mind, 17385. Cambridge: Cambridge University Press.
129
Birdsong and Human Language
Hauser, M. D., N. Chomsky, and W. Tecumseh Fitch. 2002. he faculty parallels are what one would expect if both species rely on a simi-
of language: What is it, who has it, and how did it evolve? Science lar neural substrate for learning and using their communicative
298: 156979. systems.
Kayne, Richard. 2004. Antisymmetry and Japanese. In Relevant genetic evidence is also available. he much-dis-
Variation and Universals in Biolinguistics, ed. Lyle Jenkins, 35.
cussed FOXP2 gene is similarly expressed in the basal ganglia of
Amsterdam: Elsevier.
humans and songbirds (Teramitsu et al. 2004; Vargha-Khadem
Jenkins, Lyle. 2000. Biolinguistics: Exploring the Biology of Language.
et al. 2005). A FOXP2 mutation in humans results in deicits in
Marcus, G. F., and S. E. Fisher. 2003. FOXP2 in focus: What can genes language production and comprehension, especially aspects of
tell us about speech and language? Trends in Cognitive Sciences (morpho)syntax that involve combining and sequencing linguis-
7.6: 25762. tic units (Marcus and Hisher 2003; Vargha-Kadham et al. 2005).
Yang, Charles. 2004. Toward a theory of language growth. In One of the neurobiological efects of the mutation is a notable
Variation and Universals in Biolinguistics, ed. Lyle Jenkins, 3756. reduction in the gray matter of the striatum (Vargha-Kadham
Amsterdam: Elsevier. et al. 2005). Perhaps, then, the combinatorial aspects of human
language were enabled by the preadaptation of an anterior neu-
ral circuit that has been highly conserved over evolutionary time
BIRDSONG AND HUMAN LANGUAGE and across species, and by a genetic mutation in this circuit that
Language is often claimed to be uniquely human (see ani- increased its computational space.
mal communication and human language ). his belief Finally, some birdsong, like human language, is composi-
has discouraged eforts to identify potential animal models of tional; songbirds learn units and rules of combination (Rose et
language, even though animal models have been essential in al. 2004), although the rules of combination are obviously far
ascertaining the neurobiology of other cognitive functions. less sophisticated than those that characterize human language.
It is conceivable, however, that useful homologies or analo- A skeptic might argue that the syntax of human language is too
gies exist between human language and the communicative complex (too highly structured, too recursive, too creative; see
systems of other species, even if language is unique in some recursion, iteration, and metarepresentation ) to be
respects. modeled as a simple patterned sequence processor that relies
One particularly interesting homology might exist between on associative learning mechanisms. In fact, the explanatory
human language and birdsong. Songbirds rely on a special- burden placed on rule-based, recursive syntax has diminished
ized frontal lobebasal ganglia loop to learn, produce, over recent decades. Modern grammars tend to be lexicalist
and perceive birdsong (Brenowitz and Beecher 2005) (see also in nature; that is, much of the knowledge relevant to sentence
brocas area). Disruptions to this circuit disrupt the sensorim- structure is stored in the lexicon with individual words, rather
otor learning needed to acquire song, and also the sequencing than being computed by abstract phrase structure rules (see
skills needed to produce and properly perceive it. Recent work lexical-functional grammar ). Recursion, while clearly
has revealed a remarkable homology in this circuit between a characteristic of human language, is much more limited in
birds and mammals (Doupe et al. 2005). he homologous circuit actual language usage than would be predicted given the stan-
in human and nonhuman primates involves loops connecting dard model. And, because conceptual knowledge (see seman-
many regions in the frontal cortex to the basal ganglia. Aferents tics) has its own structure (Jackendof 1990), it seems plausible
from the frontal cortex densely innervate the striatum of the basal that some of the burden for structuring the input rests with the
ganglia, which also receives inputs from many other areas of the conceptual stream (Jackendof 2002), rather than entirely with
cortex. he striatum seems to control behavioral sequencing in the syntax.
many species (Aldridge and Berridge 1998). Spiny neurons, the Birds and humans are fundamentally diferent in many ways,
principal cells of the striatum, have properties that make them as are their systems of communication. Nonetheless, birds and
ideal for recognizing patterned sequences across time (Beiser, humans are two of only a handful of vocal learners, and recent
Hua, and Houk 1997). Damage to this loop in primates produces work points to communication-relevant homologies and similar-
problems with motor and cognitive skills that require plan- ities. It is not unreasonable to think that a comparative approach
ning and manipulating patterns of sequences over time (Fuster might provide important clues to how language evolved and,
1995). perhaps, to the nature of language itself.
hese observations lend plausibility to the notion that the
Lee Osterhout
frontal cortexbasal ganglia circuit might play a role in the syn-
tax of human language. If so, then it is probably not coincidental WORK CITED AND SUGGESTIONS FOR FURTHER READING
that the acquisition of human language and birdsong have com-
pelling parallels (Doupe and Kuhl 1999). Humans and songbirds Aldridge, J. Wayne, and Kent C. Berridge. 1998. Coding serial order
by neostriatal neurons: A natural action approach to movement
learn their complex, sequenced vocalizations in early life. hey
sequence. Journal of Neuroscience 18: 277787.
similarly internalize sensory experience and use it to shape vocal
Beiser, David G., Sherwin S. Hua, and James C. Houk. 1997. Network
outputs, by means of sensorimotor learning and integration.
models of the basal ganglia. Current Opinion in Neurobiology
hey show similar innate dispositions for learning the correct 7: 18590.
sounds and sequences; as a result, humans and some species Brenowitz, Eliot, and Michael D. Beecher. 2005. Song learning in
of songbird have similar critical periods for vocal learning, birds: Diversity and plasticity, opportunities and challenges. Trends
with a much greater ability to learn early in life. hese behavioral in Neurosciences 28: 12732.
130
Blended Space Blindness and Language
Doupe, Allison J., and Patricia Kuhl. 1999. Birdsong and human constructs in the conceptual blending framework. Some research-
speech: Common themes and mechanisms. Annual Review of ers use blended space and blend interchangeably to refer to the
Neuroscience 22: 567631. particular kind of mental space described here (e.g., Fauconnier
Doupe, Allison J., David J. Perkel, Anton Reiner, and Edward A. Stern. and Turner 1994). Elsewhere blend is used to describe the entire
2005. Birdbrains could teach basal ganglia research a new song.
integration network, as in double-scope blend (e.g., Nez 2005),
Trends in Neurosciences 28: 35363.
or the process of generating such a network, as in running the
Fuster, Joaquin M. 1995. Memory in the Cerebral Cortex: An Empirical
Approach to Neural Networks in the Human and Nonhuman Primate.
blend (e.g., Fauconnier and Turner 2002, 48). Where the use may
Cambridge, MA: MIT Press. be ambiguous, blended space provides maximal clarity.
Jackendof, Ray. 1990. Semantic Structures. Cambridge, MA: MIT Press. Vera Tobin
. 2002. Foundations of Language: Brain, Meaning, Grammar,
Evolution. New York: Oxford University Press.
Lieberman, Philip. 2000. Human Language and Our Reptilian Brain.
Cambridge: Harvard University Press. Fauconnier, Gilles, and Mark Turner. 1994. Conceptual projection and
Marcus, Gary F., and Simon E. Fisher. 2003. FOXP2 in focus: What can middle spaces. UCSD Department of Cognitive Science Technical
genes tell us about speech and language? Trends in Cognitive Sciences Report 9401.
7: 25762. . 1998. Principles of conceptual integration. In Discourse
Rose, Gary, Franz Goller, Howard J. Gritton, Stephanie L. Plamondon, and Cognition, ed. Jean-Pierre Koenig, 26983. Stanford, CA: CSLI
Alexander T. Baugh, and Brendon G. Cooper. 2004. Species-typical Publications.
songs in white-crowned sparrows tutored with only phrase pairs. . 2002. he Way We hink: Conceptual Blending and the Minds
Nature 432: 7538. Hidden Complexities. New York: Basic Books.
Teramitsu, Ikuku, Lili C. Kudo, Sarah E. London, Daniel H. Geschwind, Grady, Joseph, Todd Oakley, and Seana Coulson. 1999. Conceptual
and Stephanie A. White. 2004. Parallel FOXP1 and FOXP2 expres- blending and metaphor. In Metaphor in Cognitive Linguistics, ed.
sion in songbirds and human brain predicts functional interaction. Raymond W. Gibbs, Jr., and Gerard J. Steen, 10124. Amsterdam and
Journal of Neuroscience 24: 315263. Philadelphia: John Benjamins.
Vargha-Khadem, Faraneh., David G. Gadian, Andrew Copp, and Nez, Rafael E. 2005. Creating mathematical ininities: Metaphor,
Mortimer Mishkin. 2005. FOXP2 and the neuroanatomy of speech blending, and the beauty of transinite cardinals. Journal of Pragmatics
and language. Nature Reviews: Neuroscience 6: 1318. 37: 171741.
BLENDED SPACE BLINDNESS AND LANGUAGE

A blended space is one element of the model of meaning con- Reading by Touch
struction proposed by conceptual blending theory. In this Blind people achieve literacy by reading braille, a tactile coding
framework, mental representations are organized in small, self- system for reading and writing. Coding is based on raised dots
contained conceptual packets (Fauconnier and Turner 2002, arranged in rectangular cells that consist of paired columns of
40) called mental spaces, which interconnect to form complex three dots each. Patterns of one or more dots represent letters,
conceptual networks. In a conceptual integration network, or numbers, punctuation marks, or partial and whole word con-
blend, some mental spaces serve as input spaces that contrib- tractions (Figure 1). Initially, braille was coded for the Latin
ute elements to a new, blended mental space (Fauconnier and alphabets of French or English. For languages with non-Latin
Turner 1994, 1998, 2002). alphabets, braille patterns are assigned according to a translit-
he minimal conceptual integration network connects four eration of the Latin alphabet. For example, the third Greek letter
mental spaces: two inputs, a generic space that contains all the gamma has the dot pattern for the third Latin letter c. Chinese
structures that the inputs seem to share, and a blended space. he and other Asian languages use phonetic adaptations of braille.
conventional illustration of this prototypical network, sometimes Chinese braille codes syllables into one, two, or three pat-
called the Basic Diagram (Fauconnier and Turner 2002, 467), terns for, respectively, an initial consonant sound, a inal vowel
shows four circles marking the points of a diamond, with the sound, and a word tone. here are no braille patterns for indi-
circle representing the generic space at the top and the blended vidual Chinese ideograms. Japanese orthography is more com-
space at the bottom. However, this four-space model is only the plex, as it includes a combination of Kanji (ideograms imported
minimal version of the integration network; in conceptual blend- from China), Kana (phonograms), Western alphabet, and Arabic
ing theory, networks can contain any number of input spaces. numerals. Kanji is converted to Kana irst before translation to
Blended spaces can also serve as inputs to new blends, making braille. While alphabet represents a single sound, Kana repre-
elaborate megablends (Fauconnier and Turner 2002, 1513). sents a syllable (a consonant and a vowel).
What makes a blended space special is that it contains newly Standard braille in English and many European languages is
emergent structure that does not come directly from any of the usually read in a contracted form (Grade II) in which selected
inputs. For example, understanding his surgeon is a butcher single patterns signify commonly used words, part-words, or syl-
involves selective projection from inputs of butchery and sur- lables. Hence, many words require only one, two, or three braille
gery, but the inference that the surgeon is incompetent arises cells and spaces, which reduce reading efort and space for text.
only in the blended space. he same braille pattern can represent a letter or a contraction,
here is some potential for confusion regarding the terminol- depending on context, thus expanding 63 to 256 interpretable
ogy used to distinguish blended spaces from other theoretical dot patterns in Grade II English braille. Although all alphabet-
131
Blindness and Language
Figure 1. American standard braille cell patterns for

alphabet, punctuation marks, some contractions, and
whole words.
based languages use the same braille patterns, associated con- Braillists generally prefer bimanual reading (Davidson,
tractions vary. hus, multilingual reading requires the learning Appelle, and Haber 1992), with each hand conveying diferent
of language-unique contractions. information. While one hand reads, the second marks spatial
During reading, scanning movement across text evokes position in the text (e.g., lines, locations within lines, spaces
intermittent mechanical stimulation from contacting succes- between braille cells or words). Photographic records reveal skin
sive braille cells, which activates most low-threshold cutaneous compression of only one ingertip even during tandem move-
mechanoreceptors found in the ingertip (Johnson and Lamb ments across text; there is no coincident reading of diferent
1981). A spatial-temporal transformation of the evoked periph- braille cells by multiple ingers (Millar 1997, 337). Text and spa-
eral activity indicates an isomorphic reproduction of braille tial layout are tracked simultaneously in bimanual reading; there
cell shapes across a population of mechanoreceptors (Phillips, is no best hand (Millar 1984). Some individuals read an initial
Johansson, and Johnson 1990). hrough connecting sensory line segment with the left and a inal segment with the right hand
pathways, these physiological representations of braille cell (Bertelson, Mousty, and DAlimonte 1985). Despite bimanual
shape are conveyed to primary somatosensory cortex (Phillips, reading, the left hemisphere is generally dominant for lan-
Johnson, and Hsaio 1988) in the parietal lobe. Despite the guage even in left-handed braillists (Burton et al. 2002a).
expected isomorphic representation of braille cell shapes in
somatosensory cortex, we do not know whether tactile reading in Visual Cortex Contribution to Language
fact relies on holistically discriminating shape. Blindness requires numerous adjustments, especially for language.
Braille cell patterns also difer in the density of dot-gaps, hese adjustments appear to involve substantial reorganization of
which is perceived as variations in texture (Millar 1985). hese the visual cortex (occipital lobe), which in sighted people is
texture changes produce a dynamically shifting lateral mechani- dominated by visual stimulation. In blind people, the visual cor-
cal shearing across the ingertip as it moves over braille text in tex responds more readily to nonvisual stimulation and especially
luent reading. Good braille readers attend to these temporally contributes to language processing. A clinical case study of a con-
extended stimulation patterns, as opposed to global-holistic genitally blind, highly luent braille reader is particularly salient.
spatial shapes (Millar 1997, 337). In addition, top-down linguis- Following a bilateral posterior occipital ischemic stroke, she lost
tic content drives perceptual processing in skillful readers, for the ability to read braille (Hamilton et al. 2000). However, auditory
whom the physical attributes of the text are subservient to lexical and spoken language were unimpaired, and she retained normal
content. In other words, they do not puzzle out words, letter by tactile sensations on her braille reading hand despite a destroyed
letter; instead, they recognize them due in part to their physical visual cortex. A similar but transient disruption in tactile reading
properties but also to semantic context, the familiarity of words occurs in congenitally blind people following repetitive transcra-
stored in their mental lexicon, and so on. Less accomplished nial magnetic stimulation (rTMS) to occipital cortex (Hamilton
readers trace shape by making more disjointed trapezoidal in- and Pascual-Leone 1998; Pascual-Leone et al. 2005).
ger movements over individual cells, a strategy that luent read- he obvious explanation for these observations is that the
ers utilize when asked to identify particular letters, which is a visual cortex reorganizes after blindness. But things are more
shape-based task (Millar 1997, 337). complex. First, occipital cortex normally processes some tactile
132
Blindness and Language
information in sighted people, especially following short peri- Neuroplasticity has been observed in visual cortex of blind
ods of visual deprivation. hus, blindfolding sighted people for individuals at all ages of blindness onset. Such observations
ive days, during which they train to discriminate braille letters, garner surprise only when primary sensory areas are viewed as
leads to visual cortex activity to tactile stimulation and sensitiv- unimodal processors that funnel computations into a cascade of
ity to disrupting braille letter discrimination by occipital rTMS cortical areas, including multisensory regions. Activation of reor-
(Pascual-Leone and Hamilton 2001). Even without visual depri- ganized visual cortex by nonvisual stimulation most parsimoni-
vation, occipital rTMS impairs macrogeometric judgments of ously relects innate intracortical connections between cortical
raised-dot spacing in sighted people (Merabet et al. 2004). hese areas that normally exhibit nonvisual and multisensory respon-
indings indicate that visual cortex normally contributes to some siveness in sighted people. he demanding conditions of blind-
tactile discrimination. ness possibly alter and expand the activity in these connections
Brain-imaging studies have dramatically revealed the role of and thereby reallocate visual cortex to language processing.
visual cortex in language for blind people. For example, gener-
Harold Burton
ating a verb to an ofered noun activates visual cortex in blind
people (see Color Plate 2), irrespective of whether the noun is WORKS CITED AND SUGGESTIONS FOR FURTHER READING
read through braille (Burton et al. 2002a) or heard (Burton et al.
2002b). In early blind individuals, verb generation engages both Amedi, A., N. Raz, P. Pianka, R. Malach, and E. Zohary. 2003. Early
visual cortex activation correlates with superior verbal memory per-
lower-tier (e.g., V1, V2, VP) and higher-tier (e.g., V7, V8, MT)
formance in the blind. Nature Neuroscience 6: 75866.
visual areas (Color Plate 2). Similar adaptations occur in late
Amedi, A., A. Floel, S. Knecht, E. Zohary, and L. G. Cohen. 2004.
blind individuals, though fewer areas are afected. he semantic Transcranial magnetic stimulation of the occipital pole interferes with
task of discovering a common meaning for a list of heard words verbal processing in blind subjects. Nature Neuroscience 7: 126670.
also evokes extensive visual cortex activation in the early blind Bertelson, P., P. Mousty, and G. DAlimonte. 1985. A study of braille
and a smaller distribution in the late blind (Color Plate 2). Similar reading: 2. Patterns of hand activity in one-handed and two-handed
distributions of visual cortex activity occur when the early blind reading. Quarterly Journal of Experimental Psychology A: Human
listen to sentences with increased semantic and syntactic com- Experimental Psychology 37: 23556.
plexity (Rder et al. 2002) and during a semantic retrieval task Burton, H., J. B. Diamond, and K. B. McDermott. 2003. Dissociating cor-
(Noppeney, Friston, and Price 2003). A phonological task of tical regions activated by semantic and phonological tasks to heard
identifying a common rhyme for heard words activates nearly all words: A fMRI study in blind and sighted individuals. Journal of
Neurophysiology 90: 196582.
visual areas bilaterally in early blind but few in late blind people
Burton, H., D. G. McLaren, and R. J. Sinclair. 2006. Reading embossed
(Color Plate 2). he sublexical task of identifying block capital
capital letters: A fMRI study in blind and sighted individuals. Human
letters translated passively across a ingertip activates only parts Brain Mapping 27: 32539.
of visual cortex in early and late blind people (Color Plate 2). Burton, H., A. Z. Snyder, T. E. Conturo, E. Akbudak, J. M. Ollinger, and
In general, semantic language tasks activate a greater extent M. E. Raichle. 2002a. Adaptive changes in early and late blind: A fMRI
of visual cortex than lower-level language and most perceptual study of braille reading. Journal of Neurophysiology 87: 589611.
tactile or auditory processing tasks. he functional relevance of Burton, H., A. Z. Snyder, J. Diamond, and M. E. Raichle. 2002b. Adaptive
occipital cortex to semantic processing is demonstrated when changes in early and late blind: A fMRI study of verb generation to
rTMS over left visual cortex transiently increases errors in verb heard nouns. Journal of Neurophysiology 88: 335971.
generation to heard nouns in the early blind without interrupting Davidson, P. W., S. Appelle, and R. N. Haber. 1992. Haptic scanning of
the articulation of words (Amedi et al. 2004). Of course, perform- braille cells by low- and high-proiciency blind readers. Research in
Developmental Disabilities 13: 99111.
ing any semantic task depends on retrieving word associations.
Hamilton, R., J. P. Keenan, M. Catala, and A. Pascual-Leone. 2000. Alexia
hus, two studies report a stronger relationship between reten-
for braille following bilateral occipital stroke in an early blind woman.
tion performance and visual cortex activity, predominantly in Neuroreport 11: 23740.
V1, than with verb generation. V1 response magnitudes corre- Hamilton, R., and A. Pascual-Leone. 1998. Cortical plasticity associated
late more positively with verbal retention (Amedi et al. 2003) and with braille learning. Trends in Cognitive Sciences 2: 16874.
the accuracy of long-term episodic memory (Raz, Amedi, and Johnson, K. O., and G. D. Lamb. 1981. Neural mechanisms of spatial tac-
Zohary 2005) in congenitally/early blind participants. tile discrimination: Neural patterns evoked by braille-like dot patterns
As in sighted people, blind individuals still utilize traditional in the monkey. Journal of Physiology (London) 310: 11744.
left-lateralized frontal, temporal and parietal language areas Merabet, L., G. hut, B. Murray, J. Andrews, S. Hsiao, and A. Pascual-
(Burton et al. 2002a, 2002b; Burton, Diamond, and McDermott Leone. 2004. Feeling by sight or seeing by touch? Neuron 42: 1739.
Millar, S. 1984. Is there a best hand for braille? Cortex 20: 7587.
2003; Noppeney, Friston, and Price 2003; Rder et al. 2002).
. 1985. he perception of complex patterns by touch. Perception
hus, the visual cortex activity represents an addition to the cor-
14: 293303.
tical language-processing areas. Visual cortex activity distributes
. 1997. Reading by Touch. London: Routledge.
bilaterally in all blind people for semantic tasks. However, left Noppeney, U., K. J. Friston, and C. J. Price. 2003. Efects of visual depriva-
visual cortex is more active in early blind individuals (Color Plate tion on the organization of the semantic system. Brain 126: 16207.
2). In contrast, right hemisphere responses predominate in Pascual-Leone, A., A. Amedi, F. Fregni, and L. B. Merabet. 2005.
late blind participants when they read braille with the right hand he plastic human brain cortex. Annual Review of Neuroscience
but are more symmetrically bilateral when verbs are generated 28: 377401.
to heard nouns (Color Plate 2). It is currently unknown whether Pascual-Leone, A., and R. Hamilton. 2001. he metamodal organization
reorganized visual cortex contains speciic language domains. of the brain. Progress in Brain Research 134: 42745.
133
Bounding
Phillips, J., R. Johansson, and K. Johnson. 1990. Representation of (3) *[Handsomei though [S I believe [NP the claim that [S Dick is ti]]],
braille characters in human nerve ibers. Experimental Brain Research Im still going to marry Herman.
81: 58992.
Phillips, J. R., K. O. Johnson, and S. S. Hsiao. 1988. Spatial pattern rep- (4) [Handsomei though [S I believe that [S Dick is ti]], Im still going
resentation and transformation in monkey somatosensory cortex. to marry Herman.
Proceedings of the National Academcy of Sciences (USA) 85: 131721.
To correct this undesirable efect of subjacency, Chomsky
Raz, N., A. Amedi, and E. Zohary, E. 2005. V1 activation in congenitally
hypothesized that long-distance movement proceeds in short
blind humans is associated with episodic retrieval. Cerebral Cortex
15: 145968. steps, passing through successive cycles. In particular, he postu-
Rder, B., O. Stock, S. Bien, H. Neville, and F. Rosler. 2002. Speech pro- lated that movement can stop by at the edge of the clause (Sor
cessing activates visual cortex in congenitally blind humans. European COMP; the modern complementizer phrase [CP] area). In other
Journal of Neuroscience 16: 9306. words, instead of moving long distance in one fell swoop, move-
Van Essen, D. C. 2004. Organization of visual areas in macaque and ment irst targets the closest clausal edge and from there pro-
human cerebral cortex. In he Visual Neurosciences, ed. L. Chalupa ceeds from clausal edge to clausal edge, typically crossing only
and J. S. Werner, 50721. Cambridge, MA: MIT Press. one S(/IP)-node at a time:
(5) [Handsomei though [S I believe [S ti that [S Dick is ti]]], Im still

BOUNDING going to marry Herman.
For all its modernity and insights into the fundamental workings Successive cyclicity may at irst seem like a patch, an exemption
of language,Noam Chomskys early writings (1955, 1957) con- granted to ix a bad problem (without it, the theory would wrongly
tain a curious gap: hey do not contain any explicit discussion of rule out acceptable constructions). But subsequent research has
locality. One does not even ind extensive discussion of the fact uncovered a wealth of data, reviewed in Boeckx (2007), that con-
that movement appears to be potentially unbounded. his gap is verge and lend credence to the successive cyclic movement
all the more curious from our current perspective where locality hypothesis, making it one of the great success stories of modern
and long-distance dependencies are arguably the major area of generative grammar. It appears to be the case that long-dis-
study in theoretical syntax. tance, unbounded dependencies are the result of the conjunc-
We owe our modern interest in locality to John R. Rosss tion of small, strictly bounded steps.
([1967] 1986) seminal work in which the concept of island was Currently, our most principled explanation for the phenome-
introduced. Rosss thesis is full of examples of long-distance non of successive cyclicity is that it is the result of some economy
dependencies like (1a and b). condition that requires movement steps to be kept as short as
(1) a. Handsome though Dick is, Im still going to marry possible (see Chomsky and Lasnik 1993; Takahashi 1994; Boeckx
Herman. 2003; Bokovi 2002).
b. Handsome though everyone expects me to try to force Bill As eljko Bokovi (1994) originally observed, some addi-
to make Mom agree that Dick is, Im still going to marry tional condition is needed to prevent this economy condition
Herman. requiring movement steps to be kept as short as possible from
forcing an element that has taken its irst movement step to be
Ross systematically investigated the fact that seemingly minute stuck in creating ininitesimally short steps. Put diferently, some
manipulations dramatically afected the acceptability of sen- condition is needed to prevent chain links from being too short.
tences. Witness (2a and b). he idea that movement that is too short or superluous
(2) a. Handsome though I believe that Dick is, Im still going to ought to be banned has been appealed to in a variety of works
marry Herman. in recent years, under the rubric of anti-locality (see Grohmann
b. *Handsome though I believe the claim that Dick is, Im still 2003 for the most systematic investigation of anti-locality; see
going to marry Herman. also Boeckx 2007 and references therein.)
he anti-locality hypothesis is very desirable conceptually. It
Rosss thesis contains a list of contexts, technically known as places a lower bound on movement, as Chomskys subjacency
islands, which disallow certain types of dependencies. condition places an upper bound on movement. Since it blocks
Chomsky (1973) set out to investigate what the various vacuous movement, it is also an economy condition (dont do
domains identiied by Ross as islands have in common. hus anything that is not necessary), on a par with the underlying
began the modern study of locality and, in many ways, the nature force behind subjacency.
of current linguistic theorizing. We thus arrive at a beautifully symmetric situation, of the kind
Chomskys central insight in 1973 is that movement is subject promoted by the recently formulated minimalist program for
to the subjacency condition, a condition that forbids move- linguistic theory (Chomsky 1995 and Boeckx 2006, among oth-
ment from being too long. Speciically, his notion of subjacency ers): Long-distance dependencies, which are pervasive in natural
prevented movement from crossing two bounding nodes. For languages, are not taken in one fell swoop. he patterns observed
Chomsky, the bounding nodes were the top clausal node (S for in the data result from the conjunction of two economy condi-
sentence; our modern inlectional phrase [IP]) and NP (noun tions: movement must be kept as short as possible, but not too
phrase; our modern DP [determiner phrase]). he condition cor- short.
rectly captured the unacceptability of (2b), but wrongly predicted
(2a) to be out (see [3] and [4]). Cedric Boeckx
134
Brain and Language
WORKS CITED AND SUGGESTIONS FOR FURTHER READING for language that predicted another type of aphasia that could
Boeckx, Cedric. 2003. Islands and Chains. Amsterdam: John be, and was, found: conduction aphasia. In conduction apha-
Benjamins. sia, it was not centers but connections between them that were
. 2006. Linguistic Minimalism: Origins, Methods, Concepts, and impaired: Neither brocas area of the brain nor wernickes
Aims. Oxford: Oxford University Press. area was itself damaged, but the link between them was; the pro-
. 2007. Understanding Minimalist Syntax: Lessons from Locality in duction of speech and comprehension of it were not impaired,
Long-Distance Dependencies. Oxford: Blackwell. but, rather, repetition of auditory input became problematic.
Bokovi, eljko. 1994. D-structure, -criterion, and movement into he model postulated by Wernicke in his paper showed cen-
-positions. Linguistic Analysis 24: 24786. ters for speech, comprehension, and ideas overlaid on the image
. 2002. A-movement and the EPP. Syntax 5: 167218.
of a right [sic] hemisphere, and his colleague Ludwig Lichtheim
Chomsky, Noam. 1955. he logical structure of linguistic theory.
abstracted this localizationist model away from a picture of under-
Manuscript, Harvard/MIT. Published in part in 1975 by Plenum, New
lying brain, expanding it to include reference to centers for reading
York.
. 1957. Syntactic Structures. he Hague: Mouton. and writing. In England, John Hughlings Jackson took exception to
. 1973. Conditions on transformations. In A Festschrift for Morris this localizationist/connectionist approach, taking the holist posi-
Halle, ed. S. Anderson and P. Kiparsky, 23286. New York: Holt, tion. He pointed out that even in patients with extensive damage to
Rinehart, and Winston. the dominant hemisphere, some language remained (e.g., a sub-
. 1995. he Minimalist Program. Cambridge, MA: MIT Press. set of emotional words, often curse words, as had been the case
Chomsky, Noam, and Howard Lasnik. 1993. Principles and param- with Brocas irst patient), suggesting, by way of the subtractionist
eters theory. In Syntax: An International Handbook of Contemporary logic these researchers employed, that the nondominant hemi-
Research, ed. J. Jacobs, A. von Stechow, W. Sternefeld, and T. sphere also participated in language in the healthy individual.
Vennemann, 50669. Berlin: de Gruyter.
In France, the debate between those who believed in multiple
Grohmann, Kleanthes K. 2003. Proliic Domains. Amsterdam: John
types of aphasia, each associated with brain damage in a difer-
Benjamins.
ent area, and those who believed in a unitary aphasia associated
Ross, John R. [1967] 1986. Constraints on variables in syntax. Ph.D.
diss., MIT. Published as Ininite Syntax! Norwood, NJ: Ablex. with a single location continued in a series of debates (analyzed in
Takahashi, Daiko. 1994. Minimality of movement. Ph.D. diss., English by Lecours et al. 1992). Neuropathologist Auguste Djrine
University of Connecticut. and her neurologist husband Jules led proponents for the multi-
ple-connected-centers position, whereas Pierre Marie argued for
a unitarist one. He asserted that what we now call Maries quad-
BRAIN AND LANGUAGE rilateral, a region near the insula that has only recently been seri-
he brains of humans have developed to control our articulators ously implicated in language again, was the seat of all language.
and our sensory systems in ways that permit human language. In addition to discussions of localization, bilingualism
Our knowledge of how the brain subserves language was cur- earned a place in explanations of neuropsychological phenom-
sory in the early millennia of recorded history; in the late 1800s, ena among students of the neurologist Jean-Martin Charcot in
developments in neurology in Europe gave us the tools to form the later nineteenth century, as Sigmund Freud, in his 1891 book
a more precise understanding of how brains support language. On Aphasia, and Albert Pitres, in his 1895 article on bilingual
Subsequent advances in neurolinguistic knowledge arose when aphasia, respectively championed the irst-learned versus the
clinician-scientists abstracted patterns and other researchers best-known language in predicting patterns of diferential recov-
developed technical tools (of neuropathology, linguistics, psy- ery from aphasia.
chology, psycholinguistics, and brain localization via imaging) Into the early twentieth century, European neurologists con-
that permitted both groups to understand the complexity of tinued developing their careful clinical examination of patients,
brain-behavior relationships at ever-iner-grained levels. which they then followed with an examination of their brains,
via advanced staining techniques, post mortem. In 1906, the
German neurologist Alois Alzheimer isolated a type of disease
Clinical Observation: The Behavioral Sequelae of Brain
among those theretofore housed in psychiatric institutions
Damage and the Brain Structures Underlying Them
when he discovered distinctive cellular changes in speciic
As neurology developed in France, psychiatric patients were dis-
levels of cortex associated with what we now call Alzheimers
tinguished during their lives from aphasics (those with language
dementia. His extended descriptions of the communication
problems but not emotional ones), and post mortem dissection
problems of his patients are models of the careful observation
and advances in staining techniques permitted localization of
of the semantic and conversational breakdown associated with
the brain areas that could be linked to the language behaviors
this disease.
recorded prior to the patients death. he developing under-
standing of French neurologist Paul Broca that not only the
frontal lobe (1861) but also the dominant, left, hemisphere Rehabilitation
(1865) was linked to articulated language was extended by he next major step forward in neurolinguistics lay in developing
German neurologist Carl Wernicke (1874). Wernicke suggested rehabilitation techniques for those with language impairment.
an additional region, farther back in the brain, that was respon- One important group had been identiied by the ophthalmolo-
sible not for articulated speech but, rather, for comprehension of gist James Hinshelwood (1902), who described the case of a child
it. In his paper, moreover, Wernicke proposed a model of centers who had particular diiculty learning to read despite normal
135
Brain and Language
intelligence and vision. he American neurologist Samuel Orton, to the brain hemisphere opposite a given ear (the contralateral
who examined an increasing number of such children through one), participants are better able to recall more words from the
referrals, published his 1937 book Reading, Writing and Speech ear contralateral to their language-dominant hemisphere, that
Problems in Children and Selected Papers [sic], classifying their is, the right ear for language stimuli.
problems. He worked with Anna Gillingham to develop a mul- Tachistoscopic presentation permitted a visual analogue to
tisensory, systematic, structured system for training which, like dichotic listening: When visual information is lashed so that it is
others that have been derived from it, enables children whose visible only to a single visual ield, that information is processed irst
brains do not naturally pick up reading today termed dyslex- by the brain hemisphere contralateral to the visual ield. he eyes
ics or developmental dyslexics (see dyslexia) to learn to do so. cannot turn quickly enough to see the stimulus in the central visual
he recognition that dyslexics might have normal (or better than ield, which projects to both hemispheres. hus written language,
normal) intelligence but have substantial diiculty learning to but not non-language visual information such as pictures, is pro-
read conirmed a second point that Wernicke had included in his cessed faster and better by the language-dominant hemisphere.
1874 article, that language and intelligence were dissociable. It From such laterality studies we came to understand the dom-
may also be seen as the earliest vision of individual diferences in inant importance of the left hemisphere for processing auditory
brain organization that went beyond the donation of their brains and written language for most humans, and the link between this
by the Paris Anthropology Society to determine whose was bigger lateralized brain dominance and handedness. For a period, such
and phrenologys assertion that difering sizes of cortical regions, techniques were used as well to determine if bilinguals brains
as evidenced by diferences in skull coniguration, explained per- employed relatively more right hemisphere in language pro-
sonality diferences. cessing than monolinguals did, following up the suggestions of
his focus on rehabilitation resulted in the initiation of the a number of researchers that bilingualism might be more bilat-
ield of speech therapy, today speech-language pathology in North erally organized, or that early less-proicient language abilities
America and logopedics elsewhere. he seminal work of Hildred might rely relatively more on right hemisphere contributions.
Schuell, James Jenkins, and E. Jimenez-Pabon (1964) classiied hey hypothesized this possibility because they thought they
the language disorders resulting from injury to the brain in adult- saw a disproportionately large number of instances of crossed
hood according to the primary impairment of either comprehen- aphasia, that is, aphasia resulting from right hemisphere dam-
sion or production. A more holist approach developed alongside age rather than the more usual left hemisphere damage. Today
this one, that of Bruce Porch, whose system of classiication it appears, instead, that crossed aphasia is no more frequent
showed a set of language abilities clustering together. among bilinguals than among monolinguals.
During this same midcentury period, behavioral neurologist
Lateral Dominance Norman Geschwind and neuropathologist Marjorie LeMay under-
In the 1950s and 1960s, American psychology was also developing took post mortem studies of sizable numbers of brains, demonstrat-
more rigorous methods of studying behavior, and brain, though ing that the apparent symmetry of the hemispheres is misleading.
not necessarily linking them yet. Brocas late-nineteenth-century Rather, they demonstrated precisely that the cortical region around
observation that aphasia tended to arise primarily from damage the Sylvian issure (the perisylvian region; see Figure 1) that was
to the left hemisphere of the brain, rather than the right, took on understood to be crucial for language difered markedly, with a
a new life as the techniques of dichotic listening and tachisto- steeper rise of the Sylvian issure in most right hemispheres cor-
scopic presentation evolved to study lateral dominance in non- responding to more temporal lobe cortex available for language on
brain-damaged individuals. In dichotic listening, series of three the left. (In a small percentage of brains, presumably those from
or so pairs of words are presented simultaneously, one to each left-handers for the most part, the two hemispheres were indeed
ear, and participants are asked to recall as many words as they identical in this regard; in another small set the cortical asymme-
can. Because the primary connections between ear and brain are tries were reversed.)
Figure 1. Brain and language (cortical structures).
136
Brain and Language
Lenticular nucleus
(putamen and globus pallidus)
Figure 2. Schematic representation of the chief gangli-

BASAL GANGLIA
onic categories. Adapted from Gray, Henry. Anatomy
OF FORK-BRAIN of the Human Body. Edited by Warren H. Lewis.
Philadelphia: Lea & Febiger, 1918.
The Return of Localization more precision has become available to distinguish areas of the
Geschwind brought a localizationist approach back to aphasiology brain that are damaged when language is impaired. In addition,
in his work at the Boston Veterans Administration Hospital during as functional tools are developed (e.g., functional MRI, also called
the mid-1960s. He anonymously translated Wernickes 1874 article fMRI), it is no longer necessary to rely on the traditional neuro-
into English, and himself published a seminal work (1965) on dis- linguistic premise (if area X is damaged and language problem Y
connection syndromes, reminding readers of the particular pair of results, then X must be where Y is localized, or, at least, X is crucial
brain-damaged sites required for alexia without agraphia, that is, a for normal performance of Y). It is interesting to note that these
diiculty in reading but not in rewriting resulting from brain dam- advances in technology point to a number of regions outside the
age in adults who had previously been literate. With his colleagues traditional perisylvian language area of cortex that appear linked
at the Aphasia Research Center of the Boston VA Hospital, Edith to language areas such as prefrontal cortex, the supplementary
Kaplan and Harold Goodglass, he developed this approach into motor area, and the like. J. Sidtis (2007) points out the logical prob-
the aphasia classiication schema behind the Boston Diagnostic lem that arises as one would want to reconcile the aphasia data,
Aphasia Exam (Goodglass and Kaplan, 1972), which includes cat- which suggest a relatively delimited area of the dominant hemi-
egories for Wernickes and Brocas aphasias, as well as conduction sphere that serves language, with the imaging data that indicate
aphasia, anomia, and the transcortical aphasias. (his test and areas beyond the perisylvian region in the dominant hemisphere,
classiication system is quite similar to that of their student Andrew as well as many subcortical and non-dominant-hemisphere
Kertesz, the Western Aphasia Battery). In the 1970s, aphasia grand regions (usually, counterparts to those of the dominant hemi-
rounds at the Boston VA Hospital were structured as a localization- sphere) that appear to be involved in language processing.
ist quiz: Neurologists, neuropsychologists, and speech-language
pathologists who had tested a patient would irst present their ind- Linguistic Phenomena Driving Neurolinguistic Study
ings; then Geschwind or Goodglass would test the patient in the Alongside developments in tools for measuring brain regions
front of the room. After the patient had left, those gathered would involved in language during the last quarter of the twentieth
guess what the angiography and, later, CT scan results would dem- century and the beginning of the twenty-irst, developments in
onstrate; then Naeser would report the lesion location. linguistics both within and beyond Chomskys (itself protean)
Perhaps precisely because CT scans permitted inspection of school have permitted reinements of the questions asked in
brain damage beneath the cortex, an understanding of the sub- neurolinguistics and, thus, the answers received.
cortical aphasias was developed from the linking of patients who Sheila Blumstein opened up the study of phonology within
were neither luent (as Wernickes aphasics are) nor nonluent aphasiology, researching, irst, the regression hypothesis of her
(as Brocas aphasics are in that the speech they produce con- mentor Roman Jakobson and demonstrating, consistent with
sists largely of substantives, with functor grammatical words and the literature on speech errors in non-brain-damaged indi-
aixes omitted or substituted for). Early work distinguished the viduals, that errors tended to difer from their targets by fewer,
cortical aphasias from the subcortical ones (e.g., Alexander and rather than more, distinctive features. She and her students have
Naeser 1988), while more recent work has distinguished apha- studied suprasegmental phonology as well as segmental pho-
sias in ever more discrete subcortical regions (e.g., thalamus nology, demonstrating diferences in intonation patterns used
and globus pallidus; see basal ganglia and Figure 2). exclusively for languages from those that are not. Indeed, other
With further advances in neuroimaging technology such as speech scientists have turned to tone languages, such as hai
magnetic resonance imaging (MRI) to study brain regions and, and Chinese, to demonstrate the parallels there: Brain areas
more recently, difusion tensor imaging (DTI) to study pathways, of the dominant hemisphere process those features that ones
137
Brain and Language
language treats as linguistic, even if ones language is a signed a characteristic electrical response around 400 msec after that
one (sign language), where phonemic elements are not, word indexes our surprise.
strictly speaking, phonemic. Study of pragmatics in brain-damaged individuals rarely
Phonology of lexical items is treated, in some classiicatory focuses on aphasics, as their pragmatic abilities tend to be
schemata, as a part of semantics. Lexical-word-shape, how- remarkably spared. Rather, it is patients with damage to the
ever, can be divorced from word-meanings, as patients with the right hemisphere who evidence problems with pragmatic abili-
syndrome of word-deafness the ability to recognize that they ties, such as verbal humor appreciation, inferencing, con-
know a lexical item without knowing what it means demon- versational coherence, and the like. Patients with the dementias
strate. Studies of aphasic errors have demonstrated the psycho- show an interesting combination of the sparing of some basic
logical reality of such concepts as phoneme, syllable, word pragmatic abilities (e.g., eye contact during communication, use
stress, and the like. Moreover, priming studies show that lexical of formulaic language) and deicits in higher-level pragmatic
items are organizationally linked to others in their phonological behaviors, including, among those who are bilingual, the appro-
(or spelling or both) neighborhoods. priate choice of whether to address their interlocutors in one or
morphology has been studied via aphasiology with par- the other, or both, of their two languages.
ticular reference to agrammatism, that syndrome associated with Written language, of course, is not studied diferently from
Brocas aphasia in which the language that patients produce is auditory language by linguists, and rarely even by psycholin-
telegraphic, consisting largely of substantive nouns and, to a lesser guists, but the literature on brain-damaged individuals ofers
extent, verbs, and relatively devoid of functor grammatical ele- numerous examples of selective impairment of one of these
ments, including inlectional and derivational aixes. A number modes of input and/or output. Historically, such studies focused
of theories have been developed to account for this interesting on alexia and agraphia, that is, disturbances of reading and/or
phenomenon, and it is clear that the salience of the omissible ele- writing in previously literate individuals who had diiculties with
ments plays a role, as their production can be induced in a number these skills as the result of adult-onset brain damage. Currently,
of ways, suggesting that they are not lost but, rather, costly to pro- information about brainlanguage links for reading and writing
duce in Brocas aphasia. Evidence for salience varying across dif- comes from the study of dyslexia, which, of course, is not linked
ferent languages can be found in the reports on agrammatism in to frank brain damage but has been shown to co-occur with
14 languages in Menn and Obler (1990) and in the work of E. Bates unusual distribution of certain cellular types in language-related
and her colleagues (see Bates and Wulfeck 1989). Compounding, brain areas (e.g., Galaburda and Kemper, 1979). Psycholinguistic
too, has recently gained attention in neurolinguistics, and can be and brain-imaging studies of both groups of individuals have
seen to pose problems for agrammatic patients. shown diferences, as well as similarities, between the process-
Agrammatic patients are particularly pertinent in studying ing of written and spoken language. he same can be said for
syntax as well, since not only does their production suggest signed languages, as evident from those who are bilingual
that they minimize syntactic load, but their comprehension is speakers of a signed and a spoken language.
arguably impaired syntactically as well. Whether this is because
traces are nonexistent in such individuals or because of more
Conclusion: Language in Humans Brains
general processing deicits associated with brain damage, per-
In many branches of science, pendulum swings are evident
haps half of the patients with agrammatism have diiculty in
between a focus on overarching patterns achieved by ignoring
processing passive sentences, suggesting that the brain areas
details of individual diferences and a focus on those individ-
impaired in Brocas aphasia are required for full syntactic func-
ual diferences. In neurolinguistics, the latter can show the full
tion. In non-brain-damaged individuals, fMRI studies suggest
range of human brains substrates for humans language abili-
that substantial regions beyond the traditional perisylvian lan-
ties. We assume that all individuals (except those with speciic
guage areas regions such as prefrontal cortex, linked to general
language impairment) learn their irst language in pretty much
control functions of brain activity subserve comprehension
similar fashion, though we are well aware that even in the irst
(e.g., Caplan et al. 2007).
year of life, some of us start talking sooner and others later. In
Semantics is better studied in patients with dementing dis-
adulthood, too, we acknowledge certain individual diferences
eases such as Alzheimers disease, in whom it breaks down,
that are linked to language performance: Some of us are detail
than in patients with aphasia, in whom it appears better
oriented and others more big picture in cognitive style; some
spared, at least for nonluent (Brocas) and anomic aphasics.
of us are good second-language learners postpubertally and oth-
Nevertheless, there have been indications that patients with
ers less so, some of us naturally good spellers and others not, and
severe, global aphasia have diiculty with aspects of semantic
some of us slow readers and others of us fast. Indeed, we can
processing.
assume that at many levels, from the cellular to brain regional,
Psycholinguistic studies of lexical priming are also useful in
from the electrophysiological to iber connectivity, diferences
studying semantics in non-brain-damaged individuals. When
subserve our human ability to communicate via language. From
they have to judge if nurse is a word or not in English, they are
the irst century of work primarily in Europe to the late-twenti-
faster if they have seen doctor previously than the word horse,
eth-century addition of North American contributions, centers
suggesting semantic networks in our lexica. Event-related poten-
worldwide now participate in moving the ield of neurolinguis-
tial (ERP) measures, moreover, demonstrate that we process
tics forward.
top-down for semantic consistency. When we are presented with
a sentence that includes a word that is semantically anomalous, Loraine K. Obler
138
Brain and Language Brocas Area
WORKS CITED AND SUGGESTIONS FOR FURTHER READING BROCAS AREA

Alexander, M. P., and M. A. Naeser. 1988. Cortical-subcortical dif- In 1861, Pierre Paul Broca presented indings from two patients
ferences in aphasia. In Language, Communication and the Brain
who had diiculty speaking but relatively good comprehension
Research Publications: Association for Research in Nervous and Mental
(Broca 1861a, 1861b, 1861c). At autopsy, he determined that both
Disorders. Vol. 66. Ed F. Plum, 21528. New York: Raven Press.
of these patients, Leborgne and Lelong, sufered from injury to
Bates L. and B. Wulfeck. 1989. Comparative aphasiology: A cross-
linguistic approach to language breakdown. Aphasiology 3: 11142. the inferolateral frontal cortex. He concluded, he integrity
Caplan, D., G. Waters, D. Kennedy, N. Alpert, N. Makris, G. DeDe, J. of the third frontal convolution (and perhaps of the second) seems
Michaud, and A. Reddy. 2007. A study of syntactic processing in apha- indispensable to the exercise of the faculty of articulate language
sia II: Neurological aspects. Brain and Language 101: 15177. (1861a, 406). Four years later, Broca realized that these and sub-
Freud, S. [1891] 1953. Zur Aufassung der Aphasien. Trans. E. Stengel as sequent cases all had lesions to the left inferior frontal gyrus, thus
On Aphasia. New York: International University Press. making the association between language and the LEFT hemi-
Galaburda, A., and T. Kemper. 1979. Cytoarchitectonic abnormalities in sphere (Broca 1865). his assertion proved to be a landmark
developmental dyslexia: A case study. Annals of Neurology 6: 94100. discovery that laid the groundwork not only for the study of lan-
Geschwind, N. 1965. Disconnexion syndromes in animals and man.
guage but also for modern neuropsychology.
Brain 88: 585644.
he region of left hemisphere cortex described by Broca sub-
Goodglass, H., and E. Kaplan. 1972. he Assessment of Aphasia and
sequently came to be known as Brocas area and the speech dis-
Related Disorders. Philadelphia: Lea & Febiger.
Goodglass, H., and Wingield, A. 1997. Anomia: Neuroanatomical and order, Brocas aphasia. Today, Brocas area is generally deined
Cognitive Correlates. San Diego, CA: Academic Press. as Brodmanns cytoarchitectonic areas (BA) 44 and 45, corre-
Lecours, A.R., F. Chain, M. Poncet, J.-L. Nespoulous, and Y. Joanette. sponding to the pars opercularis and pars triangularis, respec-
1992. Paris 1908: he hot summer of aphasiology or a season in the tively. hese regions make up the posterior part of the inferior
life of a chair. Brain and Language 42: 10552. frontal gyrus (see Figure 1). Recent investigations have suggested
Menn, L., and L. K. Obler, eds. 1990. Agrammatic Aphasia: A Cross- important diferences between BA 44 and 45, both in anatomi-
Language Narrative Sourcebook. Vol. 3. Amsterdam: John Benjamins. cal asymmetries and in function. However, Broca himself never
Obler, L.K., and K. Gjerlow. 1999. Language and the Brain. designated the region so speciically, considering the posterior
half of the inferior frontal gyrus to be most crucial for the speech
Orton, S. [1937] 1989. Reading, Writing and Speech Problems in Children
disturbance he described (Dronkers et al. 2007).
and Selected Papers. Repr. Austin, TX: International Dyslexia
Association.
Although Brocas area is widely described as a critical speech
Pitres, A. 1895. Etude sur laphasie des polyglottes. Rev. Md. and language center, its precise role is still debated. Broca origi-
15: 87399. nally thought of this region as important for the articulation of
Schuell, H., J. Jenkins, and E. Jimenez-Pabon. 1964. Aphasia in Adults, speech (Broca 1864). More recently, a large body of research has
New York: Harper & Row. discussed its potential role in processing syntax. his premise
Sidtis, J. 2007. Some problems for representations of brain organiza- irst arose from behavioral studies of patients with Brocas apha-
tion based on activation in functional imaging. Brain and Language sia in the early part of the twentieth century. It was noted that
102: 13040. patients with Brocas aphasia produced agrammatic speech,
often omitting functor words (e.g., a, the) and morphologi-
Journals worth checking: Behavior and Brain Science, Brain, Brain
and Language, Journal of Cognitive Neuroscience, Journal of
cal markers (e.g., -s, -ed). he following is an example of such
Neurolinguistics, Nature Reviews Neuroscience, and NeuroReport.
Figure 1. Three-dimensional MRI reconstruction of the

lateral left hemisphere of a noraml brain in vivo, showing
the pars operculars (Brodmann's area 44, anterior to the
precentral sulcus) and the pars triangularis (Brodmann's
area 45, between the ascending and horizontal limbs of
the sylvian issure; see perisylvian cortex).
Reprinted with permission from Brain (2007), 130, pg.
1433, Oxford University Press.
139
Brocas Area
Figure 2. Lesion reconstruction of a patient with

Brocas aphasia who does not have a Brocas area
lesion (left) and a patient with a Brocas area lesion
without Brocas aphasia (right).
Figure 3. Photographs of the brains of Leborgne

(A) and Lelong (C), Paul Brocas irst two aphasic
patients, with close-ups of the lesion in each brain
(B and D). Reprinted from N. F. Dronkers, O. Plaisant,
M. T. Iba-Zizen, E. A. Cabanis (2007), Paul Brocas
historic cases: High resolution MR imaging of the
brains of Leborgne and Lelong, Brain, 130.5: 1436,
by permission of Oxford University Press.
telegraphic speech in a patient with Brocas aphasia, describing a overlap with those of Brocas aphasics (Caplan, Hildebrandt, and
drawing of a picnic scene by a lake: Makris 1996). In addition, individual patients with Brocas apha-
sia do not always show the same pattern of grammatical deicit
O, yeah. Dets a boy an a girl an a car house but, rather, vary in the types of errors they make (Caramazza et al.
light po (pole). Dog an a boat. N dets a mm a cofee, 2001). Cross-linguistic studies also put a damper on the agram-
an reading. Dets a mm a dets a boy ishin. matic theory of Brocas aphasia. E. Bates and colleagues have
During the 1970s, a number of studies reported that compre- shown that in other languages such as German, where grammati-
hension of complex syntactic forms was also disrupted in this cal markers are critical for conveying meaning and semantic
patient group. A seminal study by A. Caramazza and E. B. Zurif content, patients with Brocas aphasia do not omit morphemes
(1976) reported that Brocas aphasics had particular diiculty as they do in English (Bates, Wulfeck, and MacWhinney 1991).
understanding semantically reversible versus irreversible sen- Finally, recent studies have shown that grammatical errors can be
tences (e.g., he cat that the dog is biting is black vs. he apple induced in normal participants through the use of degraded stim-
that the boy is eating is red). his study concluded that Brocas uli or stressors, such as an additional working memory load
area mediated syntactic processes critical to both production (Dick et al. 2001). hese indings would argue against a grammar
and comprehension. By extension, Brocas area came to be asso- center but, rather, suggest that competition for resources could
ciated with syntactic processing. also underlie deicits observed in Brocas aphasia.
Although many subsequent studies supported this general
notion, several others pointed out the need for caution. For exam- Lesion Studies of Brocas Area
ple, M. C. Linebarger, M. Schwartz, and E. Safran (1983) showed As with the previous claims, it is often assumed that all patients
that agrammatic patients could make accurate grammatical- with Brocas aphasia have lesions in Brocas area, and thus that
ity judgments, which would seem to challenge the notion the deicits in Brocas aphasia equate to dysfunction in Brocas
of Brocas area being broadly involved in syntactic processing. area. However, many studies making claims about Brocas area
Second, many patients with luent aphasia and lesions outside of and its functions did not actually verify lesion site. In fact, lesion
Brocas area have been found to exhibit grammatical deicits that studies have shown that Brocas aphasia typically results from
140
Brocas Area
a large left hemisphere lesion that extends beyond Brocas area possible that subregions of what is now interpreted as Brocas
to include underlying white matter, adjacent frontal cortex, and area (i.e., BA 44, 45) may be functionally distinct, which could
insular cortex (Alexander, Naeser, and Palumbo 1990; Mohr et explain the heterogeneity of functions associated with this
al. 1978). Color Plate 3 shows a lesion overlay map of 36 patients area. An alternative explanation is that functional activations of
with chronic Brocas aphasia persisting more than one year. As Brocas area may be due to task demands involving articulation
can be seen, the region of common overlap (shown in dark red) and/or subvocal rehearsal. hese ideas remain to be explored.
is not Brocas area but rather medial regions (i.e., more central), A number of theories have arisen to try to reconcile the early
namely, insular cortex and key white matter tracts. Indeed, only lesion studies of agrammatism and newer functional neuroimag-
5060 percent of patients with lesions extending into Brocas ing indings. For example, it has been suggested that Brocas area
area have a persistent Brocas aphasia. Lesions restricted to is crucial not for syntactic processing but for aspects of on-line
Brocas area tend to cause a transient mutism followed by altered storage (i.e., verbal working memory) that in turn may under-
speech output, but not a chronic Brocas aphasia (Mohr et al. lie the ability to process complex grammatical forms (Stowe,
1978; Penield and Roberts 1959). hese indings would suggest Haverkort, and Zwarts 2005). C. J. Fiebach and colleagues (2005)
that Brocas area proper might be more involved in later stages of showed that BA 44 was active when participants processed sen-
speech production (e.g., articulation). tences with a large working-memory load, and many neuroimag-
Even early studies reported contradictory cases, namely, ing studies have suggested that Brocas area is involved in verbal
patients with Brocas area afected but no Brocas aphasia or working memory, in particular (e.g., Awh, Smith, and Jonides
patients with Brocas aphasia but no lesion in Brocas area (e.g., 1995). Such indings would suggest that Brocas area may be cru-
Marie 1906; Moutier 1908). Figure 2 shows an example of an cial for the understanding of complex syntactic forms due to a
individual with Brocas aphasia whose lesion spares Brocas area basic role in subvocal rehearsal (but see Caplan et al. 2000).
(left) and one with word-inding problems but no Brocas apha- In sum, though the term Brocas area has persisted for more
sia after a lesion to Brocas area (right). than a hundred years, the precise anatomical demarcation of
Although Broca deduced that the critical region for his this brain region, along with its exact role in speech and language
patients articulation disturbance was the inferior frontal gyrus, processing, are still being debated. At a minimum, Brocas area
he realized that his patients lesions most likely extended more plays a role in end-stage speech production but is unquestion-
medially. However, he wanted to maintain the brains for poster- ingly a signiicant part of a larger network that supports speech
ity and chose not to dissect them (see Figure 3). Recently, N. F. and language functions in the left hemisphere. Further work is
Dronkers and colleagues (2007) had the opportunity to acquire needed to determine whether it plays a direct or indirect role in
three-dimensional MRI images of the brains of Brocas two orig- a number of other cognitive processes that have been suggested
inal patients (Leborgne and Lelong), which are kept in a Paris and whether these relate to neighboring regions within the infe-
museum. hey found that the lesions in both patients extended rior frontal gyrus or distinct functional subregions within the ter-
quite medially, involving underlying white matter, including the ritory known as Brocas area.
superior longitudinal fasciculus. Moreover, one of the patients
Nina F. Dronkers and Juliana V. Baldo
brains (Leborgnes) had additional damage to the insula, basal
ganglia, and internal and external capsules. With respect to
the extent of frontal involvement, Leborgnes lesion afected the
middle third of the inferior frontal gyrus to the greatest extent, Alexander, M., M. Naeser, and C. Palumbo. 1990. Brocas area apha-
with only some atrophy in the posterior third. In Brocas second sias: Aphasia after lesions including the frontal operculum. Neurology
patient, Lelong, the lesion spared the pars triangularis, afect- 40: 35362.
Awh, E., E. Smith, and J. Jonides. 1995. Human rehearsal processes and
ing only the posterior portion of the pars opercularis. hus, even
the frontal lobes: PET evidence. Annals of the New York Academy of
what is commonly referred to as Brocas area (BA 44, 45) is not
Sciences 769: 97117.
exactly the region afected in Brocas original patients. Bates, E., B. Wulfeck, and B. MacWhinney. 1991. Crosslinguistic research
in aphasia: An overview. Brain and Language 41: 12348.
Functional Neuroimaging of Brocas Area Broca, P. 1861a. Nouvelle observation daphmie produite par une
More recently, functional neuroimaging techniques, such lsion de la troisime circonvolution frontale. Bulletins de la Socit
as functional magnetic resonance imaging (fMRI) and positron danatomie (Paris), 2e serie, 6: 398407.
emission tomography (PET), have opened up new avenues for Broca, P. 1861b. Perte de la parole: Ramollissement chronique
the study of brain areas involved in language and cognition. et destruction partielle du lobe anterieur gauche du cerveau. Bulletins
Consistent with the behavioral studies described previously, a de la Socit danthropologie, 1re serie, 2: 2358.
number of functional neuroimaging studies with normal par- . 1861c. Remarques sur le sige de la facult du langage articul,
ticipants have suggested a link between Brocas area and syn- suivies dune observation daphmie (perte de la parole). Bulletins de
la Socit danatomie (Paris), 2e serie, 6: 33057.
tactic processing (e.g., Caplan et al. 2000; Friederici, Meyer, and
. 1864. Sur les mots aphmie, aphasie et aphrasie; Lettre M. le
von Cramon 2000). However, Brocas area has also been linked
Professeur Trousseau. Gazette des hopitaux 23.
to a number of other nonsyntactic cognitive processes in the . 1865. Sur le sige de la facult du langage articul. Bulletin de la
left hemisphere, such as verbal working memory, semantics, Socit dAnthropologie 6: 33793.
frequency discrimination, imitation/mirror neurons (see mir- Caplan, D., N. Alpert, G. Waters, and A. Olivieri. 2000. Activation of
ror systems, imitation, and language ), tone percep- Brocas area by syntactic processing under conditions of concurrent
tion (in tonal languages), and phonological processing. It is articulation. Human Brain Mapping 9: 6571.
141
Cartesian Linguistics Case
Caplan, D., N. Hildebrandt, and N. Makris. 1996. Location of lesions in Rationalists adopt a nativist (see innateness and innatism)
stroke patients with deicits in syntactic processing in sentence com- and internalist approach to the study of language. Support for
prehension. Brain 119: 93349. nativism is found in poverty-of-the-stimulus observations. To take
Caramazza, A., E. Capitani, A. Rey, and R. S. Berndt. 2001. Agrammatic
these observations seriously, rationalists believe, someone con-
Brocas aphasia is not associated with a single pattern of comprehen-
structing a theory of language is advised to assume that much of
sion performance. Brain and Language 76: 15884.
linguistic structure and content is somehow latent in the infants
Caramazza, A., and E. B. Zurif. 1976. Dissociation of algorithmic and
heuristic processes in language comprehension: Evidence from apha- mind: Experience serves to trigger or occasion structure and
sia. Brain and Language 3: 57282. content, not form and constitute them. Descartes himself was a
Dick, F., E. Bates, B. Wulfeck, M. Gernsbacher, J. A. Utman, and rationalist and appealed to poverty facts to support his views of
N. Dronkers. 2001. Language deicits, localization, and gram- innate and adventitious (but not made up) concepts/ideas.
mar: Evidence for a distributive model of language breakdown in Until taken up by his Port-Royal followers, however, there was lit-
aphasics and normals. Psychological Review 108: 75988. tle attention to the innateness of the structure of language itself.
Dronkers, N. F., O. Plaisant, M. T. Iba-Zizen, and E. A. Cabanis. 2007. Descartess greater contribution to the strategy with his name
Paul Brocas historic cases: High resolution MR imaging of the brains lies in less discussed but equally important observations con-
of Leborgne and Lelong. Brain 130.5: 143241.
cerning the creative aspect of language use (see creativity in
Fiebach, C. J., M. Schlesewsky, G. Lohmann, D. Y. von Cramon, and A.
language use). Encapsulated in Descartess Discourse V, these
D. Friederici. 2005. Revisiting the role of Brocas area in sentence
note that speakers can on occasion produce any of an unbounded
processing: Syntactic integration versus syntactic working memory.
Human Brain Mapping 24: 7991. set of expressions (unboundedness), without regard to external
Friederici, A. D., M. Meyer, and D. Y. von Cramon. 2000. Auditory and internal stimulus conditions (stimulus freedom), sentences
language comprehension: An event-related fMRI study on the pro- that nevertheless are appropriate and coherent with respect to
cessing of syntactic and lexical information. Brain and Language discourse context (appropriateness and coherence). Taking
74: 289300. these seriously suggests a scientiic research strategy that focuses
Linebarger, M. C., M. Schwartz, and E. Safran. 1983. Sensitivity to not on linguistic action/behavior (language use) itself, for that
grammatical structure in so-called agrammatic aphasics. Cognition is in the domain of free human action, but on an internal system,
13: 36193. the language faculty. A science of linguistic action would not
Marie, P. 1906. Revision de la question de laphasia: La troiseme circon-
only have to take into account the speakers understanding of rea-
volution frontale gauche ne joue aucun role special dans la function du
sons for speaking and the job that an utterance is understood to
langage. Semaine Medicale 26: 2417.
perform but would also have to say what a person will utter on a
Mohr, J., Pessin, S. Finkelstein, H. H. Funkenstein, G. W. Duncan, and
K. R. Davis. 1978. Broca aphasia: Pathologic and clinical. Neurology speciic occasion. No extant science can do that, and likely none
28: 31124. ever will. Given stimulus freedom, independently speciiable
Moutier, F. 1908. Laphasie de Broca. Paris: Steinhell. conditions for utterance are unavailable, and there is no upper
Penield, W., and L. Roberts. 1959. Speech and Brain Mechanisms. bound on sentences appropriate to a speakers understanding of
Princeton, NJ: Princeton University Press. discourse circumstance. he scientist of language should focus
Stowe, L. A., M. Haverkort, and F. Zwarts. 2005. Rethinking the neuro- on language as an internal system, on competence, not on what
logical basis of language. Lingua 115: 9971042. people do by using the tools their systems ofer. Science can
say what a language can yield; a theory of competence does that.
But it likely cannot say what will be said, when it will be said, or
whether it is appropriate and, if so, why.
C Generally speaking, the rationalist strategy treats languages
as natural (native) systems in the head, not artifacts. he empiri-
cist strategy (not empirical science) treats languages as artifacts
CARTESIAN LINGUISTICS in the head, as socially constituted sets of practices or behaviors,
mastery of which (learning) requires training and negative evi-
his term began as the title of a 1966 monograph by Noam dence. Cartesian Linguistics emphasizes that there are few, if
Chomsky. It has become the name of a research strategy for the any, linguistic practices and that children grow languages.
scientiic study of language and mind that Chomsky in other
works calls rationalist or biolinguistic, which he contrasts James McGilvray
to an empiricist strategy (see biolinguistics). Cartesian
Linguistics illuminates these strategies by focusing on contrasting WORKS CITED AND SUGGESTIONS FOR FURTHER READING
assumptions concerning mind and language and their study that
Chomsky, Noam. 1966. Cartesian Linguistics. New York: Harper and
are found in the writings of a selection of philosophers and lin- Row.
guists from the late sixteenth century through 1966. he rational- . 2003. Cartesian Linguistics. 3rd ed. Ed. and introd. J. McGilvray.
ists include Descartes, the Port-Royal Grammarians, Humboldt, Cambridge: Cambridge University Press.
Cudworth, and clearly Chomsky himself in 1966 and now.
he empiricists include Harris, Herder, the modern linguists
CASE
(L. Bloomield, M. Joos, etc.), and again clearly behaviorists,
connectionists, and others attracted to the idea that children his term has been traditionally employed to designate the type of
learn languages rather than growing them. morphological ending that indicates the syntactic function of the
142
Categorial Grammar
noun phrase that bears it. In Latin, for instance, the word meaning project word order and build logical forms via general rules of
girl may surface as puella (nominative case), puellam (accusative grammar. First proposed by Kazimierz Ajdukiewicz (1935), it is
case), or puellae (genitive case), depending on whether it is the sub- arguably the oldest lexicalized formalism. CG is not itself a the-
ject of the sentence, is the object of a transitive verb, or stands in a ory of natural language grammar, but its use has many linguisti-
possessor relation with respect to another noun. Languages vary with cally relevant ramiications: lexicalization, lexible constituency,
respect to the morphological case distinctions they make. Languages semantic transparency, control over generative capacity, and
such as Latin have six case distinctions (nominative, genitive, dative, computational tractability.
accusative, vocative, and ablative) whereas languages such as Chinese
have none. One may also ind languages like English in which only a Basic Principles of Categorial Grammar
subset of nominal elements display case distinctions as in he (nomi- In Ajdukiewiczs system, words are assigned categories, which
native), him (accusative), and his (genitive). are atomic types like np (noun phrase) and s (sentence) or com-
On the basis of a suggestion by Jean-Roger Vergnaud (Rouveret plex types like s\np, which indicate the arguments that a function
and Vergnaud 1980), Noam Chomsky (1981) developed a theory (such as the category for an intransitive verb) subcategorizes for.
of abstract case (annotated as Case). According to this theory, it is Words and phrases combine with others by cancellation of sub-
a property of all languages that noun phrases can only be licensed categorized arguments through a general operation akin to mul-
in a sentence if associated with Case. Whether or not the abstract tiplication. hat is, just as 4 (3 4) = 3, there is a grammatical
Cases get morphologically realized as speciic markings on (some) correlate: np . (s\np) = s. Given that the word Olivia has the type
noun phrases is a language-speciic property. Research in the last np and that sleeps has the type s\np, their concatenation Olivia
two decades has been devoted to identifying i) which elements sleeps has the type s via cancellation of the np argument of sleeps.
are Case-licensers, ii) which structural conigurations allow Case CG assumes compositionality: he global properties
licensing, and iii) what the precise nature of such licensing is. he associated with a linguistic expression are determined entirely by
contrast between John/he/*him/*his sang and *John/*he/*him/*his the properties of its component parts. Linguistic expressions are
to sing, for example, has led to the conclusion that in English, the multidimensional structured signs containing phonological/
past tense may license nominative Case, but the ininitival to is not orthographic (), syntactic (), and semantic () speciications
a Case-licenser. In turn, the contrast between John/he/*him/*his for the expression. CG is distinguished in that it uses categories
was greeted and was greeted *John/*he/*him/*his indicates that as syntactic types, such as those mentioned. Complex catego-
nominative is licensed by the past tense if the relevant noun phrase ries encode both subcategorization and linear order constraints,
occupies the subject, but not the object, position. using the leftward slash \ and the rightward slash /.
When a given Case only encodes syntactic information, it is Some (simpliied) lexical entries are given in the following
referred to as structural Case. Nominative and accusative cases format := : .
in English are prototypical examples of structural Case. In John
greeted her and she was greeted by John, for example, the pronoun
bearing the thematic role of patient has accusative Case in
the irst sentence, but nominative in the second. On the other
hand, when a given Case also encodes thematic information,
it is referred to as inherent Case (Chomsky 1986; Belletti 1988).
he preposition of in English, for example, has been analyzed
as a marker of inherent Case, for it only licenses a noun phrase
that is the complement of the preceding noun: It may license that he transitive verb category (s\np)/np seeks an object noun
country in the invasion of that country but not in *the belief of that phrase to its right and then a subject noun phrase to its left;
country to be progressive because that country is the complement after these arguments are consumed, the result is a sentence.
of invasion but not of belief. Semantics are given as -calculus expressions that reduce to
Jairo Nunes predicate and argument structures after syntactic com-
bination. he -calculus is a standard system used in CG (and
WORKS CITED AND SUGGESTIONS FOR FURTHER READING many other frameworks) for representing semantics derived by
syntax, where variables in the -terms are bound to correspond-
Belletti, A. 1988. he Case of unaccusatives. Linguistic Inquiry 19: 134.
ing syntactic arguments. For the category for likes, the x variable
Chomsky, N. 1981. Lectures on Government and Binding.
Dordrecht: Foris. is bound to the object (the /np argument), and the y is bound to
. 1986. Knowledge of Language: Its Nature, Origin and Use. the subject (the \np argument) an example of how this works
New York: Praeger. in a derivation follows. hese semantics (and the categories
Rouveret, A., and J. R. Vergnaud. 1980. Specifying reference to the sub- themselves) are obviously simpliied and are intended here only
ject: French causatives and conditions on representations. Linguistic to demonstrate how the correct dependencies are established.
Inquiry 11: 97202. Leftward and rightward slashes project directionality via two
order-sensitive, universal rules of function application:
CATEGORIAL GRAMMAR
Categorial grammar (CG) is a family of formalisms that model
syntax and semantics by assigning rich lexical categories that
143
Categorial Grammar
Combinatory Categorial Grammar and Categorial

Type Logics
CG moves further from PSGs and other frameworks by incorpo-
rating other rules that provide new kinds of inference over cat-
egories. hese rules are responsible for the type-driven lexible
constituency for which CG is well known. he two main branches
of CG can be broadly construed as rule based, exempliied by
Figure 1. combinatory categorial grammar (CCG) (Steedman 2000), and
deductive, exempliied by categorial type logics (CTL) (Moortgat
1997; Oehrle in press). We discuss both of these briely.
In words, the forward rule states that a category of type X/Y CCG adds a small set of syntactic rules that are linear counter-
can combine with one of type Y found to its right to produce parts of the combinators from combinatory logic. Two combina-
a category of type X. he symbol > is an abbreviation used in tors, composition (B) and type-raising (T), lead to the following
derivations (as in the next example). he function is applied rules, among others:
similarly for the backward rule. When these rules are used to
combine two syntactic categories, their semantic components
are also combined via function application in the -calculus
(indicated in the rules as f a). For example, the result of
applying the function x.y.like(y,x) to the argument chocolate
is y.like(y,chocolate). his lock-step syntactic-semantic com- he rules are guaranteed to be semantically consistent.
bination underlies the transparent syntax-semantics interface Composition of categories leads to composition of the semantic
ofered by CG. functions in the -calculus. Type-raising turns an argument into
With these rules and lexicon, the derivation can be given in a function over functions that seek that argument. See the follow-
Figure 1. ing for an example of both of these rules in a derivation.
he subcategorized arguments of the verb are consumed one CTL is a family of resource-sensitive linear logics, complete
after the other, and the semantic relexes of the syntactic rules with hypothetical reasoning. his approach began with Joachim
are carried out in parallel. his derivation is isomorphic to a Lambek (1958) recasting basic CG as a logical calculus in which
standard phrase structure grammar (PSG) analysis of such slashes are directionally sensitive implications; for example, the
sentences. Derivational steps can be viewed as instantiations of English transitive category (s\np)/np is (np s) np. As such,
rules of a PSG written in the accepting, rather than producing, categories are provided sound and complete model theoretic
direction (e.g., np s\np s instead of s np vp). interpretations. he application rules given earlier are then just
leftward and rightward variants of modus ponens. Additional
Type Dependency abstract rules may be deined that allow structured sequents of
CG with just function application is weakly equivalent to stan- proof terms to be reconigured to allow associativity and permu-
dard context-free phrase structure grammar. Nonetheless, the tativity. One result is that many rules can be derived as theorems
approach is radically diferent: Syntactic well-formedness in CG of a given CTL system. For example, any expression with the cat-
is type dependent rather than structure dependent, and deriva- egory np can be shown to also have the category s/(s\np), among
tion is an artifact rather than a representational level. Also, cat- others. his is an instance of type-raising; similarly, CCGs com-
egories labeling the nodes of categorial derivations are much position rules (as well as others) can be show to follow from CTL
more informative than the atomic symbols of constituent systems that allow associativity. With CTL, such rules follow from
structure produced by PSGs. Subcategorization is directly the logic, whereas rule-based systems like CCG tend to incorpo-
encoded in categories like s\np, (s\np)/np, and ((s\np)/np)/ rate a subset of such abstract rules explicitly based on empirical
np, rather than with stipulated nonterminal symbols such as evidence.
V-intrans, V-trans, and V-ditrans that have no transparent con- As an example of how CCGs rules engender lexible con-
nection to their semantic types. Furthermore, there is a system- stituency, the sentence Olivia likes chocolate in Figure 2 has an
atic correspondence between notions such as intransitive and alternative derivation using composition and type-raising. his
transitive: After the transitive category (s\np)/np consumes its derivation involves a nontraditional constituent with category
object argument, the resulting category s\np is that of an intran- s/np for the string Olivia likes, which then combines with choc-
sitive verb. olate to produce the same result as the previous derivation. A
More importantly, type dependency shifts the perspective similar analysis can be given with CTL.
on grammar shared with tree-adjoining grammar and head- Whether through CCGs rules or through CTL proofs, semanti-
driven phrase structure grammar away from a top- cally coherent interpretations for a wide variety of non-traditional
down one in which phrase structure rules dictate constituent constituents can be created. his forms the core of accounts of
structure into a bottom-up one in which lexical items project extraction and coordination, as well as intonation and infor-
structure through the non-language-speciic rules (i.e., CGs mation structure and incremental processing in CG. For example,
universal grammar). Recent developments in the trans- subject relative pronouns like who have the category (n\n)/(s\np),
formational grammar tradition, such as minimalism, have which seeks an intransitive verb type to produce a post-nominal
also incorporated such a lexically driven perspective. modiier, while object relativizers like whom have the category
144
Categorial Grammar
Figure 2.
(n\n)/(s/np), which seeks types which are missing objects such

Figure 3.
as Olivia likes and Olivia gave Finn, which both have the type s/
np. Extraction is thus handled without appeal to movement or
traces. long-distance dependencies in object extraction are
Current Applications and Developments
captured because forward composition allows the unsaturated
While CG is well known but not widely practiced in mainstream
argument to be successively passed up until it is revealed to the
linguistics, it has considerable uptake in both mathematical
relativizer, as in Figure 3. Under the standard assumption that
logic and computational linguistics. Computational
coordination combines constituents of like types, then right-node
implementations of CCG are used for parsing and generation
raising is simply constituent coordination:
for dialog systems. Current probabilistic CCG parsers, trained
[Kestrel heard]s/np and [Finn thinks Olivia saw]s/np the plane in the CCGbank corpus of CCG derivations for newspaper texts,
lying overhead. are among the fastest and most accurate available for identify-
ing deep syntactic dependencies. A useful aspect of CG for such
he compositional semantic terms, omitted here for brevity,
implementations that also has implications for its relevance for
are guaranteed to be consistent with the semantics projected
psycholinguistics is that the competence grammar is used
by the lexical entries because the composition and type-raising
directly in performance.
rules themselves are semantically consistent.
Despite a long divergence between the rule-based and
hese processes conspire in other contexts to create constitu-
deductive approaches to CG, recent work has brought them into
ents for argument clusters that allow similarly straightforward
greater alignment. CCG has adopted CTLs multimodal perspec-
analyses for sentences like Kestrel gave Finn comics and Olivia
tive. his connection allows eicient rule-based parsing systems
books.
to be generated from CTL grammars the lexicon remains the
his phenomenon has been called nonconstituent coordina-
same regardless of the approach. CCG itself can be viewed as the
tion, relecting the diiculty in assigning a meaningful phrase
caching out of an underlying deinition given in CTL; as such,
structure that groups indirect objects with direct objects. From
CTL can be seen as providing metatheories for ruled-based CGs
the CG perspective, it is simply treated as type-driven constitu-
like CCG. Work in CTL explores ine-grained control over gram-
ent coordination.
matical processes within the space of sound and complete logics;
One of the key innovations CTL has brought to CG is the incor-
work in CCG focuses on cross-linguistic, wide-coverage parsing
poration of a multimodal system of logical reasoning (including
and computational grammar acquisition. Researchers in both of
unary modes) that allows selective access to rules that permit
these traditions continue to expand the range of languages and
associativity and permutation. It is thus possible for a grammar
syntactic phenomena receiving categorial treatments.
to allow powerful operations (like permutative ones needed for
scrambling) without losing discrimination (e.g., engendering a Jason Baldridge
collapse of word order throughout the grammar), while enjoying
a (quite small) universal rule component. WORKS CITED AND SUGGESTIONS FOR FURTHER READING
Other rules be they CCG-style rules or CTLs structural Ajdukiewicz, Kazimierz. 1935. Die syntaktische Konnexitt. In Polish
rules support analyses for phenomena such as nonperipheral Logic 19201939, ed. Storrs McCall, 20731. Oxford: Oxford University
extraction, heavy-NP shift, parasitic gaps, scrambling, ellip- Press. Translated from Studia Philosophica 1: 127.
sis, and others. he commitment to semantic transparency Jacobson, Pauline. 2008. Direct compositionality and variable-free
and compositionality remains strong throughout; for example, semantics: he case of antecedent contained deletion. In Topics in
Pauline Jacobson (2008) tackles antecedent-contained deletion Ellipsis, ed. Kyle Johnson, 3368. Oxford: Oxford University Press.
in a directly compositional manner with CG and variable-free Lambek, Joachim. 1958. he mathematics of sentence structure.
semantics. American Mathematical Monthly 65: 15470.
Moortgat, Michael. 1997. Categorial type logics. In Handbook of Logic
See Wood (1993) for a balanced overview of many CG
and Linguistics, ed. Johan van Benthem and Alice ter Meulen, 99177.
approaches and analyses. Steedman and Baldridge (in press)
Amsterdam: Elsevier; Cambridge, MA: MIT Press.
gives a more recent introduction to and overview of work in CCG. Oehrle, Richard. Multi-modal type-logical grammar. In Non-
Vermaat (2005) provides a clear and concise introduction to CTL Transformational Syntax: A Guide to Current Models, ed. Robert
(including pointers to connections between CTL and minimal- Borsley and Kersti Brjars. Malden, MA: Blackwell. In press.
ism), as well as an extensive cross-linguistic account of wh-ques- Steedman, Mark. 2000. he Syntactic Process. Cambridge, MA: MIT
tions using a very small set of universal structural rules. Press.
145
Categorization
Steedman, Mark, and Jason Baldridge. Combinatory categorial gram- A major landmark in the development of a nonclassical the-
mar. In Non-Transformational Syntax: A Guide to Current Models, ed. ory of categorization was the work of psychologist Eleanor Rosch
Robert Borsley and Kersti Brjars. Malden, MA: Blackwell. In press. (1978). She argued that categories have a prototype struc-
Vermaat, Willemijn. 2005. he logic of variation. A cross-linguistic ture, that is, are centered around good examples, and that things
account of wh-question formation. Ph.D. diss., Utrecht University.
belong to the category in virtue of their exhibiting some similar-
Wood, Mary McGee. 1993. Categorial Grammar. London: Routledge.
ities with the prototype. he members of a category, therefore,
do not need to share the same set of features. Moreover, some
members can be better or more representative examples of the
CATEGORIZATION
category than others.
William Labov (1973, 342) stated, If linguistics can be said to be Rosch addressed not only the internal structure of categories
any one thing, it is the study of categories: that is, the study of how but also the question of what makes a good category. Good cat-
language translates meaning into sound through the categoriza- egories the ones that people operate with, and which are likely
tion of reality into units and sets of units. Labov is here address- to be encoded in human languages are those that deliver maxi-
ing the relation between linguistic expressions and the things mum information to the user with minimal cognitive efort. We
and situations to which the expressions are used to refer. he can approach this matter in terms of the interplay of cue validity
circumstances of the world are limitless in their variety; linguis- and category validity. Cue validity means that having observed
tic resources are inite. Since it is not possible to have a unique that an entity exhibits a certain feature, you can assign the entity,
name for every entity that we encounter or a special expression with a fair degree of conidence, to a certain category. Category
for every event that happens, we need to categorize the world validity means that having learned that an entity belongs to a cer-
in order to speak about it. We need to regard some entities, and tain category, you have expectations about the likely properties
some events, as being the same as others. of the entity. In this way, we can infer quite a lot about the things
he relation between a word and a referent is not direct but that we encounter, on the basis of minimal information about
is mediated by the words meaning. It is in virtue of its mean- them.
ing that a word can be used to refer. A words meaning can be Categories also need to be studied against broader concep-
thought of as a concept, and a concept, in turn, can be thought of tual and cultural knowledge having to do with human inten-
as a principle of categorization. To know the word mug (to take tions and purposes, presumed causal relations between things
one of Labovs examples) is to have the concept of a mug, which and events, and beliefs about how the world is structured. We
in turn means being able to use the word appropriately, namely, can imagine all kinds of hypothetical categories say, a cate-
for things that are called mugs. his goes not only for names of gory comprising things that are yellow, weighing under ive kilo-
concrete things like mugs but also for names of abstract entities grams, and manufactured in 1980. Such a category is unlikely to
and for words of other syntactic categories. To state that X is on be lexicalized in any human language. It displays very low cue
Y is to categorize the relation between X and Y as an on-relation and category validity and would, therefore, not be useful to its
rather than an in- or an at-relation. We can make similar claims users. It would also have no role to play in any broader knowl-
for other elements in a language, such as markers of tense and edge system.
aspect. To describe an event in the present perfect as opposed to As already recognized by Labov, the issue of categorization
the past simple or to use progressive as opposed to non-progres- applies not only to the categories we use in talking about the
sive aspect is to categorize the event in a manner consistent with world but also to the analysis of language itself. he very termi-
the concept designated by the morpho-syntactic elements. nology of linguistic description is replete with names of catego-
On many counts, therefore, linguists need a theory of categories, such as phoneme, noun, direct object, word, dialect,
rization. he theory must provide answers to two related ques- and so on, and practical linguistic description involves assigning
tions. On what basis are entities assigned to a category? And why linguistic phenomena to these various categories. Although the
do we categorize the world in just the way that we do? classical approach to categorization is still very strong among
According to what has come to be known as the classical linguistic theoreticians, it is increasingly recognized that the cat-
or Aristotelian theory, a category is deined in terms of a set of egories of linguistics may have a prototype structure and are to
necessary and sufficient conditions; it follows that be understood, in the irst instance, in terms of good examples.
things belong in a category because they exhibit each of the As Labov remarked, linguistics is indeed the study of
deining features. here are many problems associated with categories!
this view. First, it often is just not possible to list the deining
John R. Taylor
features. What, for example, are the deining features of mug as
opposed to cup? hen there is the question of the features them- WORKS CITED AND SUGGESTIONS FOR FURTHER READING
selves. Each feature will itself deine a category, which in turn
Labov, William. 1973. he boundaries of words and their meanings. In
must be deined in terms of its necessary and suicient features.
New Ways of Analyzing Variation in English, ed. C. J. Bailey and R. W.
Unless we are prepared to postulate a set of primitive features
Shuy, 34072. Washington, DC: Georgetown University Press.
out of which all possible categories are constructed, we are
Lakof, George. 1987. Women, Fire, and Dangerous hings: What
faced with an ininite regress. Finally, the classical theory makes Categories Reveal About the Mind. Chicago: University of Chicago
no predications about why we should have the categories that Press.
we do. Any conceivable combination of features could consti- Murphy, Gregory. 2002. he Big Book of Concepts. Cambridge, MA: MIT
tute a valid category. Press.
146
Causative Constructions
Rosch, Eleanor. 1978. Principles of categorization. In Cognition semantic hierarchy (6) proposed by Peter Cole (1983). It relects
and Categorization, ed. E. Rosch and B. Lloyd, 2748. Hillsdale, the greater-to-lesser degree of control retained by a causee NP.
NJ: Lawrence Erlbaum.
Taylor, John R. 2003. Linguistic Categorization. Oxford: Oxford University (6) Agent > Experiencer > Patient
Press.
Further Semantic/Pragmatic Dimensions of CCs
It is not unusual for a language to have more than one type of
CAUSATIVE CONSTRUCTIONS CC, for example, Japanese morphological and lexical causatives
(3) and (2). In lexical causative (3), causative and causativized
Causative construction (CC) is deined as form-meaning map- verbs are completely fused, while in morphological causative (2)
ping that encodes a causative situation (CS) in which an entity they are separated by morpheme boundary. Crucially, as dem-
(typically human), or causer, acts upon another entity (typically onstrated by John Haiman (1983), the diferential degrees of
human), or causee, to induce some action. Cross-linguistically, fusion semantically correlate with the difering degrees of direct-
this mapping is known to operate using periphrastic syntactic ness involved in causing an event. For instance, while a lexical
construction, valence-increasing morphology, or lexical verb. causative (3) encodes the causers nonmediated action of put-
hree structural types of CCs thus identiied are a) the syntactic ting clothes on the causee, a morphological causative (2) can
causative, which employs a periphrastic causative verb like make express the situation where the causee put on his clothes upon
in English (1); b) the morphological causative, which employs a the causers request (e.g., verbal command).
causative aix like -(s)ase- in Japanese (2); and the lexical caus- Languages can employ two causatives in a single sentence, as
ative, wherein a transitive verb with inherently causative mean- in a Korean example ( 7), to encode a sequence of CSs.
ing is employed (3), e.g. kiseru to put (clothes) on (someone)
in Japanese. (7) John-i Tom-eykey Mary-lul cwuk-i-key ha-ess-ta.
nom dat acc die- caus-caus-past-decl
(1) Mary made him read the book. (Syntactic) John made Tom kill Mary.
(2) Mary-ga musuko-ni huku-o ki-sase-ta. (Morphological) (Ishihara, Horie, and Pardeshi 2006, 323)
nom son-dat clothes-acc wear-caus-past Double CCs do not always encode a sequence of CSs and can
Mary made her son put on clothes. serve some pragmatic function instead. For instance, as observed
(3) Mary-ga musuko-ni huku-o kise-ta. (Lexical) by J. Okada (2003), the Japanese double causative occurs most
nom son-dat clothes-acc put (clothes) on (someone)-past frequently in highly conventionalized benefactive expressions
Mary put clothes on her son. indexing a speakers expression of humbleness toward his/her
own action, as well as politeness toward the addressee, such
as -(s)ase-sase-te itadaku (to have someone allow one to do
Grammatical and Semantic Hierarchies of the Causee something), as in (8).
Nominal Case Marking
CCs can involve the adjustment of case marking of a causee NP, (8) Otayori yom-as-ase-te itadaki-masu.
accompanied by the increased valence of a causer NP. As shown letter read- caus-caus-conj humbly receive-pol:nonpast
in Japanese examples (2)(3), with the causer/subject NP Mary Allow me to read this letter.
assigned the nominative case marking -ga, the causee NP John (Okada 2003, 29)
is demoted or deranked to lower case marking, in this case the In this instance, as contrasted with its single causative coun-
dative ni, since the accusative o is already assumed by another terpart yom-ase-ite itadaku (read-caus-humbly receive), the
NP/direct object hon (book). he deranking order of a cau- double causative serves to reinforce the speakers expression of
see NP relects a hierarchy of grammatical relations established humbleness and politeness.
cross-linguistically (4): CCs have also been productively investigated by more for-
(4) Subject > Direct Object > Indirect Object > Oblique (Whaley mally oriented linguists (e.g., Kuroda 1993, Miyagawa 1998).
1997, 193) Kaoru Horie
Functional-typological studies (Shibatani 1976a; Givn 1980;
Cole 1983; see functional linguistics, typology) have
noted that diferential case marking indexes the difering degrees Cole, Peter. 1983. he grammatical role of the causee in universal gram-
of control that a causee can exercise over his or her action rela- mar. International Journal of American Linguistics 49: 11533.
tive to the causer. Consider a Japanese example (5). Comrie, Bernard. 1976. he syntax of causative constructions: Cross-
language similarities and divergences. In Shibatani 1976b, 261312.
(5) Mary-ga John-{o/ni} Tokyo-e ik-ase-ta. . 1985. Causative verb formation and other verb-deriving mor-
nom acc/dat to go-caus-past phology. In Language Typology and Syntactic Description. Vol.
Mary {made/let} John go to Tokyo. 3: Grammatical Categories and Lexicon. Ed. Timothy Shopen, 30948.
he accusative case marker o indexes a lesser degree of con- Givn, Talmy. 1980. he binding hierarchy and the typology of comple-
trol retained by the causee than the dative case marker -ni. ments. Studies in Language 4: 33377.
his semantic diference between accusative (patient-marking) Haiman, John. 1983. Iconic and economic motivation. Language
case and dative (experiencer-marking) case is captured by the 59: 789811.
147
C-Command Cerebellum
Haspelmath, Martin. 1993. More on the typology of inchoative/causative herself, Marys and herself are unable to enter into such a relation
verb alternation. In Causatives and Transitivity, ed. Bernard Comrie (see anaphora and binding).
and Maria Polinsky, 87120. Amsterdam: John Benjamins. Although Reinharts pioneering deinition is formally explicit
Ishihara, Tsuneyoshi, Kaoru Horie, and Prashant Pardeshi. 2006. What and strongly supported empirically, questions arise regard-
does the Korean double causative reveal about causation and Korean?
ing explanatory depth, as with any deinition. In this respect,
A corpus-based contrastive study with Japanese. In Japanese/Korean
S. Epstein and colleagues (1998) ask:
Linguistics. Vol. 14. Ed. Vance, Timothy, 32130. Stanford, CA: CSLI.
Kemmer, Susanne, and Ariel Verhagen. 1994. he grammar of caus- (i) Why should this formal relation, and not any deinable
atives and the conceptual structure of events. Cognitive Linguistics other, constrain syntactic relations?
5: 11556.
Kuroda, Shige-Yuki. 1993. Lexical and productive causatives in
(ii) Why is (irst) branching relevant?
Japanese: An examination of the theory of paradigmatic structure. (iii) Why must not dominate or equal ?
Journal of Japanese Linguistics 15: 181.
Miyagawa, S. 1998. (S)ase as an elsewhere causative and the syntactic
Epstein and colleagues argue that (iiii) receive natural answers
nature of words. Journal of Japanese Linguistics 16: 67110. under a bottom-up derivational approach with recursive appli-
Okada, J. 2003. Recent trends in Japanese causatives: he sa-insertion cation of the binary operation merge, as independently moti-
phenomenon. In Japanese/Korean Linguistics. Vol. 12. Ed. McClure, vated in minimalism (Chomsky 1995). C-command is then
William. 2839. Stanford, CA: CSLI. arguably an emergent property of this structure-building process
Shibatani, Masayoshi. 1976a. he grammar of causative constructions: A and is expressible in terms of merge, as in (3):
conspectus. In Shibatani 1976b, 140.
Shibatani, Masayoshi, ed. 1976b. Syntax and Semantics. Vol. 6, he (3) c-commands all and only the terms of the category with
Grammar of Causative Constructions. New York: Academic Press. which was merged in the course of the derivation.
. 2002. he Grammar of Causation and Interpersonal Manipulation.
Under (3), c-command is not deined on assembled trees, but
Amsterdam: John Benjamins.
emerges as a consequence of the merger process by which trees
Shibatani, Masayoski, and Prashant Pardeshi. 2002. he causative con-
tinuum. In Shibatani 2002, 85126.
are built. It then follows that only the irst branching node is rel-
Song, Jae Jung. 1996. Causatives and Causation. London: Longman. evant for computing what c-commands, since this branching
Whaley, Lindsay. 1997. Introduction to Typology. he Unity and Diversity node is precisely the syntactic object resulting from merging with
of Language. New York: Sage Publications. another syntactic category. It also follows that must not domi-
nate , since dominance entails non-merger. Finally, because a
category cannot merge with itself, does not c-command itself.
C-COMMAND
Gerardo Fernndez-Salgueiro, and Samuel David Epstein
An enduring and fundamental hypothesis within syntactic the-
ory is that the establishment of most, if not all, syntactic rela-
tions (agreement, binding, case, control structures,
movement, etc.) requires c-command. Aoun, Joseph, and Dominique Sportiche. 1983. On the formal theory of
Tanya Reinhart (1979) provides the following deinition of government. Linguistic Review 2: 21135.
c-command (see also Edward Klimas (1964) in construction Brody, Michael. 2000. Mirror theory: Syntactic representation in perfect
syntax. Linguistic Inquiry 31.1: 2956.
with):
Chomsky, Noam. 1995. he Minimalist Program. Cambridge, MA: MIT
(1) c-commands if and only if Press.
a. he irst branching node dominating dominates , Epstein, Samuel David, Erich M. Groat, Ruriko Kawashima, and Hisatsugu
and b. does not dominate , Kitahara. 1998. A Derivational Approach to Syntactic Relations.
and c. does not equal . Oxford: Oxford University Press.
Kayne, Richard. 1984. Connectedness and Binary Branching.
To illustrate, consider (2). Dordrecht: Foris.
Klima, Edward. 1964. Negation in English. In he Structure of
Language, ed. Jerry Fodor and Jerrold Katz, 246323. Englewood Clifs,
(2) Sentence
NJ: Prentice-Hall.
Noun Phrase Verb Phrase Langacker, Ronald. 1969. On pronominalization and the chain of com-
mand. In Modern Studies in English, ed. D. Reibel and S. Schane,
Noun Phrase Noun Verb Noun Phrase 16086. Englewood Clifs, NJ: Prentice-Hall.
Lasnik, Howard. 1976. Remarks on coreference. Linguistic Analysis
Marys mother criticizes herself
2: 122.
Reinhart, Tanya. 1979. Syntactic domains for semantic rules. In Formal
Semantics and Pragmatics for Natural Languages, ed. F. Guenthner
Does the noun phrase Marys mother c-command herself in (2)? and S. Schmidt, 10730. Dordrecht: D. Reidel Publishing Company.
he irst branching node dominating Marys mother is sentence,
which dominates herself. Also, Marys mother does not dom-
CEREBELLUM
inate or equal herself. Since c-command obtains, in this sen-
tence Marys mother corefers with herself (i.e., herself must mean he cerebellum is a brain structure located underneath the pos-
Marys mother). By contrast, since Marys fails to c-command terior part of the cerebral hemispheres. hree anatomical loops
148
Cerebellum
connect the cerebellum to various parts of the nervous system evidence for the involvement of the cerebellum in speech pro-
(Ramnani 2006). he cerebro-cerebellar loop has been of partic- duction. In a recent study, participants performed a syllable rep-
ular interest to language researchers because this pathway sup- etition task in which speech rate was varied. he results showed
ports anatomical connections between the cerebellum and the increases in cerebellar activity that corresponded with increases
cortex, potentially including language-related cortical regions in in speech rate (Riecker et al. 2006).
the contralateral cerebral hemisphere. Recent advances in neu- Cerebellar contributions to speech extend to the domain of
roimaging techniques, clinical testing, and anatomical methods perception. Lesions to the cerebellum produce deicits in tem-
provide evidence that strongly implicates the cerebellum in a poral duration discrimination and impair categorical perception
broad range of language-related tasks, including those involv- for consonants that difer in the onset of voicing (Ackermann et
ing speech production and perception, single-word reading, and al. 1997). his clinical evidence is consistent with neuroimag-
higher-level language processing. ing data that show increases in right cerebellar activity during a
duration discrimination task for linguistic items (Mathiak et al.
Historical Perspectives 2002). Other neuroimaging results suggest that the cerebellum
Historically, the functions of the cerebellum were thought to be is also involved in learning new perceptual distinctions, such
limited to motor processes, such as motor control, performance, as the non-native /r/-/l/ phonetic contrast for Japanese speak-
and skill acquisition. Basic neuroscience research has led to difer- ers (Callan et al. 2003). As in the motor literature, many of these
ent proposals regarding its role in motor processes such as error- studies provide evidence that the cerebellum may be important
driven learning (Marr 1969) and internal timing (Ivry 1996). In the for coordination and timing in the production as well as the per-
late 1980s, H. C. Leiner, A. L. Leiner, and R. S. Dow (1986) proposed ception of speech (for a discussion, see Ivry and Spencer 2004).
that the cerebellum is not exclusively involved in motor functions Other research also draws upon knowledge from the motor
but that it also contributes to cognitive processes. Speciically, they literature to emphasize a potential role of the cerebellum in error
argued for a putative role in language because the evolutionary correction. he luency of normal speech has led many models
development of the cerebellum paralleled a similar evolution of of speech production to incorporate a mechanism for monitor-
cortical areas associated with linguistic functions (e.g., brocas ing and correcting speech errors (Postma 2000). More detailed
area) (Leiner, Leiner, and Dow 1993). Based on the homogeneity computational work has mapped certain processes in speech
of cerebellar cellular organization, a similar role was attributed to production to speciic brain regions and deined the cerebellum
the cerebellum across both motor and non-motor domains. as an important component in monitoring (Guenther, Ghosh,
Empirical work providing support for the claims proposed by and Tourville 2006). Similar ideas have emerged in models of
Leiner, Leiner, and Dow began to emerge in the late 1980s and verbal working memory. Speciically, J. E. Desmond and col-
1990s (Desmond and Fiez 1998). A positron emission tomogra- leagues (1997) suggest that a rehearsal process that relies on
phy (PET) study conducted by S. E. Petersen and J. A. Fiez (1993) inner speech to maintain verbal items in working memory may
showed increases in cerebellar activity during a verb generation also implement an error correction process. In their model,
task. his neuroimaging inding was consistent with a follow-up inputs from frontal and parietal cortex into superior and infe-
case study of a patient with a lateral cerebellum lesion (Fiez et rior regions of the cerebellum are used to calculate and correct
al. 1992). he patient showed particularly poor performance on discrepancies between phonological and articulatory codes in
verb generation despite the fact that other neuropsychological order to improve memory performance.
assessments were within the normal range.
READING. Data from neuroimaging studies consistently show
Current Perspectives cerebellar activation during single-word reading tasks (Fiez and
he cerebellum has been implicated in a broad range of language- Petersen 1998; Turkeltaub et al. 2002). In addition, individuals with
related tasks. he majority of the work, however, can be related developmental reading disorders show some of the same symp-
to one of three domains: 1) speech production and perception, toms that are often seen in patients with cerebellar damage, such
2) reading, and 3) higher-level word processing. In order to as poor duration discrimination and impaired gross motor func-
account for the cerebellums function in language, investigators tions (Nicolson, Fawcett, and Dean 1995). hese observations led
have made reference to the timing and error correction functions R. Nicolson, A. Fawcett, and P. Dean (2001) to propose a relation-
that have been attributed to the cerebellum in the motor litera- ship between cerebellar deicits and developmental reading disor-
ture. For a review on how these may be general mechanisms that ders. Consistent with this idea, anatomical indings have reported
contribute to both motor and non-motor processes, see Ivry and smaller right anterior lobes of the cerebellum in children diagnosed
Spencer (2004) and Doya (2000). with developmental dyslexia (Eckert et al. 2003). his work in
developmental reading disorders has focused on the importance of
SPEECH PRODUCTION AND PERCEPTION. During speech produc- cerebellar involvement in coordination and timing. Integrating the
tion, the control, coordination, and timing of movements are neuroanatomical indings with behavioral work on dyslexia will be
essential. Not surprisingly, clinical indings demonstrate pro- key for establishing a speciic role for the cerebellum in reading.
found speech and motor deicits associated with lesions to the Recent neuropsychological research provides mixed ind-
cerebellum. One common speech disorder resulting from dam- ings on the causal relationship between lesions to the cerebel-
age to the cerebellum is dysarthria, which is characterized by dis- lum in adult skilled readers and reading diiculties. One study
torted and slurred speech that is often monotonic and of a slower found that patients with lesions to the cerebellar vermis had
rate (Dufy 1995). neuroimaging studies provide further more errors in single-word reading when compared to controls
149
Cerebellum
(Moretti et al. 2002). On the other hand, a study of native English Doya, K. 2000. Complementary roles of basal ganglia and cerebellum
speakers with lesions to the lateral cerebellar hemispheres did in learning and motor control. Current Opinion in Neurobiology
not ind any reading diiculties at the level of single words or 10: 7329. his paper suggests that the cerebellum is part of a more
text (Ben-Yehudah and Fiez 2008). hese seemingly inconsistent general supervised learning system that is guided by error signals.
Dufy, J. R. 1995. Motor Speech Disorders. St. Louis, MO: Mosby.
indings may be due to diferences in the site of the cerebellar
Eckert, M., C. Leonard, T. Richard, E. Aylward, J. homson, and V.
lesions in the two patient groups.
Berninger. 2003. Anatomical correlates of dyslexia: Frontal and cer-
ebellar indings. Brain 126 (Part 2): 48194.
HIGHER-LEVEL LANGUAGE. here is accumulating evidence that
Fiez, J. A., and S. E. Petersen. 1998. Neuroimaging studies of word read-
higher-level language processes may also involve the cerebellum,
ing. Proc Nat Acad Sci USA 95: 91421.
although this level has received less attention. A meta-analysis of Fiez, J. A., S. E. Petersen, M. K. Cheney, and M. E. Raichle. 1992. Impaired
the neuroimaging literature conducted by P. Indefrey and W. J. M. non-motor learning and error detection associated with cerebellar
Levelt (2004) reveals increased activity in the cerebellum for tasks damage. Brain 115: 15578.
that require higher-level word processing; such tasks include picture Gebhart, A. L., S. E. Petersen, and W. T. hach. 2002. Role of the poste-
naming and verb generation (Indefrey and Levelt 2004) or internal rolateral cerebellum in language. Annals of the New York Academy of
generation of semantic word associations (Gebhart, Petersen, and Sciences 978: 31833.
hach 2002). It is important to note that higher-level language pro- Guenther, F. H., S. S. Ghosh, and J. A. Tourville. 2006. Neural modeling
cesses seem to recruit more lateral areas, often in the contralateral and imaging of the cortical interactions underlying syllable produc-
tion. Brain and Language 96: 280301.
right hemisphere of the cerebellum (Indefrey and Levelt 2004).
Indefrey, P., and W. J. M. Levelt. 2004. he spatial and temporal signa-
Neuropsychological studies have observed impairments in higher-
tures of word production components. Cognition 92: 10144.
level language processing (Silveri, Leggio, and Molinari 1994; Riva
Ivry, R. B. 1996. he representation of temporal information in percep-
and Giorgi 2000), including poor performance on a grammaticality tion and motor control. Current Opinion in Neurobiology 6: 8517.
judgment task relative to controls (Justus 2004). Ivry, R. B., and R. M. Spencer. 2004. he neural representation of time.
Current Opinion in Neurobiology 14: 22532. his article discusses
Summary timing processes and their potential neural correlates. A review of the
In summary, these data collectively provide strong support for evidence from many diferent methods is provided, with an empha-
cerebellar involvement in many aspects of language, including sis on the potential contributions made by the cerebellum and basal
speech processing, reading, and higher-level language process- ganglia.
ing. hey also suggest that there may be diferent regions of the Justus, T. 2004. he cerebellum and English grammatical morphonol-
cerebellum that are involved in diferent types of language tasks. ogy: Evidence from production, comprehension, and grammaticality
his observation is consistent with an emerging concept that judgments. Journal of Cognitive Neuroscience 16.7: 111530.
distinct cerebro-cerebellar loops support cerebellar interactions Kelly, R. M., and P. L. Strick. 2003. Cerebellar loops with motor cortex
with cortex, thus potentially enabling the cerebellum to apply and prefrontal cortex of a nonhuman primate. Journal of Neuroscience
23.23: 843244. his article shows cerebellar regions that receive input
one or more of its suggested functions (e.g., error correction) to
from the same cerebral cortical regions they project to. hey hypoth-
separate input-output loops (Kelly and Strick 2003). According
esize closed cerebro-cerebellar loops for the basis of the interactions
to this view, language tasks that rely on diferent cortical regions
between cerebellum and cortex.
would engage distinct cerebro-cerebellar loops that recruit spe- Leiner, H. C., A. L. Leiner, and R. S. Dow. 1986. Does the cerebellum con-
ciic cerebellar regions. tribute to mental skills? Behavioral Neuroscience 100.4: 44354.
Sara Guediche, Gal Ben-Yehudah, and Julie A. Fiez . 1993. Cognitive and language functions of the human cerebel-
lum. TINS 16.11: 4447.
Marr, D. 1969. A theory of cerebellar cortex. J Physiol 202.2: 43770.
Mathiak, K., I. Hertrich, W. Grodd, and H. Ackermann. 2002. Cerebellum
Ackermann, H., S. Graber, I. Hertrich, and I. Daum. 1997. Categorical and speech perception: A functional magnetic resonance imaging
speech perception in cerebellar disorders. Brain and Language study. Journal of Cognitive Neuroscience 14.6: 90212.
60: 32331. Moretti, R., A. Bava, P. Torre, R. M. Antonello, and G. Cazzato. 2002.
Ben-Yehudah, G., and J. Fiez. 2005. Impact of cerebellar lesions on read- Reading errors in patients with cerebellar vermis lesions. Journal of
ing and phonological processing. Annals of the New York Academy of Neurology 49: 4618.
Sciences 1145: 26074. Nicolson, R., A. Fawcett, and P. Dean. 1995. Time estimation deicits
Callan, D. E., K. Tajima, A. M. Callan, R. Kubo, S. Masaki, and R. Akahane- in developmental dyslexia: Evidence of cerebellar involvement. Proc
Yamada. 2003. Learning-induced neural plasticity associated with Biol Sci 259.1354: 437.
improved identiication performance after training of a diicult sec- . 2001. Developmental dyslexia: he cerebellar deicit hypothesis.
ond-language phonetic contrast. NeuroImage 19: 11324. Trends in Neurosciences 24.9: 50811.
Desmond, J. E., J. Gabrieli, A. Wagner, B. Ginier, and G. Glover. 1997. Petersen, S. E. ,and J. A. Fiez. 1993. he processing of single words studied
Lobular patterns of cerebellar activation in verbal working memory with positron emission tomography. Annual Reviews of Neuroscience
and inger-tapping tasks as revealed by functional MRI. Journal of 16: 50930.
Neuroscience 17.24: 967585. Postma, A. 2000. Detection of errors during speech production: A review
Desmond, J. E., and J. A. Fiez. 1998. Neuroimaging studies of the cere- of speech monitoring models. Cognition 77: 97131.
bellum: Language, learning and memory. Trends in Cognitive Sciences Ramnani, N. 2006. he primate cortico-cerebellar system: Anatomy and
2.9: 355358. his article reviews neuroimaging evidence suggesting function. Nature Reviews Neuroscience 7: 51122. his review provides
that the cerebellum is involved in cognitive tasks, including those that a brief description of cerebellar anatomy, and stresses integrating ana-
involve learning, memory, and language. tomical, computational, and experimental knowledge.
150
Charity, Principle of
Riecker, A., J. Kassubek, K. Groschel, W. Grodd, and H. Ackermann. 2006. these beliefs: Communication begins where causes converge
he cerebral control of speech tempo: Opposite relationship between (Davidson [1983] 2001, 151). In later years, Davidson liked to use
speaking rate and BOLD signal changes at striatal and cerebellar struc- the metaphor of triangulation for this three-way interaction
tures. NeuroImage 29: 4653. among speaker, interpreter, and external object (cf. Davidson
Riva, D., and C. Giorgi. 2000. he cerebellum contributes to higher func-
[1991] 2001).
tions during development: Evidence from a series of children surgi-
he principle of charity does not exclude the possibil-
cally treated for posterior fossa tumours. Brain 123: 105161.
Silveri, M., M. Leggio, and M. Molinari. 1994. he cerebellum contrib-
ity of error; speakers are to be right when plausibly possible
utes to linguistic production: A case of agrammatic speech following a (Davidson [1973] 1984, 137). Charity, thus, in certain situations
right cerebellar lesion. Neurology 44.11: 204750. actually prevents the interpreter from ascribing beliefs of his or
Turkeltaub, P., G. Eden, K. Jones, and T. Zeiro. 2002. Meta-analysis of her own to the speaker, for instance, perceptual beliefs about
the functional neuroanatomy of single-word reading: Method and vali- objects the speaker cannot perceive from his or her position in
dation. NeuroImage 16.3 (Part 1): 76580. space, or beliefs it would be irrational for the speaker to hold on
the basis of other beliefs. If something false follows rather directly
CHARITY, PRINCIPLE OF from other beliefs the speaker holds, charity might even call for
ascribing outright mistakes. he rationality induced by the prin-
A charity principle is a principle governing the interpretation of ciple is of a minimal, subject-internal character.
the speech and thought of others. It says that the correct inter- For Davidson, the principle of charity plays a double role: On
pretation of certain kinds of expressions, areas of discourse, the one hand, it provides the method for the radical interpreter,
or whole languages maximizes truth and rationality across the but it does so because it, on the other hand, is the principle meta-
(relevant) beliefs of its subject. According to Donald Davidson, physically determining meaning (and belief content): What
the main defender of a principle of charity, its validity derives a fully informed interpreter could learn about what a speaker
from the essentially rational and veridical nature of belief and means is all there is to learn; the same goes for what the speaker
thought. believes (Davidson [1983] 2001, 148). his is a kind of superve-
Principles of charity are of central importance in discussions nience: According to Davidson, meaning (and content) super-
of radical interpretation or radical translation. In W. V. vene on (dispositions to) observable behavior in observable
O. Quines version, charity governs the translation of the logical circumstances. hat is, there cannot be a diference in meaning
constants (cf. Quine 1960, 59). According to Donald Davidson, (or content) without a (potential) diference in behavior. his
charity governs the radical interpretation of all expressions of a can be called a weak semantic behaviorism, but according to
language. In an early formulation, it tells the radical interpreter Davidson, meaning (and content) cannot be reduced to behav-
to optimize agreement by assigning truth conditions to alien ior. hat meaning is determined by charity leaves room for a cer-
sentences that make native speakers right when plausibly pos- tain indeterminacy, according to Davidson, but does not lead
sible (Davidson [1973] 1984, 137). To make native speakers to antirealism or skepticism about meaning or thought content.
right is to interpret them as having beliefs that are largely true Because of the role that external objects, as shared causes, play
and coherent with each other. Later, Davidson distinguished in the determination of content for basic perceptual beliefs, he
explicitly between these two aspects of charity: thought of his own position as a kind of externalism (cf. Davidson
he Principle of Coherence prompts the interpreter to discover 2001; see meaning externalism and internalism).
a degree of logical consistency in the thought of the speaker; the he principle of charity has been widely discussed. Not only
Principle of Correspondence prompts the interpreter to take have questions of its exact formulation and of its truth or valid-
the speaker to be responding to the same features of the world ity been raised but also the question of what kind of a truth it
that he (the interpreter) would be responding to under similar is, if any. What is its epistemic status a priori or a posteriori?
circumstances. Both principles can be (and have been) called And what is its metaphysical status necessary or contingent?
principles of charity: One principle endows the speaker with a Davidson mostly thought of charity as a principle constitutive of
modicum of logical truth, the other endows him with a degree of thought and meaning, an a priori truth of conceptual necessity.
true belief about the world. Successful interpretation necessarily Many commentators have claimed that radical interpretation is
invests the person interpreted with basic rationality. (Davidson supposed to provide an (a priori) argument for charity: If radi-
[1991] 2001, 211) cal interpretation is possible, charity is valid (see, for example,
Lepore and Ludwig 2005, 204 f). But according to others, the
Coherence restricts belief ascription in terms of the logical rela- direction of argument can only be the opposite: If charity holds,
tions among the beliefs of a speaker. Correspondence restricts radical interpretation is possible (Davidson 1994, 122; Gler
the ascription of empirical beliefs to a speaker in terms of 2006, 344). hen, Davidson would be seen as arguing for charity
their truth. Since this can only be done according to the inter- from considerations regarding the nature of thought content, its
preters own view of what is true, following the principle of holism and externalist determination (cf. Davidson [1991] 2001;
correspondence amounts to agreement maximization 1999, 343; 2001). Partly against Davidson, it has been argued that
between speaker and interpreter. Here, Davidson more and charity can only be an a posteriori necessity (cf. Fllesdal 1982;
more emphasized a causal element; in the most basic percep- Gler 2006) and that it, like other nomological principles, can be
tual cases, the principle of correspondence calls for the ascrip- justiied by the principles of empirical science (cf. Pagin 2006).
tion of beliefs shared by speaker and interpreter. he objects of
these beliefs are determined as the shared, external causes of Kathrin Gler
151
Childrens Grammatical Errors
WORKS CITED AND SUGGESTIONS FOR FURTHER READING grapple with. Howsoever powerful the childs innate mecha-
Davidson, Donald. [1973] 1984. Radical interpretation. In Inquiries into
nisms might be, they do not equate to an attribute (language)
Truth and Interpretation, 12539. Oxford: Clarendon Press. that comes into the world fully formed at birth. Instead, there is a
. [1983] 2001. A coherence theory of truth and knowledge. In bridge to be crossed from what Noam Chomsky (1980) has called
Subjective, Intersubjective, Objective: 13753. Oxford: Clarendon Press. the childs initial state (the genetic endowment for language,
. [1991] 2001. hree varieties of knowledge. In Subjective, present at birth) to the steady state (the mature knowledge of
Intersubjective, Objective, 20520. Oxford: Clarendon Press. grammar inally attained). Several explanations are available to
. 1994. Radical interpretation interpreted. Philosophical deal with this problem (see innateness and innatism). Of
Perspectives 8: 1218. note here is the simple point that such explanations are required
. 1999. Reply to Andrew Cutrofello. In he Philosophy of Donald by nativists and, inevitably, muddy the waters both theoretically
Davidson, ed. L. Hahn, 3424. Chicago: Open Court.
and empirically.
. 2001. Externalisms. In Interpreting Davidson, ed. P. Kotatko,
On the nurture side of the nature-nurture fence, speech errors
P. Pagin, and G. Segal, 116. Stanford, CA: CSLI.
Fllesdal, Daginn. 1982. he status of rationality assumptions in inter-
(grammatical and otherwise) again present a vexing issue that
pretation and action explanation. Dialectica 36: 30116. needs to be addressed. In particular, the behaviorist approach to
Gler, Kathrin. 2006. he status of charity I: Conceptual truth or apos- language acquisition has been castigated for an excessive reliance
teriori necessity? International Journal of Philosophical Studies on operant conditioning as a mechanism of language learning.
14: 33760. B. F. Skinner (1957) argued that one of the key processes in lan-
Lepore, Ernest, and K. Ludwig. 2005. Donald Davidson: Meaning, Truth, guage development was the shaping of the childs verbal behav-
Language, and Reality. Oxford: Clarendon Press. ior through reward. On this view, child utterances are rewarded
Pagin, Peter. 2006. he status of charity II: Charity, probability, and sim- according to their proximity to the adult models provided. But
plicity. International Journal of Philosophical Studies 14: 36184.
this is problematic. In a celebrated demolition of Skinners the-
Quine, Willard V. O. 1960. Word and Object. Cambridge, MA: MIT Press.
sis, Chomsky (1959, 42) pointed out that operant conditioning
cannot tell the whole story since a child will be able to construct
and understand utterances which are quite new, and are, at the
CHILDRENS GRAMMATICAL ERRORS
same time, acceptable sentences in his language. hus, operant
Language learners make errors. his observation, easily veriied, conditioning cannot account for novelty.
is not conined to children in thrall to their irst encounter with Similarly, imitation cannot account for the childs speech,
language. It applies equally well to adults acquiring a further lan- particularly errors. Although not mentioned by Chomsky (1959),
guage (see second language acquisition). And it applies grammatical errors do, in fact, present the most striking dem-
to cases of both typical and atypical language development (see onstration that language acquisition is not largely based on
specific language impairment). here is, then, nothing imitation. he reason is that children are exposed to very few
abnormal about speech errors. hey are an intrinsic feature of grammatical errors in the input they receive from parents.
language acquisition and do not mark out special cases of learn- Hence, there are very few faulty models for children to copy. For
ing but, rather, constitute the norm. In this vein, one might argue example, Elissa Newport, H. Gleitman, and L. R. Gleitman (1977)
that the very notion of language development almost inevitably report just one instance of parental ungrammaticality in a cor-
implies the occurrence of errors. Perfect speech could not read- pus of 1,500 utterances directed toward young children. In con-
ily be ascribed to a language learner, after all. It is not surprising, sequence, one cannot easily blame the parents for child errors.
therefore, to ind that linguistic errors have featured prominently A further critical point is that the child cannot imitate grammar,
in research on language development. As a universal feature of only the products of grammar (sentences). Perhaps not surpris-
language acquisition, errors provide not only evidence that ingly, since the demise of behaviorism, several other theories of
learning is taking place but also, in some cases, evidence of how language acquisition have been promulgated that do not rely on
that learning occurs. operant conditioning or imitation as their mainstay.
Childrens speech errors range over every level of lan- Beyond their relevance for the nature-nurture issue, child
guage: phonological (see phonology, acquisition of); errors have been studied because of the insights they furnish
lexical (see lexical acquisition); morphological (see mor- about the processes of language acquisition. For example, an
phology, acquisition of); and grammatical (see syntax, error like i thought they were all womans reveals that the
acquisition of). Grammatical errors are of special interest child has extracted the regular suixation rule for forming plurals
because they are germane, in the ield of language acquisition, of nouns in English. hat is, the child knows to add -s to singular
to the nature-nurture controversy. Barbara C. Scholz and G. K. nouns in order to mark plurality. he childs error lies in mistak-
Pullum (2006, 60) usefully encapsulate the nativist credo: [M]ost ing woman for a regular noun. his kind of error is commonly
of the acquisition of natural languages by human beings depends described as an overregularization, since the plural rule for
on unacquired (or acquired but unlearned) linguistic knowl- regular forms (add S) has been applied beyond its conventional
edge or language-specialized cognitive mechanisms. Child conines to an irregular noun. hus, errors of this kind illuminate
grammatical errors present a problem, therefore. If the bulk of the childs ability to extract and generalize a morphological rule.
what is acquired is unlearned, why is there a protracted period We know that the child is indeed adding -s to make plu-
in a young childs life (several years) during which language is rals, even in the case of regular plurals like coconuts. his lat-
manifestly imperfect? At the very least, grammatical errors throw ter fact has been established even though it is conceivable that
into sharp relief the messiness of the data which nativists must the child has simply heard the form coconuts in the input and
152
Childrens Grammatical Errors
stored it whole, entirely unaware that the word can be parsed despite the fact that our linguistic knowledge (competence) may
into coconut and the plural marker -s. Jean Berko Gleason (1958) be lawless.
invented nonsense words (including wug) to denote birdlike Adult speech (in particular, speech directed toward other
creatures (also invented), in one of the irst experiments in the adults) may be laden with false starts, hesitations, unnecessary
ield of child language. Children were shown a picture of one of repetitions, and slips of the tongue. he default assumption about
these creatures and heard his is a wug. hey were then shown adult errors is that they are the product of faulty performance.
a picture with two of these creatures and heard Now there are Child speech errors, on the other hand, are more likely ascribed
two of them. here are two he pronunciation of two is left to an immature competence. However, all the factors that apply
hanging in the air, inviting the child to complete the sentence. to adults as causes of performance errors apply equally well to
And, indeed, children four years of age will often declare here children. At the same time, the task of distinguishing errors of
are two wugs. Since the child has never before encountered the competence from errors of performance is empirically fraught.
word form wugs in the input, we can be sure that this word has And tellingly, it is a task that researchers have not even begun
been assembled on-line using the new word form wug and prior to tackle with any serious purpose (though, see Jaeger 2005 for
knowledge of the plural suix -s. work on childrens slips of the tongue). With regard to adult
What is almost always overlooked, or possibly just taken for errors, there is also a scarcity of evidence to support the assump-
granted, in research on childrens errors is the fact that error tion that they are, unfailingly, the product of performance fac-
is an intrinsically relative concept. In the case of language learn- tors. It may well turn out, on closer inspection, that adults vary in
ers (young and old), utterances can be judged against the stan- terms of their grammatical competence.
dard of an expert, typically a parent or other native speaker. As noted, theories of grammar acquisition tend to assume
When my four-year-old son said, Whats the man who the for- that immature competence lies at the root of grammatical errors.
est doing? I registered an error, based on the dictates of how I A notable exception is found in the study of childrens past tense
would have said the same sentence (possibly, Whats the man errors. he so-called words and rules theory suggests that when
who is in the forest doing?). But the intuitions of a parent are children learn an irregular past tense form (e.g., broke), it auto-
not suicient proof that a given child sentence is ungrammatical. matically blocks the application of the regular suixation process
Parental intuitions, as a form of evidence, are neither objective (break + -ed breaked). In this way, errors are avoided (Marcus et
nor decisive. Nevertheless, linguists continue to rely on intuitions al. 1992). Of course, young children do produce errors from time
as a primary source of data on grammaticality (Smith 2004). to time. To explain these errors, Gary F. Marcus and colleagues
It is argued that the intuitions of an adult native speaker con- (1992) suggest that young childrens memory retrieval system is
stitute direct evidence for mental grammar, that is, the knowl- immature and sometimes lets them down. In consequence, the
edge of grammar residing in the head of an individual human child may occasionally fail in an attempt to retrieve an irregu-
being. However, the judgment of what is and is not grammatical lar form like broke. his failure then triggers the default regular
is embedded in social convention. Whatever rule happens to be process to produce breaked. Hence, the explanation for child
mentally represented by a given individual, and whatever intu- errors is based on limitations in performance, not competence.
itions that rule gives rise to, its acceptance as part of the grammar In support of this idea, it is argued that overregularization rates
for a given language is judged in comparison with the intuitions are generally very low, something like 4 percent (Marcus et al.
of other persons. hus, the grammaticality of a child utterance 1992). his rarity lends itself to a performance-based explana-
will be judged against the intuitions of the parent or, in some tion for what prompts the childs errors. In the event, error rates
cases, a passing child language researcher. he social nature of may be considerably higher than initial estimates might indicate
this process is rooted in the appointment (or self-appointment) (Maslen et al. 2004). Sampling limitations may mask brief peri-
of one or more people as arbiters over the grammaticality of any ods of very high error rates, especially for high-frequency verbs.
given utterance. A further problem is that there is no empirical support for the
Evidently, decisions about when an error really is an error speculation that errors are caused by failures in memory retrieval.
are not entirely straightforward. And even when one has made Very little is known about retrieval processes in young children,
that judgment (on whatever basis), one is then faced with a fur- especially in connection with language. Whatever the merits of
ther diicult issue that has, hitherto, received scant attention. In the words-and-rules account of past tense errors, it does at least
short, does a given error arise from an immature or incomplete raise awareness that childrens grammatical errors may not nec-
knowledge of grammar? Or is the underlying knowledge base essarily stem solely from an immature competence.
entirely adultlike, but somehow an error has slipped out, owing As noted, the fact that children produce errors in the course
to a technical hitch in production? In this vein, Chomsky (1965) of language acquisition is uncontroversial. Where controversy
distinguished between competence and performance. does arise is in the attempt to explain how children expunge
Competence refers to the speaker-hearers tacit knowledge of errors and move toward a more adultlike system of grammar.
his or her language. Performance, on the other hand, comprises he obvious solution to the childs problem is for parents and
the use of this knowledge in producing speech. he utterances others to supply corrections. Corrections for grammatical errors
we produce arise from both our linguistic competence and other are often referred to as negative evidence, that is, evidence that
intervening performance factors, including the limitations of some structure is not permitted by the target grammar. However,
short-term memory, motor control over the execution of speech opinions difer sharply as to whether negative evidence is avail-
plans, and even the efects of anxiety or alcohol. Cognitive fac- able to children. Roger Brown and C. Hanlon (1970) demon-
tors of this kind can cause errors to creep into our speech output strated that parents do not overtly disapprove of their childrens
153
Childrens Grammatical Errors Chirographic Culture
grammatical errors. hus, they do not reliably mark grammat- provides a strong stimulus for language scientists to seek expla-
ical errors with injunctions like Dont say that or No, thats nations for how and why language learners difer from fully com-
wrong. his inding has exerted an enormous inluence in the petent native speakers.
ield of child language, being hailed by Steven Pinker as one
Matthew Saxton
of the most important discoveries in the history of psychology
(1988, 104). Undoubtedly, Pinker overstates the case. But his WORKS CITED AND SUGGESTIONS FOR FURTHER READING
enthusiasm stems from the perception that a crucial aspect of
Berko Gleason, Jean. 1958. he childs learning of English morphology.
linguistic knowledge could not have arisen in the childs mind
Word 14: 15077.
through the mediation of the environment. hat is, if children
Brown, Roger, and C. Hanlon. 1970. Derivational complexity and order
receive no help or information in the input from parents con-
of acquisition in child speech. In Cognition and the Development of
cerning what is or is not grammatical, then one must conclude Language, ed. J. Hayes, 1153. New York: John Wiley.
that the childs knowledge in this respect is innate. Observe that Chomsky, Noam. 1959. Review of B. F. Skinners Verbal Behavior.
the reach of this conclusion is extensive, since it could, conceiv- Language 35: 2658.
ably, encompass each and every rule or principle of grammar . 1965. Aspects of the heory of Syntax. Cambridge, MA: MIT Press.
in a language. . 1980. Rules and Representations. New York: Columbia University
Nativist enthusiasm for what is known as the no negative evi- Press.
dence assumption is tempered by numerous empirical studies Chouinard, Michelle M., and E. V. Clark. 2003. Adult reformulations
that challenge this assumption. Beginning with Kathy Hirsh- of child errors as negative evidence. Journal of Child Language
30: 63769.
Pasek, R. Treiman, and M. Schneiderman (1984), researchers
Hirsh-Pasek, Kathy, R. Treiman, and M. Schneiderman. 1984. Brown
have noted that the markers of disapproval examined by Brown
& Hanlon revisited: Mothers sensitivity to ungrammatical forms.
and Hanlon (1970) do not constitute the only possible form of Journal of Child Language 11: 818.
corrective input. More recent research has focused on the fre- Jaeger, Jeri J. 2005. Kids Slips: What Young Childrens Slips of the
quent contrasts between erroneous child usage and correct Tongue Reveal about Language Development. Mahwah, NJ: Lawrence
adult models that igure in childadult discourse (Chouinard Erlbaum.
and Clark 2003; Saxton, Backley, and Gallaway 2005). he follow- Marcus, Gary F., S. Pinker, M. Ullman, M. Hollander, T. J. Rosen, and F.
ing example is an exchange between my four-year-old son and Xu. 1992. Overregularization in Language Acquisition. Monographs of
myself (emphases highlight the contrast in linguistic forms, not the Society for Research in Child Development, serial no. 228.
pronunciation stress). Maslen, Robert J. C., A. L. heakston, E. M. V. Lieven, and M. Tomasello.
2004. A dense corpus study of past tense and plural overregulariza-
Child: I thinked about it with my brain. tion in English. Journal of Speech, Language, and Hearing Research
Adult: You thought about it. 47: 131933.
Newport, Elissa, H. Gleitman, and L. R. Gleitman. 1977. Mother, Id
To function as a form of corrective input, contrasts of this kind rather do it myself: Some efects and non-efects of maternal speech
would have to be interpreted by the child as not simply model- style. In Talking to Children: Language Input and Acquisition, ed. C.
ing a correct form. he child would also have to regard them as Snow and C. Ferguson, 109149. Cambridge: Cambridge University
signals that their own previous usage was ungrammatical (for Press.
evidence consistent with this view, see Saxton 1997 and Strapp Pinker, Steven. 1988. Learnability theory and the acquisition of a
and Federico 2000). Curiously, Brown and Hanlon themselves irst language. In he Development of Language and Language
remarked on the corrective potential of contrastive discourse, Researchers: Essays in Honor of Roger Brown, ed. F. Kessel, 97119.
observing that repeats of ill-formed utterances usually con- Hillsdale, NJ: Lawrence Erlbaum.
tained corrections and so could be instructive (1970, 43). Saxton, Matthew. 1997. he contrast theory of negative input. Journal of
Child Language 24: 13961.
However, this observation was entirely overlooked for many
Saxton, Matthew, P. Backley, and C. Gallaway. 2005. Negative input for
years, leading to a considerable distortion of the empirical facts.
grammatical errors: Efects after a lag of 12 weeks. Journal of Child
At the same time, though, and as noted previously, the fact that Language 32: 64372.
contrastive discourse is abundantly available to children does Scholz, Barbara C., and G. K. Pullum. 2006. Irrational nativist exuber-
not entirely resolve the matter. It still remains to be demon- ance. In Contemporary Debates in Cognitive Science, ed. R. Stainton,
strated decisively that children actually perceive such contrasts 5980. Oxford: Basil Blackwell.
as a form of negative evidence and that they exploit that infor- Skinner, Burrhus F. 1957. Verbal Behavior. New York: Appleton-Century-
mation in shedding errors and arriving at a mature system of Crofts.
grammar. Smith, Neil. 2004. Chomsky: Ideas and Ideals. 2d ed. Cambridge: Cambridge
To conclude, childrens grammatical errors demand the University Press.
Strapp, Chehalis M., and A. Federico. 2000. Imitations and rep-
attention of language scientists for two reasons. First, and most
etitions: What do children say following recasts? First Language
obvious, errors stand out. hey attract our attention like brightly
20.3: 27390.
colored lags, lapping above the parapet. And, second, the
investigation of errors reveals much about the processes of lan-
CHIROGRAPHIC CULTURE
guage acquisition. hey provide the paradigm demonstration
that language develops. he fact that errors occur at every level of Writing and script are systems of graphic marks that repre-
language, both in abundance and for extended periods of time, sent words, syllables, or individual sounds (phonemes)
154
Chirographic Culture
of a language. Chirography shares the same deinition with

the added meaning of writing by hand. he term thus applies
to all the writing systems or scripts that followed the irst
invention of writing in Mesopotamia, circa 3200 b.c., and
before Gutenbergs invention of the printing press about
1437. Chirography is generally viewed as the gateway to com-
plex literate societies while leaving behind the archaic oral
cultures.
he nature and extent of the divide between oral and chiro-
graphic cultures has long been a matter of debate. In the ifth
century b.c., in Phaedrus and Letter VII, the Greek philosopher
Plato expressed his concerns with the impact of chirography on
human cognition. He warned that writing would weaken human
memory and threaten scholarship by allowing the ignorant to
fake omniscience. As discussed in Khosrow Jahandaries volume
Spoken and Written Discourse (1999), the present consensus is
less critical. Wherever chirography emerged, in Mesopotamia
about 3200 b.c., in China about 1250 b.c. and in Mesoamerica
circa 650 b.c., it is held as a productive supplement of speech.
his is based on the facts that, irst, the human voice can be
heard only by a small audience but written documents can be
sent to any destination, and, second, speech disappears instan-
taneously, leaving no trace, while texts can be preserved over Figure 1. Cuneiform tablet featuring a list of goods. Courtesy of the Texas
time. It is, therefore, generally agreed that chirography extends Memorial Museum, The University of Texas at Austin.
the network of human communication from culture to cul-
ture and makes it possible to trace the roots of cultures and their
evolution in history. Moreover, by reducing to order and clarity lineal conception of time, that they view events linearly, with a
a myriad of details, chirography is credited with revolutionizing beginning, a middle, and an end. And, in his view, the resulting
record keeping. Registers allow for administering communi- systematic sequential presentation of arguments translated into
ties and keeping track of entries and expenditures, proits, and a more rigorous logic.
losses of businesses. Finally, writing is recognized for creating Also in the 1960s, Eric A. Havelock analyzed how the adop-
data banks far beyond the power of human memory, resulting tion of the Semitic alphabet in Greece inluenced the organi-
in turn in the accumulation and articulation of an unlimited zation of ideas, abstraction, and consciousness. Havelock, as
quantity of complex data regarded as instrumental to the for- well as McLuhan, dealt primarily with alphabetic scripts. In
mulation of signiicant syntheses and the creation of new cogni- contrast, Walter J. Ong, S.J., university professor of humani-
tive skills. In other words, literacy is viewed as enhancing the ties at St. Louis University, and Jack Goody, anthropologist at
possibilities for socially distributed cognition, allowing Cambridge University, included in their analyses prealphabetic
civilization to grow more complex with the administration of chirographic systems, such as the Mesopotamian cuneiform
organizations of greater dimensions and larger political units, script and non-alphabetic oriental writing systems. Among its
the creation of more extensive economies, the development of many important contributions, Ongs book Orality and Literacy
complex sciences and technologies, and a more accurate knowl- ([1982] 1983) makes the case that the nonliterate relates to the
edge of the past. world in a concrete, situational way, downplaying generalization
he major priority of the twentieth century, however, has and abstraction. Relying on Aleksandr R. Lurias psychological
been to investigate the impact of chirography on the human ield work, Ong argued that the illiterate does not name geomet-
mind. With his famous adage the medium is the message, ric igures abstractly as circles or squares but as moons or
Marshall McLuhan emerged as a popular champion of literacy. plates, and doors or mirrors. Furthermore, the nonliterate
In his books he Gutenberg Galaxy (1962) and Understanding avoids self-analysis, which requires abstracting the self from the
Media (1964), the Canadian media critic advocated that media surrounding world.
were not passive conduits of information but, rather, vortices of Goody, the author of he Domestication of the Savage Mind
power restructuring human perceptions. McLuhan argued that (1977) and he Interface Between the Written and the Oral (1987),
by translating sounds into a visual code, writing exchanged an investigated how the series of shifts involved in the develop-
eye for an ear and that [p]honetic culture endows men with the ment of writing restructured human thought. In particular,
means of repressing their feelings and emotions when engaged in he analyzed how the irst Mesopotamian texts that consisted
action (1964, 84, 88); he therefore claimed that literate humans exclusively of lists changed thought processes. He suggested
develop the power of acting with detachment from emotional that the lists of goods generated by the Mesopotamian admin-
involvement. He further emphasized the impact of the linear istration (Figure 1) or the sign lists compiled by scribes encour-
format of writing, pointing out that civilized societies acquire a aged scrutiny by selecting which items to include and which to
155
Schmandt-Besserat in Before Writing (1992) and How Writing

Came About (1996), a system of tokens was used to keep track
of goods in the Near East, starting about 7500 b.c. he tokens
were modeled in clay in multiple shapes, such as cones,
spheres, disks, and cylinders. Each token shape stood for a
unit of an agricultural product: A cone was a small measure
of grain, a sphere stood for a large measure of grain, and a cyl-
inder for an animal. Four thousand years later, in the urban
period circa 3300 b.c., the tokens had evolved into a complex
accounting device with a repertory of about 300 shapes, some
including additional incised or punched markings, to record
the various units of goods manufactured in workshops (Figure
2), such as wool, textiles and garments.
he fact that, in the Near East, the irst script was preceded
by a visual code a system of arbitrary symbols to represent
words sheds new light on chirographys contribution to soci-
ety. In particular, some of the merits formerly attributed to
writing have to be credited to the token system. For example,
tokens, not chirography, shifted an eye for an ear. Like texts,
the tokens were permanent and, therefore, could be transported
or stored. he clay artifacts symbolized units of real goods and
as such handled data in abstraction. Finally, like written lists,
tokens could organize information in successive lines in the
most concise way, allowing scanning, evaluating, scrutinizing,
Figure 2. Tokens from Uruk, Iraq, ca. 3300 B.C. Courtesy Vorderasiatisches and analyzing a budget. As a result, the token system stretched
Museum Berlin, Germany. human cognition by making it possible for the neolithic oral
cultures of the Near East to handle large amounts of complex
information.
Once appropriate recognition is given to the token system,
exclude. Moreover, he argued that the lists segmented reality the revolutionary contributions of chirography become very
by breaking down the perceptual world. For example, a list of clear. First, chirography abstracted numbers (Figure 3). It
tree signs abstracted the trees from the forests to which they should be well understood that the tokens were used in one-to-
belong. In other words, according to Goody, lists decontextu- one correspondence, which means that one, two, or three small
alize data but also regroup elements, ordering them by type, measures of grain were shown by one, two, or three cones. But
shape, size, number, and so on. Consequently, lists reorganize numerals signs that represent numbers abstractly appeared
the world, transforming it into an ideal form and creating a new about 31003000 b.c., after the three-dimensional tokens were
reality upon which the literate is forced to relect at a new level replaced by their images pictographs traced onto clay tab-
of generality. lets. At this point, 10 jars of oil and 10 large units of grain were
Among other twentieth-century authors who considered no longer shown by 10 ovoid tokens and 10 spherical tokens
that writing afected the human mind, David R. Olson, pro- but by one sign standing for ten, followed by a sign for the
fessor of applied cognitive science at the Ontario Institute goods in question. Second, chirography abstracted the sounds
for Studies in Education, emphasized in he World on Paper of speech. Whereas the tokens merely acted as logograms or
(1994) the importance of writing for relecting upon our- signs standing for a concept chirography created phonetic
selves. On the other hand, Bruno Latour, an anthropologist, syllabic signs to write personal names (see Figure 3). In sum,
is among the scholars who disagree with the proposition that compared to tokens that stood for concrete merchandise, the
writing created new cognitive skills. In an article in 1990, he chirographic numerals symbolized oneness, twoness, and
proposed that it is the combination of images and writing in abstract constructs of the mind, and the chirographic phonetic
maps, charts, graphs, photos, and diagrams that create better signs symbolized the immaterial and evanescent sounds of
tools to allow scientists to foray into new ideas. Others credit the voice. By creating numerals and phonetic signs, chirogra-
schooling, rather than writing, for increasing rationality and phy, therefore, raised human cognition to far higher levels of
abstract thought. abstraction.
hese seminal studies of the 196080s must now be updated In the twentieth century, research on the impact of chi-
by taking into account the archaeological discovery that rography on cognition was conined to issues of interest to
the cuneiform script, the earliest chirographic system, was the humanities. In the 21st century, however, the debate was
derived from an earlier visual code. As described by Denise extended to the ield of art. In When Writing Met Art (2007),
156
ancient Near East. he preliterate lines of a repeated motif

were apprehended at a glance, but the narrative composi-
tions of chirographic cultures were read analytically, sequen-
tially. It is generally assumed that the Neolithic motifs, such
as triangles or ibexes, were symbols like the dove is a symbol
of peace in our culture. hus, the preliterate Near Eastern art
probably evoked ideas perhaps profound ideas but only the
art compositions of chirographic cultures could tell complex
stories.
On the basis of these recent indings, the immense legacy
of the irst handwritten texts can now be assessed with greater
clarity. Art demonstrates that chirography created a paradigm
that can be successfully implemented in other communication
systems. Archaeology shows that compared to its archaic token
precursor, chirography meant leaps in abstraction with the cre-
ation of numerals and phonetic signs. By inventing symbols
to express such numbers as 1, 10, and 60, chirography laid the
foundation for the development of mathematics. By represent-
ing the sounds of the voice by phonetic signs, chirography set
Figure 3. Pictographic tablet from Uruk, Iraq, ca. 3000 B.C. Courtesy the stage for writing to become a universal system of communi-
Deutsches Archaeologisches Institut, Berlin, Germany. The tablet features cation emulating speech.
a list of goods. In the upper cases, the units of merchandise are indi-
cated by pictographs or images of tokens traced in clay. Numerals are Denise Schmandt-Besserat
shown with impressed signs. On the lower case, phonetic signs indicate
the name of the recipient or donor.
Coulmas, F., ed. 1999. he Blackwell Encyclopedia of Writing Systems,
Oxford: Oxford University Press.
Schmandt-Besserat argued that ancient Near Eastern art com-
. 2003. Writing Systems: An Introduction to heir Linguistic Analysis.
positions the way images are organized changed with the Cambridge: Cambridge University Press.
advent of chirography in 31003000 b.c. She showed that pre- Goody, Jack. 1977. he Domestication of the Savage Mind.
literate painted or carved compositions consisted of the mere Cambridge: Cambridge University Press.
repetitions of one motif as many times as necessary to cover a . 1987. he Interface between the Written and the Oral.
surface. For instance, the same triangle or ibex was replicated Cambridge: Cambridge University Press.
around the body of the vessel. In contrast, by borrowing the Havelock, Eric. A. 1963. Preface to Plato. Cambridge: Belknap Press of
strategies of writing, compositions of the chirographic cul- Harvard University Press.
tures were able to organize multiple igures into a narrative. To Houston, Stephen D., ed. 2004. he First Writing. Cambridge: Cambridge
University Press.
illustrate this concept, large and small signs of writing denoted
Jahandarie, Khosrow. 1999. Spoken and Written Discourse, A Multi-
greater or lesser units of goods and similarly, the size of images
disciplinary Perspective. Stamford, CT: Ablex Publishing.
indicated status. Gods were represented as larger than kings,
Latour, Bruno. 1990. Drawing things together. In Scientiic Practice,
and the images of kings were shown larger than those of their ed.M. Lynch and S.Woolgar, 1968. Cambridge, MA: MIT Press.
fellow citizens. Just as signs changed value by being placed to McLuhan, Marshall. 1962. he Gutenberg Galaxy: he Making of
the right or the left, above or below other signs, the heroes Typographic Man. Toronto: University of Toronto Press.
actions and interactions were indicated by their orientation, . 1964. Understanding Media: he Extensions of Man. New York: New
order, and direction. For instance, one igure standing in front American Library.
of another was understood as being more important than Niditch, S. 1996. Oral World and Written Word. Louisville, KY:
one standing behind it. From writing, art also acquired ways Westminster John Knox Press.
of loading images with information by using symbols akin to Olson, David R. 1994. he World on Paper. Cambridge: Cambridge
University Press.
determinatives signs denoting a general class. For instance,
Ong, Walter J., S. J. [1982] 1983. Orality and Literacy. London: Methuen.
the horned tiara indicated the divine status of a igure in the
Plato. 1973. Phaedrus and Letters VII and VIII. Trans. Walter Hamilton.
same way the star-shaped sign dingir did in Sumerian cune-
Harmondsworth: Penguin Books.
iform texts. As a result, reading art became akin to reading Schmandt-Besserat, Denise. 1992. Before Writing. Austin: University of
a text. Texas Press.
In sum, art, which is, in at least certain respects, a mirror . 1996. How Writing Came About. Austin: University of Texas
of culture, signals a conceptual change in design composi- Press.
tions that coincides with the advent of chirography in the . 2007. When Writing Met Art. Austin: University of Texas Press.
157
Clitics and Cliticization
CLITICS AND CLITICIZATION coincide (as with French object le, which is both unaccented and
distinctively positioned).
he unusual properties of little words have attracted the atten-
An association between them has been noted at least since
tion of generations of linguists. his is especially true of the items
the classic work of Jakob Wackernagel (1892), who pointed out
known to traditional grammar as enclitics and proclitics and to
that phonetically weak elements in the ancient Indo-European
modern linguistics simply as clitics. Diferent theoretical frame-
languages tended to cluster in second position within the sen-
works have highlighted diferent characteristics of what has often
tence, a position that was quite anomalous from the point of view
been seen as a unitary class of elements, with the result that two
of the rest of the syntax. Much later literature has treated the con-
quite distinct sorts of unusual behavior have not always been
nection between phonological weakness (and especially a lack
carefully distinguished.
of autonomous accent) and unusual positioning as essential,
although the two turn out to be quite separate characteristics,
Two Senses of Clitic not only logically but also empirically. he essential distinction
he etymology of the word clitic (from Greek kli:no lean) between them was pointed out by Arnold Zwicky (1977) and
brings out what seemed most distinctive to an earlier genera- further developed in much later literature, including Anderson
tion of scholars, their tendency to lean on or form part of a (2005).
prosodic unit with a preceding or following word, linked to
their typical lack of autonomous accent. Such attachment
Accounts of Clitic Behavior
may give rise to phonological words containing syntactically
Once we realize that it follows from the prosodically impov-
unrelated material as a result of linear adjacency. In addi-
erished nature of the elements concerned, the phonological
tion to standard cases in Greek, Latin, and Sanksrit, a well-
dimension of clitic behavior inds a natural account in more
known example of this is furnished by the Wakashan language
general theories of prosody and stress, as demonstrated in
Kwakwala, where determiner elements at the left edge of
work such as that of Elizabeth Selkirk (1995). he (morpho-)
nominal expressions (among other clitics) attach phono-
syntactic dimension is somewhat more controversial, how-
logically to the rightmost word of a preceding constituent. In
ever. Special clitics appear in a limited range of positions. Any
a sentence like mx id ida bgWanma-xa gnanma-sa
given clitic can be associated with some syntactic domain, and
kWixayu hit-Det man-Det child-Det club, he man hit the
within this domain it may appear initially, inally, postinitially
child with a club, the words bgWanma-xa man-Obj and
(in second position), preinally, or adjacent to the head of the
gnanma-sa child-Inst end in determiner elements that are
domain.
syntactically linked not to the phrase of which they are phono-
Syntacticians have tended to see special clitics as illing
logically a part but, rather, to the argument that follows in the
normal syntactic positions and then displaced to their sur-
sentence.
face position under the inluence of distinctive movement
In this case, the anomaly is a phonological grouping that does
principles within the syntax. One diiculty with this is the
not appear to relect syntactic organization, and this can plausi-
fact that in some languages (of which certain forms of Serbo-
bly be attributed to the prosodic weakness of the clitic (here the
Croatian are the most discussed), the element deining
determiners). In other instances, however, something else must
second position for the location of clitics is a unit in phono-
be at work. he pronominal object clitics in a French sentence
logical terms (a prosodic word) but not necessarily a unit that
like Je le lui donne I give it to him appear preceding the main
ought to be accessible to the syntax. An alternative that avoids
verb, a position in which objects cannot otherwise occur and
this diiculty is to note the close parallel between possible
which requires reference to ordering principles outside of the
positions for clitics and for aixes and to treat special clitics
languages normal syntax.
as a class of aixes introduced directly into the surface form
Elements appearing in unusual positions in this way are gen-
of phrases by principles closer to those of morphology than
erally also phonologically weak, and this has led to a tendency
to syntax.
to conlate the two sorts of exceptionality, taking prosodic weak-
ness (including lack of stress) as diagnostic of clitics and then
proposing distinctive ordering for the items so identiied. In fact,
Conclusion
he analysis of clitics and the principles by which they ind their
however, some prosodically weak elements appear only in posi-
place in sentence structure and prosody involve an intricate
tions that are quite normal syntactically (e.g., the reduced forms
interplay among all of the major components of grammatical
of is and has in Freds sleeping and Freds lost his dog), while some
structure, including syntax, phonology, morphology, and
elements that are positioned unusually along with other clitics
even semantics. hese elements have been invoked by schol-
are nonetheless prosodically full words, such as Tagalog tayo/
ars in all of these areas as evidence for fundamental claims about
natin we (incl.) and a number of other pronouns and particles
linguistic structure, and an assessment of those claims is only
in this language.
possible on the basis of a clearer understanding of the subdivi-
Such facts suggest that we should recognize two distinct
sions among clitics, and the appropriate mechanisms for accom-
dimensions of unusual behavior of clitics, one phonological
modating their speciic properties, than is often found in the
(associated with prosodic weakness) and the other syntactic
existing literature.
(associated with unusual positioning of a restricted sort, elements
commonly called special clitics). he two often, but not always, Stephen R. Anderson
158
Codeswitching
WORKS CITED AND SUGGESTIONS FOR FURTHER READING Many studies remain at the descriptive level, but at least two
Anderson, Stephen R. 2005. Aspects of the heory of Clitics. Oxford: Oxford
models ofer explanations for why CS occurs within a discourse.
University Press. conversation analysis (CA) analysts emphasize a switchs
Selkirk, Elizabeth. 1995. he prosodic structure of function words. sequential positioning in conversation, claiming that it provides
In Papers in Optimality heory, 43970. University of Massachusetts vital information about its sociopragmatic message (Auer 1998
Occasional Papers in Linguistics 18. inter alia; Li 2005). In contrast, the markedness model empha-
Wackernagel, Jakob. 1892. ber ein Gesetz der indogermanischen sizes that speakers use CS as a tool to present a certain persona;
Wortstellung. Indogermanische Forschungen 1: 333436. they exploit participants sense of the indexicality of each code
Zwicky, Arnold. 1977. On Clitics. Bloomington: Indiana University (see indexicals) and of the contrast between the social import
Linguistics Club. of codes in a given context (Myers-Scotton 1993 inter alia).
Some analysts, such as B. Rampton, C. Stroud, and J. Gafaranga,
CODESWITCHING emphasize CS as exemplifying the speakers creative agency.
CS researchers agree on two points: 1) To engage in CS is
Introduction largely an unconscious move, and 2) speakers seldom intend
Codeswitching (CS) is deined as the use of two or more language
a single, speciic meaning; potentially ambiguous or multiple
varieties in the same conversation, not counting established bor-
meanings are part of the pragmatic message.
rowed words or phrases. Two general types of structural conigu-
Two overlapping generalizations capture diferences in vari-
rations occur. 1) Intersentential CS, switches for one sentence
ous approaches. First, the meaning of strategy, with its implica-
or many, is generally studied for its social implications (1). 2)
tion that CS carries messages of intentionality, divides analysts.
Intrasentential or intraclausal CS is more studied for its gram-
Second, analysts difer on the role of community values and
matical conigurations (24).
participants own sociolinguistic proiles, as well as a varietys
(1) (Policeman to heckler in Nairobi crowd, switching from multiple associations, as they relate to a speakers motivation for
English to Swahilisentences) making a switch.
How else can we restrain people from stealing except by
punishment? CS and Its Grammatical Structure
Wewe si mtu ku-tu-ambia vile tu-ta-fanya kazi tu-na sheria Most analysts agree that CS has a principled grammatical struc-
yetu. ture, but the principles they propose to constrain sentence/clause
Swahili translation: You arent a person to tell us how to do structure vary. Many early studies employed a linear-based
our work we have our laws. framework; for example, Shana Poplack (1980) argues that pos-
(Myers-Scotton 1993, 77) sible switching depends on surface-level syntactic equivalences
across participating languages. Some place importance on distin-
(2) (A clause in French embedded in a Brussels Dutch clause) guishing borrowing from CS through quantitative analyses (e.g.,
[t is dat ][que jai dit madame]. Budzakh-Jones 1998). In contrast, he matrix language frame
hat is what that I told the lady. model links CS at abstract levels to psycholinguistic models
(Trefers-Daller 1994, 30) of language production (Myers-Scotton 1997, 2002). Asymmetries
(3) (Single English content morpheme in a Hungarian-framed between the structuring roles of the participating languages are
clause) stressed. Also, languages do not supply morpheme types equally.
jtsz-ok school-ot he 4-M model and a uniform structure principle explain diferent
play-s.pres school-acc morpheme distributions with more precision (cf. Myers-Scotton
Im playing school. and Jake 2009). Still other researchers argue that current syntactic
(Bolonyai 1998, 34). theory of mainstream generative grammar, though intended
for monolingual data, can explain CS parisomiously (MacSwan
(4) (English verb stem with Swahili aixes in Swahili frame) 2000). Although CS involves bilingual data (see bilingualism
father a-li-m-buy-i-a vi-tabu a-ka-potez-a vy-ote and multilingualism), researchers claim that no dominant
s-past-obj-buy-appl-fv cl-book s-consec-lose- or matrix language is needed. his conclusion is debated (cf.
fv cl-all MacSwan 2005a, 2005b and Jake, Myers-Scotton, and Gross 2002,
father bought for him books and he lost all [of them] 2005). CS as a vehicle in convergence in grammatical patterns is
(Myers-Scotton 1997, 87) also studied (e.g., Muysken 2000; Clyne 2003; Backus 2005).
Carol Myers-Scotton
CS and Its Social Meanings
CS is a means of presenting a particular persona or negotiating WORKS CITED AND SUGGESTIONS FOR FURTHER READING
interpersonal relationships in a given interaction, making it a major
Auer, Peter, ed. 1998. Code-switching in Conversation: Language,
research topic for some sociolinguists and linguistic anthropol-
Interaction and Identity. London: Routledge.
ogists. A starting point is John J. Gumperzs (1982) notion that CS is
Backus, Ad. 2005. Codeswitching and language change: One thing leads
one of the possible contextualization cues of the speakers prag- to another? International Journal of Bilingualism 9: 33740.
matic intentions. Also, researchers often mention E. Gofmans Bolonyai, Agnes. 1998. In-between languages: Language shift/mainte-
concept of footing, and M. Bakhtins concept of speakers mul- nance in childhood bilingualism. International Journal of Bilingualism
tiple voices that are echoes of earlier utterances. 2: 2143.
159
Cognitive Architecture
Budzhak-Jones, Svitlana. 1998. Against word-internal codeswitch- As digital computers evolved, so, too, did the notion of com-
ing: Evidence from Ukrainian-English bilingualism. International puter architecture. Computer designers had to pay attention not
Journal of Bilingualism 2: 16182. only to the needs of the user but also to additional constraints
Clyne, Michael. 2003. Dynamics of Language Contact. that arose with the development of high-level programming
languages and with the invention of new hardware technolo-
Gumperz, John J. 1982. Discourse Strategies. Cambridge: Cambridge
gies. Brookss more modern deinition of architecture relects
University Press.
Jake, Janice, Carol Myers-Scotton, and Steven Gross. 2002. Making a
these developments: he architecture of a computer system we
minimalist approach to codeswitching work: Adding the matrix lan- deine as the minimal set of properties that determine what pro-
guage. Bilingualism, Language & Cognition 5: 6991. grams will run and what results they will produce. he architec-
. 2005. A response to MacSwan (2005): Keeping the matrix lan- ture is thus the systems functional appearance to its immediate
guage. Bilingualism, Language, and Cognition 8: 2716. user (Blaauw and Brooks 1997, 3). he key element here is that
Li, Wei. 2005. How can you tell?: Towards a common sense expla- a computers architecture describes the information-processing
nation of conversational code-switching. Journal of Pragmatics capacities of a device without appealing to its speciic hardware
37: 37589. properties. In short, a computers architecture is a description
MacSwan, Jef. 2000. he architecture of the bilingual language fac- of its logical and abstract information-processing properties
ulty: Evidence from intrasentential code switching. Bilingualism,
(Dasgupta 1989).
Language, and Cognition 3: 3754.
. 2005a. Making a minimalist approach to codeswitching
work: Adding the matrix language. Bilingualism, Language, and
The Cognitive Architecture
Cognition 5: 6991. he concept cognitive architecture is the direct result of apply-
. 2005b. Remarks on Jake, Myers-Scotton and Grosss ing the notion of computer architecture to human cognition.
response: here is no matrix language. Bilingualism, Language and Cognitive scientists assume that cognition is information pro-
Cognition 8: 27784. cessing (e.g., Dawson 1998). Cognition as information process-
Muysken, Pieter. 2000. Bilingual Speech, A Typology of Code-mixing. ing must therefore be characterized by a fundamental set of
Cambridge: Cambridge University Press. logical and abstract properties (e.g., a primitive set of symbols
Myers-Scotton, Carol. 1993. Social Motivations for Codeswitching: Evidence and operations). By identifying this set of properties for human
from Africa. Oxford: Oxford University Press.
cognition, one speciies the cognitive architecture.
. 1997. Duelling Languages, Grammatical Structure in
For example, Z. W. Pylyshyn isolates the basic operations for
Codeswitching. 2d ed. Oxford: Oxford University Press.
storing and retrieving symbols, comparing them, treating them
. 2002. Contact Linguistics: Bilingual Encounters and Grammatical
Outcomes. Oxford: Oxford University Press. diferently as a function of how they are stored (hence, as a func-
. 2006. Multiple Voices: An Introduction to Bilingualism. tion of whether they represent beliefs or goals), and so on, as well
Oxford: Blackwell. Chapters 6 and 9 deal with codeswitching for as such basic resources and constraints of the system, as a lim-
advanced undergraduates. ited memory. It also includes what computer scientists refer to as
Myers-Scotton, Carol, and Janice Jake. 2009. A universal model of code- the control structure, which selects which rules to apply at vari-
switching and bilingual language processing and production. In he ous times (1984, 30). It is no accident that this account empha-
Cambridge Handbook of Linguistic Code-Switching, ed. B. Bullock and sizes symbols and primitive operations for their manipulation.
A. Toribio, 33657. Cambridge: Cambridge University Press. his is because Pylyshyn wants to ensure that the architecture
Poplack, Shana.1980. Sometimes Ill start a sentence in English Y
is indeed cognitive, which for him means that it must be repre-
TERMINO EN ESPAOL: Toward a typology of code-switching.
sentational: It [the cognitive architecture] is the level at which
Linguistics 18: 581618.
the system is representational, and where the representations
Trefers-Daller, Jeanine. 1994. Mixing Two Languages: French-Dutch
Contact in a Comparative Perspective. Berlin: Mouton de Gruyter. correspond to the objects of thought (1991,191). here may be
Winford, Donald. 2003. An Introduction to Contact Linguistics. other levels of system organization above and below the cogni-
Cambridge: Cambridge University Press. Winford provides a compre- tive architecture, but researchers like Pylyshyn would argue that
hensive overview of codeswitching designed for beginning graduate these levels are not cognitive.
students.
Architecture and Explanation
Brookss (1962) original notion of computer architecture was
COGNITIVE ARCHITECTURE
driven by the goals of computer design: he architecture served
as a functional account of capabilities to be used as a blueprint by
The Architecture of a Computer hardware engineers in order to bring a computer into being. In
he cognitive sciences have developed in large part from the the study of cognition and language, the information processor
application of concepts that initially arose in computer science. already exists. Why, then, is there a need for the cognitive archi-
One such concept is that of architecture. tecture? he answer is that an architectural account converts a
he term computer architecture was originated by Frederick P. cognitive description into a cognitive explanation.
Brooks, Jr., a pioneering force in the creation of IBMs early com- Architectural components convert descriptions into expla-
puters. For Brooks, computer architecture, like other architec- nations by providing a bridge between functional and physical
ture, is the art of determining the needs of the user of a structure accounts. hey can do this because components of the cog-
and then designing to meet those needs as efectively as possible nitive architecture must be both cognitive and physical (e.g.,
within economic and technological constraints (1962, 5). Haugeland 1985, 100).
160
Cognitive Architecture
hese two lives are important because the predominant and a common human endowment (1995, 167). Complete
research methodology used by cognitive scientists is functional accounts of human language must appeal to these biological
analysis (Cummins 1983). In functional analysis, a researcher underpinnings.
attempts to account for some complex function by decompos- hus, in the Chomskyan tradition, an architectural account
ing it into an organized system of subfunctions. Each subfunc- would include the speciication of a generative grammar, as
tion often becomes the subject of its own functional analysis; well as additional processes that are necessary and suicient for
this methodology is intrinsically iterative. However, if this were mediating language. Furthermore, this account would be bio-
all that there was to this methodology, functional analysis would logically grounded.
fall victim to an ininite regress and generate an ininite variety of On the other hand, the Chomskyan tradition takes strong
unexplained functions (Ryle 1949). positions that conlict with the general notion of cognitive archi-
To avoid Ryles regress, functional analysis also attempts to tecture sketched earlier. Two of these positions require special
progressively simplify the proposed subfunctions: he highest mention here. he irst is that a theory in linguistics should focus
level design breaks the computer down into a committee or army on competence, and not performance. he second is that
of intelligent homunculi with purposes, information and strate- the language faculty is modular in the sense of J. A. Fodor
gies. Each homunculus in turn is analyzed into smaller homun- (1983).
culi, but, more important, into less clever homunculi (Dennett hese positions are important because both have been used
1978, 80). At some point, the less clever homunculi become so to exclude certain concepts from linguistic study that are critical
simple that they can be replaced by physical devices that carry components of the cognitive architecture. For example, memory
out the desired function. At this point, the functional descrip- has not been deemed to be properly part of the linguistic domain.
tion is physically subsumed (Cummins 1983) and becomes hat is, while memory might impact language production or
explanatory. comprehension (e.g., by limiting the number of embedded
From this perspective, the cognitive architecture can be clauses that can be processed), some researchers would argue
described as the set of primitive functions that have been sub- that memory is not part of the language faculty proper. For some,
sumed in a functional analysis. heir functional or cogni- memory limitations are viewed as being important to a theory
tive role deines how these components mediate complex of language performance, but not to a theory of language com-
information processing. heir physical role deines how such petence (e.g., Chomsky 1965). Furthermore, memory limitations
processing can be physically implemented and explained. are related to a general cognitive resource, which by deinition
therefore cannot be solely part of a language faculty (Hauser,
Architecture and Language Chomsky, and Fitch 2002).
To the extent that language is mediated by information process- More recent variations of the Chomskyan approach provide
ing, an explanation of language must be grounded in a cognitive a more lexible view of the competence/performance distinc-
architecture. In many respects, linguistics provides prototypical tion and, as a result, lead to theories that make strong proposals
examples of architectural accounts of language. However, it can about cognitive architecture as construed earlier: A theory that
also be argued that the cognitive architecture holds an uneasy allows us to readily relate competence to performance ought to
position within some linguistic theories. be favored over one that creates hard boundaries between the
On the one hand, dominant theories in linguistics appear two (Jackendof 2002, 197).
to provide architectural accounts of language. We have already Jackendofs parallel architecture (2002) is one such theory.
seen that a cognitive architecture requires a set of functions He assumes that syntax is not the only system responsible for
to be established as primitives by subsuming them as neural the combinatorial nature of language. Instead, he proposes a par-
implementations. From its inception, the standard Chomskyan allel architecture in which three separate levels (phonology,
approach to language appears to strive toward this kind of archi- syntax, and semantics) are independent, each having their
tectural account. own primitive operations and combinatorial principles. hough
First, the speciication of a generative grammar provides independent, the three levels are linked by interface constraints.
a detailed account of a set of complex tokens, and the rules for hus, all three levels cooperate to produce the generative struc-
their manipulation, that are required to assign structural descrip- ture of language. Furthermore, this theory of linguistic structure
tions to sentences. Furthermore, this grammar is intended to can be mapped to a parallel architectural theory in which each
describe (at least in part) cognitive processing: Every speaker has level is a modular processor, but there are interfaces between the
mastered an internalized a generative grammar that expresses three processors that share a common linguistic memory.
his knowledge of his language (Chomsky 1965, 8). In short, one he preceding discussion of language and the cognitive archi-
purpose of a generative grammar is to describe the functional tecture has emphasized theories that adopt a so-called classical
properties of an internalized set of symbols and rules. perspective that emphasizes the rule-governed manipulation of
Second, the Chomskyan tradition presumes a strong link symbols. It is important to realize that alternative architectures
between generative grammar and the brain. his link is included for language have also been explored. For instance, classical
in the general view that the language faculty is a biological organ researchers have argued that artiicial neural networks are not
(Hauser, Chomsky, and Fitch 2002). According to Chomsky, capable of modeling the generative properties of language (Fodor
he human brain provides an array of capacities that enter and Pylyshyn 1988). However, empirical and theoretical analy-
into the use and understanding of language (the language fac- ses would indicate that this criticism is not valid (Hadley and
ulty); these seem to be in good part specialized for that function Hayward 1997; Siegelmann 1999). As a result, many examples
161
Cognitive Grammar
exist in which neural networks have been used to model a variety structure. It has played a key role in the history of cognitive
of language phenomena (Mammone 1993; Sharkey 1992). linguistics.
Fundamental to CG is the idea that language is an integral
Michael R. W. Dawson
part of human cognition and cannot be properly understood
without reference to cognitive abilities. A pervasive feature of
CG is the determination to reconcile accounts of linguistic struc-
Blaauw, G. A., and F. P. Brooks. 1997. Computer Architecture: Concepts ture with what is known about cognitive processing in domains
and Evolution. Reading, MA: Addison-Wesley. other than language. CG contrasts in this respect with models
Brooks, F. P. 1962. Architectural philosophy. In Planning a Computer that insist upon a discrete, autonomous grammar module and
System Project Stretch, ed W. Buchholz, 516. New York: McGraw-
the autonomy of syntax. he cognitive orientation of CG
Hill.
is apparent from a reliance on notions such as sensory imag-
Chomsky, N. 1965. Aspects of the heory of Syntax. Cambridge, MA: MIT
Press.
ery, perspective, mental scanning, attention, and igure versus
. 1995. he Minimalist Program. Cambridge, MA: MIT Press. ground asymmetry in accounting for linguistic phenomena. In
Cummins, R. 1983. he Nature of Psychological Explanation. Cambridge, broad terms, grammatical structure is explained as conventional
MA: MIT Press. Explores the role of the architecture in providing imagery, with alternate structures relecting alternate construals
explanatory power to functional analyses. of the conceived situation.
Dasgupta, S. 1989. Computer Architecture: A Modern Synthesis. New Not surprisingly, the cognitive notions underlying CG assume
York: Wiley. a relatively abstract interpretation when applied to some aspects
Dawson, M. R. W. 1998. Understanding Cognitive Science. of linguistic structure. For example, cognitive processes such as
Oxford: Blackwell. registration of contrast, scanning of a ield, and perception of a
Dennett, D. 1978. Brainstorms. Cambridge, MA: MIT Press.
boundary are all deemed relevant for explicating the notion of a
Fodor, J. A. 1983. he Modularity of Mind. Cambridge, MA: MIT Press.
count noun, understood as a bounded region in some domain
Fodor, J. A., and Z. W. Pylyshyn. 1988. Connectionism and cognitive
architecture. Cognition 28: 371.
in Langacker (1987, 189203). Such processes may be obvi-
Hadley, R. F., and M. B. Hayward. 1997. Strong semantic systematic- ous factors in the conceptualization of nouns with clear spatial
ity from Hebbian connectionist learning. Minds and Machines boundaries (e.g., cup, pencil), but a more abstract interpretation
7: 137. of these processes is clearly required in other domains. Body-part
Haugeland, J. 1985. Artiicial Intelligence: he Very Idea. Cambridge, nouns (e.g., waist, shoulder, side) must be explicated in terms of a
MA: MIT Press. virtual boundary that does not correspond to any visible, objec-
Hauser, M. D., N. Chomsky, and W. T. Fitch. 2002. he faculty of tively identiiable demarcation. Likewise, the notions of igure
language: What is it, who has it, and how did it evolve? Science and ground familiar from the study of perception are seen as
298: 156979. underpinning various relational asymmetries in language. hese
Jackendof, R. 2002. Foundations of Language: Brain, Meaning,
notions have most obvious relevance in the case of words relat-
Grammar, Evolution. Oxford: Oxford University Press. Provides an
ing to the spatial domain, such as the contrasting pair above and
overview of the state of linguistics that pays special attention to archi-
tectural issues.
below, where there is a igure-ground reversal of the igure and
Mammone, R. J. 1993. Artiicial Neural Networks for Speech and Vision. the conceptual reference point. he terms trajector (an adapta-
New York: Chapman and Hall. Contains many examples of artiicial tion of the notion of igure) and landmark (an adaptation of the
neural networks applied to speciic areas of speech and language. notion of ground) are used to refer to the speciically linguistic
Pylyshyn, Z. W. 1984. Computation and Cognition. Cambridge, MA: MIT manifestation of the perceptual notions of igure and ground,
Press. A detailed examination of the role of cognitive architecture in such that the book is the trajector and the table is the landmark
cognitive science. in the book under the table. Conversely, the table is the trajector
. 1991. he role of cognitive architectures in theories of cognition. and the book is the landmark in the table over the book. More
In Architectures for Intelligence, ed. K. VanLehn, 189223. Hillsdale,
abstractly still, the traditional syntactic contrast between subject
and object is construed in terms of relative salience, such that the
Ryle, G. 1949. he Concept of Mind. London: Hutchinson and Company.
subject is a primary clausal igure, or trajector, and the object is a
Sharkey, N. E. 1992. Connectionist Natural Language Processing.
Dordrecht: Kluwer Academic Publishers. secondary clausal igure, or landmark.
Siegelmann, H. T. 1999. Neural Networks and Analog Computation: Beyond At the heart of CG is the concept of a symbolic unit, consist-
the Turing Limit. Boston: Birkhauser. ing of semantic structure standing in correspondence to a
phonological structure. Consistent with the idea that language
is part of conceptual structure, semantic structure is understood
COGNITIVE GRAMMAR
as conceptualization tailored to the speciications of linguistic
Cognitive Grammar (CG) refers to the theory of language artic- convention (Langacker 1987, 99; see Talmy 2000, 4 for a similar
ulated most comprehensively in Ronald W. Langacker (1987, view of semantic structure). CG takes the notion of symbolic unit
1991), two mutually dependent volumes that are best read (similar to, but not to be equated simply with, the Saussurean
together. Langacker (1988) provides a succinct chapter-length sign) as fundamental and applicable at all levels of representa-
overview of his theory, while Taylor (2002) and Evans and Green tion, including lexical items, grammatical classes, and gram-
(2006, 553640) are highly recommended as student-oriented matical constructions. he lexical item tree, for example, consists
introductions to the theory. CG is wide ranging in its scope and of a semantic unit [tree] and a corresponding phonological unit
provocative in its approach to an understanding of linguistic [tri], which combine to form the symbol for tree, [[tree]/[tri]].
162
Cognitive Grammar
he same apparatus is applicable for deining a word class such are not feasible for every network. he extensive polysemy of
as a noun, abbreviated by Langacker as [[thing]/[]], indicat- head, for example, makes one single superschema covering such
ing a schematic semantic speciication of a thing but without diverse senses as head of a lettuce, head of a bed, head of
any speciic content phonologically. A morphologically more a university department, and so on unlikely. Semantic exten-
complex lexical item such as trees is represented as a composite sion holds between the more basic sense of human head and
structure integrating two symbolic units representing the noun the sense of head of an administrative unit. he node that is
tree and the plural [z]: [[[tree]/[tri]]-[[pl]/[z]]]. Grammatical the source of the extension constitutes a local prototype (with
constructions are in principle no diferent from a lexical item like respect to the extended sense); where one node is experienced
trees in terms of the descriptive apparatus required to capture all as representative of the whole category, as is likely in the case of
the relevant detail, with each of the component structures of a the human head sense of head, we speak of a global prototype.
construction represented by a symbolic unit. Grammatical mor- here is clearly variation among speakers in their judgments
phemes appearing in a construction, such as of, are treated as about nodes and relationships within the network, including
symbolic units in their own right, with semantic structure (of, for their ability to identify relatedness of senses and to extract sche-
example, speciies a partwhole relation). he integration of any matic meanings. his variation poses challenges for description
two symbolic units goes hand in hand with distinguishing the but does not negate the need to acknowledge the reality of such
dependent and autonomous parts of the composite structure. networks.
As far as semantic structure is concerned, [tree] is autonomous, CG adopts a nonreductionist or maximalist stance in its anal-
while [pl] is dependent, requiring an elaboration by a noun to ysis of linguistic structure, contrasting with prevailing reduction-
complete the structure. In terms of phonological structure, [tri] is ist, minimalist approaches in contemporary linguistics. he
pronounceable as a whole syllable and can be considered auton- nonreductionist approach of CG explicitly provides for the list-
omous, while the single consonant [z] is dependent. ing of highly speciic patterns alongside the statement of more
A striking feature of CG is the detail provided for in the inte- general patterns, rather than recognizing only the most general
gration of structures into larger composite structures. he analy- rules and schemas. he existence of a general rule of plural for-
sis of the English passive construction in Langacker ([1991] 2001, mation in English suixing /s/~/z/~/z/ to a noun, for example,
10147) illustrates the theoretical notions relevant to a detailed does not mean that certain instantiations of the rule, such as cats,
grammatical description and is recommended as a prime exam- dogs, horses, and so on, have no place in a grammatical descrip-
ple of a full-blown CG account of a construction type. Briely, and tion. On the contrary, where such instantiations have gained unit
consistent with the foregoing remarks, each morpheme in the status and are activated directly by the speakers, it is appropri-
passive (including by and the auxiliary verbs) has its own sym- ate to recognize them alongside other symbolic units, grammar
bolic representation, giving rise to the overall semantic structure, and lexicon forming a continuum of types of symbolic elements.
just as the active counterpart has its own compositional struc- Even when particular instantiations conform to a general rule,
ture and resulting semantic structure. Passive clauses do not they may acquire unit status in their own right, for example,
derive from active clauses in this view, nor do they derive from through high frequency of use. Acknowledging low-level, highly
some abstract structure underlying actives and passives. Rather, speciic instantiations runs counter to deeply entrenched prac-
passive clauses exist in their own right as instantiations of a con- tices in contemporary linguistics, which has been preoccupied
struction type with its own distinctive way of integrating sym- with higher-level generalizations and the principle of economy
bolic units, relecting a particular construal of the event. in description.
While phonological structure can be fruitfully explored Langacker has repeatedly emphasized the desirability of both
within CG (see Langacker 1987, 32848, 388400; Taylor 2002, general and particular statements in linguistic description, refer-
7895), it is semantic structure that has received most attention ring to the assumption that a phenomenon is to be accounted
and for which most theoretical apparatus has been developed. for in a mutually exclusive way as either a rule or a list as the
Fundamental to semantic structure is the idea of a network that rule/list fallacy (Langacker 1987, 402). Grammar, in CG terms,
is employed to represent polysemy relationships and to provide amounts to a structured inventory of conventional linguistic
motivation for conventional and novel extensions. Each node of units (Langacker 1987, 73). he units, so conceived, may be
the semantic network, together with the associated phonological semantic or phonological; they range from the symbolic units
structure, represents a semantic variant of the lexical item. consisting of a single morpheme to larger composite symbolic
Two types of relationships igure prominently in these net- units at the clause level, and they include highly speciic, as well
works: schematicity, whereby one node of the network expresses as highly schematic, units. his conception of grammar makes
a meaning fully contained in another node (see schema), and CG comparable to construction grammars, which are also
extension, understood as a relationship between semantic nodes inventory-based (cf. Evans and Green 2006, 47583), particularly
of a lexical item involving a conlict in semantic speciications. radical construction grammar (Croft 2001).
he word head, for example, can be assigned a sense [part of a By including quite speciic syntagmatic patterns within a
whole which controls the behavior of the whole] that is grammatical description, CG is able to comfortably accom-
schematic relative to iner-grained elaborations, such as [part modate phenomena that have been largely neglected in lin-
of the human body where thinking is located] and [person guistic theorizing, for example, the collocational patterning of
who manages an administrative unit]. In some cases, a high- great idea, absolutely fabulous, and so on involving combina-
est-level node or superschema can be proposed, encompassing tions of particular words. he greater emphasis on speciic pat-
all lower-level senses in the network, though such superschemas terning makes CG highly compatible with the methodology of
163
Cognitive Linguistics and Language Learning
corpus linguistics and other approaches that focus on lan- language system holds that language is a relection of human
guage in use whereby actual usage, including frequency of occur- cognition and that language can be accounted for by the inter-
rence and patterns of co-occurrence, can be observed and used action of the complex set of cognitive capabilities with which
as a basis for extracting patterns of varying generality (see also humans are endowed. Language is understood to emerge from
the entries treating connectionism). Fully general, exception- contextualized use; that is, it is usage based. he language
less rules are seen as atypical, and while it is valid to seek out system to be learned is an inventory of linguistic constructions
such rules, it would be misguided in this approach to attend only (units with a phonological pole, i.e., the constructions form,
to the most general patterns. and a semantic pole, i.e., the constructions meaning) which
Finally, a word on notation employed in CG. here is an array range in size from lexical items and morphemes to syntactic
of notational devices used by Langacker, who employs a distinc- and even discourse (see discourse analysis [linguistic])
tive and highly original geometric style of representation (in his patterns. he challenge for language acquisitionists working
earlier publications, he used the term Space Grammar to refer within a cognitive linguistic framework is to account for the rapid
to his approach). To some extent, the notation is intuitive: A learning of this vast array of linguistic patterns, drawing solely on
circle is used to denote a [thing] entity; thicker, darker lines general cognitive processes. Over the past 25 years, developmen-
represent the proile, that is, the designated thing or relation in tal psychologists have developed a large body of observational
the semantic structure of a morpheme. A full appreciation of the and experimental evidence that begins to do just that.
notation, however, requires careful study. Of course, not all the Elizabeth Bates famously said, Language is a tool. We use it
detail needs to be represented all the time, and CG ideas can be to do things (1976, 1). Cognitive linguists hold that the primary
efectively incorporated into linguistic analyses simply in prose thing for which we use language is communication. Humans
or with a minimum of notation (as in Taylor 2002). are highly social by nature. We also have the unique ability to
think about entities and events that are not in the immediate envi-
John Newman
ronment. Since we cannot communicate using mental telepathy,
we have to somehow externalize our internal conceptualizations
to make them available to others. Language is the major tool we
Croft, William. 2001. Radical Construction Grammar: Syntactic heory in have developed to accomplish this task. Cognitive linguists hold
Typological Perspective. Oxford: Oxford University Press. that when children learn a language, what they are learning is
Evans, Vyvyan, and Melanie Green. 2006. Cognitive Linguistics: An constructions, of varying sizes and degrees of abstractness, as
Introduction. Edinburgh: Edinburgh University Press.
they engage in using language in context. his learning process
Langacker, Ronald W. 1987. Foundations of Cognitive Grammar. Vol. 1.
takes place over a rather extended period of time, with most of
heoretical Prerequisites. Stanford, CA: Stanford University Press.
the pieces in place by the time the child is six or seven.
. 1988. An overview of cognitive grammar. In Topics in Cognitive
Linguistics, ed. Brygida Rudzka-Ostyn, 348. Amsterdam and According to the developmental psychologist Jean Mandler
Philadelphia: John Benjamins. (2004), children begin forming rudimentary concepts, many
. 1991. Foundations of Cognitive Grammar. Vol. 2. Descriptive of which form the basic semantic frames from which language is
Application. Stanford, CA: Stanford University Press. constructed, in early infancy. Very early on, the infant begins a
. [1991] 2001. Concept, Image, and Symbol: he Cognitive Basis of cognitive process of reformatting raw perceptional information
Grammar. Berlin and New York: Mouton de Gruyter. he chapters in into simple, schematized spatial representations that express
this volume cover key areas of grammar (grammatical valence, case, fundamental experiences, such as self-motion, caused motion,
passive, etc.) and can be read more or less independently of one and containment (see schema ). Experimental evidence shows
another a good balance between CG theory and application to data.
that by three months, the infant can distinguish typical human
Newman, John. 2004. he quiet revolution: Ron Langackers fall quar-
motion from mechanical motion. he infant learns that certain
ter 1977 lectures. In Imagery in Language: Festschrift in Honour
entities start up under their own power while others do not. he
of Professor Ronald W. Langacker, ed. Barbara Lewandowska-
Tomaszczyk and Alina Kwiatkowska, 4360. his chapter gives a irst- same entities that generate self-motion can also cause other
hand account of an early presentation of the material that eventually entities to move or change; the entities that do not generate
became Langacker (1987). self-motion tend to be acted upon, and so on. he constellation
Talmy, Leonard. 2000. Toward a Cognitive Semantics. Vol. 1. Concept of these perceptually grounded generalizations form the basis
Structuring Systems. Cambridge, MA: MIT Press. for fundamental concepts, such as animacy, inanimacy, and
Taylor, John. 2002. Cognitive Grammar. Oxford: Oxford University Press. caused motion. Such categories, in turn, represent the seman-
tic frames for basic syntactic patterns, such as intransitive and
transitive constructions, and participant roles, such as agent
COGNITIVE LINGUISTICS AND LANGUAGE LEARNING
and patient (see thematic roles ). A wide range of infant
A fundamental challenge for any theory of language is to provide studies provides rather clear evidence that infants are actively
a convincing account of how the prelinguistic child becomes a observing and exploring their world, forming generalizations
competent member of his or her linguistic community. To be across events and entities, and in the process developing con-
convincing, the developmental account should be consistent cepts and rudimentary syntactic-semantic frames that are the
with the general model of language posited for the adult sys- foundation of language. hese concepts are largely in place by
tem. Michael Tomasello (2003, 45) calls this the how do we get nine months.
there (to the adult language system) from here (the pre-linguistic Other researchers provide evidence that prelinguistic chil-
infant) problem. A cognitive linguistic approach to the adult dren generalize over units of spoken language and ind linguistic
164
Cognitive Linguistics and Language Learning
patterns. For instance, several studies show that children as reading skills are general cognitive skills that are fundamental to
young as eight months are sensitive to repeated patterns of syl- a number of nonlinguistic human activities, such as tool use and
lables; this particular pattern inding forms the basis for rec- play.
ognizing words in the auditory stream (e.g., Safran, Aslan, Young children tend to be conservative in their language
and Newport 1996, 19268; see word recognition, audi- use, apparently learning language item by item, as the items
tory and speech perception in infants). In general, this are used in context. For instance, if they have heard a verb
approach to language learning argues that language is extracted used transitively, they are unlikely to use it in an intransitive
from the patterns of usage events experienced by the child; the construction until they hear it used intransitively. Eventually,
system is derived from and grounded in contextualized utter- children begin to form generalizations or more abstract repre-
ances. For instance, the evidence suggests that the childrens sentations of the patterns. It is only when the childs syntactic
early word forms are shaped by the salience of particular types of constructions become more abstract that creative language
words in the adults speech. English children irst produce rela- begins to emerge, sometime between two and three years.
tional words such as more, up, of, which seem to have particu- Tomasello argues that the creative use of language represents
lar salience in adult speech directed to children, and later fuller the child putting together utterances out of already well-en-
verb forms, such as take of. In contrast, Korean childrens irst trenched pieces of language.
verb forms are full forms that relect the salient forms in adult Adele Goldberg (2006) speciically argues that syntactic pat-
speech. (Choi and Gopnik 1996). In line with the commitment terns are best understood as meaningful constructions (see con-
to cognitive generalization, pattern inding is not limited to struction grammars) that relect recurrent, humanly salient
language; it is essential to all types of category formation (see scenes, such as an agent engaging in an activity that results in
categorization). force being applied to another entity (transitive) or the agent
Tomasello argues that intention-reading skills are also neces- causing someone to receive something (cause to receive). To
sary to account for language learning. At around 9 to 12 months, the set of pattern inding and intention-reading skills, Goldberg
the young child begins to engage in a number of activities in adds key frequency-based constraints to account for the way in
which he or she actively attends to and participates in commu- which the child learns to limit his or her use of abstract construc-
nicative interaction. Around this age, children engage in joint tions, using language creatively but in a way that is attuned to the
attentional frames in which the child coordinates and shares conventional restrictions of the ambient discourse community.
attention with another participant around a third entity, for Two of the most important constraining elements identiied
instance, when the infant and parent attend to the same object by Goldberg are skewed input and preemption. Corpus studies
or when the infant follows the eye gaze or gesture of an adult show that in the speech directed at children, a single verb, or
in order to attend to a distal object or event. hese are activities small set of verbs, tends to be disproportionally used with partic-
that create common shared ground for communication that ular syntactic constructions. For instance, in the cause-to-receive
involves intentional communication about something outside construction, for example, Ellie _____ Jerry the teddy bear, doz-
the dyad. he communicative events that take place within joint ens of diferent verbs occur, but the verb give accounts for over
attentional frames have the quality of focusing on a goal-directed 40 percent of the instances. Goldberg points out that the seman-
activity in which both the child and the adult are participating. tics of give closely match those of the cause-to-receive construc-
Within the context of the joint attentional frame, the infant can tion; thus, there is a reinforcing match between the semantics of
begin to understand the adults use of pieces of language in coor- the syntactic pattern and the central verb that occurs with the
dination with communicative intent. his is the grounding construction. She argues that such skewed input is a key aid in
for the young children to recognize that those around them are helping children both learn the semantics of the syntactic con-
intentional agents, like themselves, and further, that language is struction and the conventionally appropriate matching between
used intentionally to manipulate the attention, mental state, or verbs and constructions.
even actions of the other person. A second important constraint provided by the input is
At this age, children also begin using verbal cues in order to preemption. his is the notion that if two forms seem equally
perform intentional actions; for instance, for varying purposes, appropriate for a particular context, but one consistently occurs
the child begins to use linguistic symbols to direct the adults in the input while the other does not, the child learns that the
attention to something outside the immediately shared frame. form that occurs in the input preempts the other, seemingly
In order to do this successfully, the child must engage in what appropriate, form. As a simple example, at some point the child
Tomasello terms role-reversal imitation. It is not enough to sim- forms the generalization that -ed is used to represent an action
ply repeat the adults language; the child must learn to use a sym- or event that took place in the past and creates the form goed.
bol toward the adult in the same intentional way the adult uses it However, in the input, where the child expects to hear goed, he
towards him or her. Tomasello argues that it is not a coincidence or she consistently hears went. After a short period of overgener-
that shortly after young children begin to engage in joint atten- alization (see overregularizations), the child will learn that
tion sharing, they also begin to produce their irst truly linguistic went preempts goed, and stop producing goed. Goldberg argues
symbols. Although these early utterances are one-word phrases that exactly the same process accounts for children learning
or unanalyzed chucks, such as whats-that (holophrases; see the match between speciic verbs and argument structure, for
holophrastic stage), they have a range of functions, includ- instance, that told occurs in the cause-to-receive construction,
ing imperative, declarative, and interrogative, that are typically as Mommy told Isabela the story, but say does not. N. Ellis (2002)
distinguished by distinct intonation contours. Intention- reviews a plethora of psycholinguistic studies showing that
165
Cognitive Linguistics and Language Learning Cognitive Linguistics, Language Science, and Metatheory
humans are highly sensitive to frequency of linguistic input, thus COGNITIVE LINGUISTICS, LANGUAGE SCIENCE, AND
providing support for Goldbergs fundamental claims regarding METATHEORY
the importance of skewed input and preemption.
Cognitive linguistics is probably best understood not as a theory
Much of the research on child language learning focuses on
but as a theoretical orientation. A theoretical orientation is a
universal human capabilities and universal stages that all chil-
broader category that encompasses a number of particular theo-
dren go through. A theory that argues language is a relection of
ries that share presuppositions, attitudes, interests, methods,
general human cognitive processes would predict many univer-
and so on. generative grammar is a theoretical orientation
sals. As a usage-based theory, cognitive linguistics also focuses
in this sense, as is connectionism. In the past, behaviorism
attention on language-speciic learning and predicts a wide
and structuralism were important theoretical orientations.
range of variation. Melissa Bowerman and Sonja Choi (2001)
Diferent theories within an orientation need not be mutu-
studied the acquisition of spatial language by young English- and
ally compatible. Moreover, theories in diferent orientations
Korean-speaking children, an area in which one might expect to
need not be mutually exclusive, and are certainly not mutually
ind general human perception relected rather uniformly across
exclusive on all points. However, orientations often operate as
languages. Relatively speaking, however, English has a rather
identity categories, ways of deining ailiations, dividing in-
general system for expressing the notion of containment with
groups from out-groups. his has a number of intellectual and
the preposition in and that of support with the preposition on.
practical consequences, including our tendency to exaggerate
In contrast, Korean makes a number of iner distinctions in these
diferences between in-groups and out-groups, formulating the
categories with separate verbs of containment that express tight
views of both in extreme ways. For example, Noam Chomskys
it versus loose it and verbs of support expressing more informa-
view of the autonomy of syntax is complex. However, both
tion about the supporting surface, such as horizontal surface and
Chomskyans and anti-Chomskyans may absolutize the distinc-
juxtaposing surfaces. Bowerman and Choi found that despite the
tion between syntax and semantics, setting aside the nuances
seeming diferences in complexity, both systems were learned at
of Chomskys own formulations (see autonomy of syntax).
about the same age. Korean children were quite sensitive to the
he dichotomizing of in-group/out-group diferences is a
ine-grained spatial distinctions lexicalized in their language.
matter for social psychology. However, the tendency it repre-
Such indings raise issues about how language might inlu-
sents is not unique to group relations. here are analogues for
ence the speakers mental representation of phenomena in
this sharpening of diferences at virtually every level of human
the world. Dan Slobin (1985) describes language-directed atten-
cognition, extending down to the neuronal level in perception.
tion as thinking for speaking. He argues that the language makes
I mention this because the continuity of cognitive functions
salient, and focuses the speakers attention on, diferent aspects
is arguably the fundamental principle of cognitive linguistics.
of a scene in order to encode it in language. However, the claim
Moreover, that cognitive continuity is embodied (see embodi-
is not that language somehow causes people to experience spatial
ment), thus ultimately founded on bodily experience. Cognitive
scenes or activities diferently (strong Sapir-Whorf hypothesis),
linguists are, of course, concerned with the ways in which this
but rather that speakers of diferent languages have the capacity
embodied continuity bears on language. Additionally, such a
to categorize objectively similar experiences in diferent ways.
continuity helps us to understand the development of theoretical
Andrea Tyler orientations. In this way, cognitive linguistic ideas are signiicant
for language science at two levels. First, they are important at the
WORKS CITED AND SUGGESTIONS FOR FURTHER READING level of guiding a set of research programs in language study.
Second, they have metatheoretical value in suggesting ways
Bates, Elizabeth. 1976. Language and Context: he Acquisition of
we might think about the relations among diferent theoretical
Pragmatics. New York: Academic Press.
orientations.
Bowerman, Melissa, and Sonja Choi. 2001. Shaping meanings for lan-
In this entry, I do not discuss speciic cognitive linguistic
guage: Universal and language-speciic in the acquisition of spa-
tial semantic categories. In Language Acquisition and Conceptual theories, which are treated in other entries theories such as
Development, ed. Melissa Bowerman and S. Levinson, 15891. cognitive grammar, frame semantics, conceptual met-
Cambridge: Cambridge University Press. aphor, and conceptual blending (see also construction
Choi, Sonja, and A. Gopnik. 1996. Early acquisition of verbs in English: A grammars, exemplar theory, functional linguistics,
cross-linguistic study. Journal of Child Language 22: 497530. usage-based theory, and cognitive linguistics and
Ellis, N. 2002. Frequency efects in language processing: A review with language acquisition, as well as the more speciic entries
implications for theories of implicit and explicit language acquisition. on basic level concepts, blended space, conduit met-
Studies in Second Language Acquisition 24.2: 14388. aphor, framing effects, generic- and specific-level
Goldberg, Adele. 2006. Constructions at Work: he Nature of Generalization
metaphors, image schema, mental space, metonymy,
in Language. Oxford: Oxford University Press.
parable, and projection [blending theory]; in addition to
Mandler, Jean. 2004. he Foundations of Mind: Origins of Conceptual
hought. Oxford: Oxford University Press.
overviews of key topics, these entries provide key bibliographical
Safran, E., R. Aslan, and E. Newport, E. 1996. Statistical learning by items for further reading in cognitive linguistics). Rather, I con-
8-month old infants. Science 274: 19268. sider three general characteristics of the cognitive linguistic ori-
Slobin, Dan. 1985. he Crosslinguistic Study of Language Acquisition. entation, characteristics drawn from Croft and Cruse (2004). In
Hillsdale, NJ: Lawrence Erlbaum Associates. discussing these ideas, I simultaneously consider what they reveal
Tomasello, Michael. 2003. Constructing a Language: A Usage-Based cognitively about the pretheoretical attitudes that ground cogni-
heory of Language Acquisition. Cambridge: Harvard University Press. tive linguistics, and how these might difer from the pretheoretical
166
Cognitive Linguistics, Language Science, and Metatheory
attitudes that ground other approaches to language science, par- conceptual metaphor theory is a paradigmatic case of cognitive
ticularly generative grammar. linguistics, it is diicult not to read this statement as manifest-
On the irst page of their important introduction to cognitive ing the canonical form of a metaphor, with the structure Target is
linguistics, William Croft and D. Alan Cruse (2004) present the source (e.g., Juliet is the sun; see source and target). hus
following three major hypotheses: First, language is not an conceptualization more generally, semantics provides the
autonomous cognitive faculty; second, grammar is conceptu- mental model for understanding syntax. his yields an orient-
alization; and third, knowledge of language emerges from lan- ing principle for language study. Faced with a grammatical phe-
guage use. Despite their phrasing, it does not seem quite right to nomenon, this tells us, look for an explanation in the semantic
refer to these as hypotheses. Like the fundamental ideas in any function of the grammatical phenomenon.
theoretical orientation, these are more like guiding principles, It is worth contrasting, and comparing, generative grammar
assumptions for research programs that tie together the difer- on this score. In both cognitive linguistics and in generative
ent theorists with their diferent theories. Cognitive linguists do grammar there is, in efect, a privileged level of language study.
not set out to falsify or even corroborate, say, the nonautonomy While the privileged level in cognitive linguistics is semantics,
of grammar. Rather, they put forth speciic hypotheses regard- that in generative grammar is syntax. he privileging is some-
ing how grammatical patterns can be explained by reference to what diferent in the two cases. In generative grammar, syntax
general cognitive structures, processes, and contents. (I should does not provide an explanation for semantics. However, in each
emphasize that this is in no way a criticism of cognitive linguists. case, language is seen as irst and most importantly deined by
Everyone does this in all theoretical orientations. hat is part of one level (semantics or syntax). his has consequences for the
having a theoretical orientation and, therefore, is not something ways in which all theories in the orientation are formulated.
that merits blame, or praise.) I would like to consider each of For example, literary theories inluenced by generative gram-
these orienting principles in turn. mar took syntax as a model (see generative poetics), while
he irst principle is a reformulation of the fundamental idea literary theories inluenced by cognitive linguistics often take
of cognitive linguistics that language (structure, production, semantics as a model (see cognitive poetics). Perhaps more
reception, acquisition, and so forth) is continuous with other importantly, this privileging of a single level of language has con-
aspects of human cognition. However, Croft and Cruse put the sequences for empirical study and the evaluation of theories and
statement negatively. Along one axis, the opposite of continu- evidence. Every theory of any value has things that it can explain
ity would be discontinuity, a separation between language and and things that it cannot explain. Part of a theoretical orientation
the rest of cognition. his is the position associated with genera- involves distinguishing central cases, the things that really need
tive grammar. So, in framing their statement negatively, Croft explaining, from peripheral cases, the things that it is less crucial
and Cruse are making clear just where the identity division falls to explain. he privileging of one level of language contributes
here between those who see linguistic cognition as continu- to that division. Cognitive linguists are likely to ind it scandal-
ous with other forms of cognition and those who make language ous that generative grammarians do not have any cogent way
a separate faculty. I say that this is an opposition along one of explaining the complexities of metaphor. Generative gram-
axis because, along another axis, this is not an opposition. For marians are likely to ind objections on these grounds to be triv-
example, part of the identity deinition of generative grammar ial. However, they are likely to ind it scandalous that cognitive
involved demarcating its mentalistic view of language subse- linguists explain certain complex syntactic patterns in a some-
quently shared with cognitive linguistics from the nonmental- what loose, nonalgorithmic way. Cognitive linguists are likely to
istic view of behaviorism. respond to the generative arguments with the same indiference
he second orienting principle of cognitive linguistics is that that generativists show toward metaphor, seeing those intrica-
grammar is conceptualization. his extends the continuity cies of grammar as contrived artifacts of the generative method.
assumption, but it begins to structure that continuity as well. In each case, some concerns deemed central by one orientation
If cognition is continuous in its various operations, there are are deemed marginal by the other.
several ways in which particular cognitive functions might be he privileging of one level of language is related to something
organized to perform tasks in diferent domains. For example, else ones determination of what a paradigmatic case of lan-
it seems possible that syntax and semantics would be separate, guage is. As usual, the point is bound up with general cognition,
even if neither is autonomous with respect to other cognitive in fact general semantic cognition. When one thinks of a category
processes. Syntax might follow some sort of sequential ordering (see concepts and categorization), certain sorts of things
process also found in bodily movement, while semantics might come to mind a prototype or standard case, perhaps salient
follow some other set of principles found in inferential thought. instances; depending on the category, some mode of actional
However, cognitive linguists have tended to see diferent compo- engagement may arise as well (e.g., a bodily orientation). Take a
nents of language itself as continuous with one another. category such as minority group. Each of us has an idea of what
But this, too, leaves open several options. Just how are we to a prototypical minority is, and each of us has some exemplars
understand the continuity of language? Are there neutral prin- or instances. While there are certainly similarities across our
ciples that apply equally to syntax and semantics, for example? prototypes and exemplars, there are diferences as well. hese
Here, cognitive linguists have tended to opt for a stratiied diferences will afect how we respond to such ideas as minority
view of cognitive operation in language. Semantics is primary; rights. he same point holds for a category such as sports, though
syntax is secondary. It is worth considering the form of Croft and in this case, particular sports may partially arouse action tenden-
Cruses statement grammar is conceptualization. Given that cies as well (e.g., think of batting in baseball; when doing so, at
167
Cognitive Linguistics, Language Science, and Metatheory
least many people imaginatively orient toward a batting stance). others, it is related to usage-based theory which leads us to the
his is the sort of anticipatory actional engagement that is well third of Croft and Cruses criteria, that knowledge of language
known in emotion theory (see, for example, Frijda 1986, 69 f.). It emerges from language use.
is bound up with research linking meaning with action (see, for he irst thing to note about this orienting principle is that,
example, Pulvermller 2002, 5662), research often stressed by as phrased, it refers, most basically, to language acquisition.
cognitive linguists treating embodiment. As such, it is opposed (once again) to generative grammar.
Again, all these points apply directly to language study for Generative grammarians often see language as growing out of
language, too, is a semantic category. Our prototypes and exem- an innate language faculty (see innateness and innatism).
plars of language certainly have things in common, enough in Alternatively, they see a language acquisition device as
common that we identify roughly the same sorts of things as taking in very fragmentary, inadequate data and using that data
language. But they difer also. Indeed, it seems that they difer to set parameters for principles that are already given geneti-
systematically. Speciically, our pretheoretical ideas about lan- cally (see principles and parameters theory). One might
guage seem to cluster in ways that are roughly coordinated with say that, in this view, knowledge of language is largely given
theoretical orientations. For some people, language is irst of all innately and, thus, language use arises from prior knowledge.
a matter of words; for others, sentences; for others, larger dis- Cognitive linguists reverse this explanatory sequence as one
courses. For some people, actional engagement with language would expect, given the apparent diference in pretheoretical
prototypically involves looking up words in a dictionary or learn- prototypes.
ing vocabulary; for others, it may involve language instruction, Croft and Cruses statement also alludes to Chomskys divi-
language therapy, or computer programming; for others, it may sion between competence and performance. Competence is
be a matter of interior monologue or personal writing; for others, the inner grammar that one has developed (or grown) in learn-
animated conversation. If language brings to mind complexes ing a language. Performance is any act of speaking, writing, or
of related and opposed words, one will be inclined to view lan- signing. In generative grammar, competence explains perfor-
guage diferently than if it brings to mind sentences, or if it brings mance or partially does, since performance is also afected
to mind dialogues (see, for example, dialogism and hetero- by many nonlinguistic factors. (hat is why generative gram-
glossia) or chains of reasoning or poetry. marians often wish to abstract away from persons in treating
Prototypes, exemplars, and actional orientations are, of language.) his leads us away from narrow concerns with acqui-
course, somewhat diferent among ordinary folk, on the one sition. Speciically, Croft and Cruses statement suggests that, in
hand, and professional language scientists, on the other. But Chomskys terms, performance does not arise from competence
the general pattern is the same. Indeed, one might argue that (plus other things), but the reverse. Competence or knowledge
the situation with language scientists is more extreme. On the of language, they suggest, is a sort of artifact of the actional prac-
one hand, they have greater exposure to a wider range of under- tice of language use once more, just what one would expect,
standings of language. his should, to some extent, loosen the given diferent pretheoretical prototypes and associated theo-
constraints imposed by their pretheoretical attitudes. On the retical orientations. his has consequences for acquisition as
other hand, their specialized engagement with particular aspects well, for it suggests that the child is not a passive recipient of
of language (not to mention their emotional and career invest- language but is actively engaged in language practice while
ment in the success of certain theoretical orientations) tends learning.
to entrench those attitudes more irmly. hus, structuralism Finally, it is worth remarking on the use of metaphors here.
and related developments, such as deconstruction, tend to Chomsky sees speech as performance. I do not at all believe
begin with a pretheoretical view of language as primarily a mat- that we are determined by the metaphorical associations of our
ter of words, speciied professionally in terms of phonemes and speech. However, certain word choices may relect certain prior
morphemes. In contrast, generative grammar began with a pre- attitudes, and they may serve to prime other, related ideas (see
theoretical view of language as primarily a matter of sentences. priming, semantic), making it easier for us to choose those
his was entrenched through research focusing on syntax. primed options (other things being equal). In relation to drama,
Cognitive linguists actually seem to have somewhat diferent ilm, or music, performance may suggest a secondary activity,
prototypes for language in terms of language units, contributing a more or less efective, successful, or accurate instantiation of
to their division into slightly diferent theoretical groups. (One some prior, correct language the play text, the screenplay, or
could undoubtedly make related points about structuralism and the musical score. For example, an actor may lub his lines and
generative grammar.) hey do focus on meaning. But meaning a director may cut scenes. If I want to study Shakespeares King
occurs in diferent bundles words, sentences, larger discourses. Lear, I do not conine myself to a particular ilm or stage produc-
Some cognitive linguists seem to focus on words, others on sen- tion, a performance. Rather, I look at the text. he analogue in
tences, and others on larger discourses. However, in each case, language leads me to study competence, what lies behind perfor-
their prototype involves some person a person thinking the mance, rather than performance itself. In contrast, use suggests
words, writing the sentences, or arguing with someone else in that language is a tool. he prior form is not the real thing (like
the larger discourse. Generative grammarians commonly wish the play text). Rather, the crucial matter is the current action,
to abstract away from persons, seeing persons as introducing what is done with the tools. In relation to language, this suggests
performance errors into language. In contrast, cognitive linguists the importance of focusing on speech practice.
stress the necessary involvement of persons in language. In some he use of metaphor is related to a series of other gen-
versions of cognitive linguistics, this becomes embodiment. In eral semantic/cognitive processes that bear on our theoretical
168
Cognitive Linguistics, Language Science, and Metatheory Cognitive Poetics
orientations. For example, it is related to our organization of one of the most important contributions of cognitive linguistics
semantic space, thus what we consider most similar to human to language science.
language and what we consider most diferent from it. For exam-
Patrick Colm Hogan
ple, we will understand and investigate language in certain ways
if we view it as most similar to mathematics; we will understand
and investigate it in other ways if we view it as closest to gesture
or cooperative labor. he same point holds if we see human lan- Croft, William, and D. Alan Cruse. 2004. Cognitive Linguistics.
guage as most diferent from animal communication, from Cambridge: Cambridge University Press.
human silence, or from machine code. As with other phenom- Feyerabend, Paul. 1975. Against Method: Outline of an Anarchistic heory
of Knowledge. London: Verso.
ena we have been considering, none of this is something that we
Frijda, Nico. 1986. he Emotions. Cambridge: Cambridge University
examine empirically and seek to falsify or corroborate. Rather,
Press.
it is pretheoretical background that inclines us toward one or Lakof, George, and Mark Johnson. 1980. Metaphors We Live By.
another theoretical orientation. Chicago: University of Chicago Press.
Individual readers no doubt have diferent views as to which Pulvermller, Friedemann. 2002. he Neuroscience of Language: On Brain
pretheoretical ideas are right and which are wrong. I certainly Circuits of Words and Serial Order. Cambridge: Cambridge University
do. One thing I hope to have suggested, however, is that such Press.
preferences are primarily a matter of ones own pretheoretical
attitudes. Does it seem obvious that language is really embod-
COGNITIVE POETICS
ied and that abstracting from that embodiment is wrong?
Certainly, speech is embodied and it would be misleading to Cognitive poetics is the study of literary reading that draws on the
ignore that. But it also seems clear that there are many speech principles of cognitive science. In its early phase, the discipline
glitches and that at least some degree of abstraction is neces- drew mainly on cognitive linguistics in focusing on the
sary. What about metaphors or the structure of semantic space? textual cues for literary reading; in its more recent phase, it has
Language is in some ways very much like mathematics, but in drawn more readily from cognitive psychology in order to explore
other ways like shared work. It is in some ways very diferent issues of readerly efects and aesthetics. hroughout its short
from animal communication or machine code, but in other history, however, practitioners in the ield have shown a willing-
ways very similar to them. his is, of course, not to say that there ness and propensity for genuinely multidisciplinary study. Work
are no facts about language, about whether language is best in cognitive poetics is often characterized by an awareness that
understood in one way or another. here are such facts in par- the task of understanding literary reading holistically involves
ticular cases, and there are theories that are better at explaining a serious engagement with several disciplines: Linguistics and
those facts. However, it is to say that the broad principles that psychology are central, but they are also often enriched from lit-
deine our theoretical orientations are almost always partially erary scholarship and critical theory, discourse analysis and
correct and partially incorrect. It is probably productive to pur- social theory, anthropology and historical study, neuroscience
sue one set of orienting principles rigorously in a research pro- and medical research, aesthetics, ethics, and philosophy. Most
gram. However, it is probably not productive to dismiss other people working in cognitive poetics believe that the systematic
theoretical orientations that are pursuing research programs analysis of literary reading is also essential within both linguistics
based on diferent principles (themselves derived from difer- and psychology.
ent pretheoretical attitudes). he ield of cognitive poetics coalesced as an identiiable
Cognitive linguistics is a vibrant research program, or set of movement in the mid-1990s, though it is possible to classify ret-
research programs, in language science, as attested by a range rospectively several areas of work that can be seen as precursors.
of entries in this volume. But it also has something to teach us he term itself was coined by Reuven Tsur in the 1970s (see Tsur
at a metatheoretical level. Speciically, it suggests that our theo- 1992) to refer speciically to his exploration of literary aesthetics
rization should be pursued vigorously within theoretical orien- through neuroscience and cognitive psychology. Since then, the
tations, but that our subsequent evaluation of theories should term has been taken up and broadened in scope to include a wide
be less insular and, perhaps more importantly, less combative. range of research questions, frameworks for analysis, and areas
One of the early discoveries of conceptual metaphor theory was for exploration. Various alternative names for the enterprise have
that we commonly model intellectual dispute on war (see Lakof been used during its brief history, each indicating the slightly dif-
and Johnson 1980, 45). his is not simply a matter of metaphors. ferent emphases of its users. For example, cognitive rhetoric
It is the result of the ways in which we form in-groups and out- has been used in North America to point to the connections with
groups based, in this case, on theoretical orientations (as well as classical rhetoric in uniting form and efect in language study;
personal ainities, attitudes toward the public political stances cognitive stylistics indicates a focus on detailed and rigorous tex-
of leading theorists in each orientation, and other factors). In any tual analysis in the European tradition of stylistics, or literary
event, it is unfortunate. Of course, cognitive linguists are people linguistics; other, more neutral terms, such as cognition and liter-
with brains like anyone else. All of us tend to slide into the model ature, have also been preferred on occasion.
of warfare when engaging in intellectual dispute. But, as a theo- It is apparent, too, that there are cultural contexts underlying
retical orientation, cognitive linguistics helps us to see what we this nomenclature, much of which has to do with the branding
are doing when we enter into combat mode and how intellec- of the new discipline in institutional settings and the intellectual
tually deleterious it is. hat metatheoretical point is potentially marketplace. In the United States, the generative paradigm in
169
Cognitive Poetics
linguistics, on the one hand, and poststructuralist critical theory work. Cognitive poetic analyses thus have beneits for the study
in literary study, on the other, have meant that cognitive poetics of human language processing and the study of mind, as well as
has been ighting for recognition on two fronts. hough the dis- for literary and artistic scholarship.
tinction was important earlier, a West Coast psychological tradi- Within cognitive poetics, several diferent dimensions of
tion of metaphor study and an East Coast tradition of linguistic investigation have emerged. Some of the foremost and earliest
textual analysis have now largely been merged. Similarly else- work centers on the human conceptual and linguistic capac-
where, a continental European empiricist focus and a British and ity for metaphor or conceptual integration (see conceptual
East Asian stylistic emphasis have joined together more recently. blending). When cognitive linguists explored the workings of
If there is any division remaining, it is a tendency for American conceptual metaphors of the sort mentioned earlier, it became
scholars to emphasize macrocognitive concerns and for Euro- apparent that some expressive, poetic, or innovative metaphors
Asian researchers to emphasize the more micrological efects of used in literary settings were causing problems with the basic
stylistic texture, though even this division is fast disappearing as theory. Some creative literary language was more concerned with
cognitive poetics develops globally. (See Lakof and Turner 1989; interesting deviance and unsettling defamiliarization than with
Turner 1991; Stockwell 2002; Gavins and Steen 2002; Semino and resolving unfamiliar concepts in familiar terms. Furthermore,
Culpeper 2002). some artistic metaphors seemed to afect those very same famil-
For the majority of writers in cognitive poetics, the basic iar domains in ways that persisted in the continuing life of the
common principles of their work involve a rejection of the reader, and most perplexingly of all, some literary metaphors
distinctions between text and context, form and mean- seemed to take on a life of their own that went beyond the basic
ing, abstraction and speciication, and literal and metaphori- explanatory mappings of their source domains.
cal expression. Fundamental to this position is, most often, the he cognitive poetic theory of conceptual integration or
cognitive linguistic claim that language use is embodied and blending has developed as a good account of these and other
that mind and body cannot be separated (see embodiment). features. Briely, the theory proposes that a set of inputs are gen-
In other words, the linguistic expressions used in all languages eralized to produce a blended space, a mental representation
are elaborations of basic physical circumstances of the human that the reader uses to develop the emergent logic, texture, and
condition. To give an example from early cognitive linguistics, consequences arising from an engagement with the metaphor.
there are fundamental conceptual metaphors (such as life For example, at a microstylistic level, in heodore Roethkes
is a journey, ideas are objects, theories are buildings, and phrase I have known the inexorable sadness of pencils, three
so on) that tend to igure abstraction and complexity in familiar main input spaces (unstoppable motion, human emotion, and
and concrete terms. Such idealized igurative habits condition the tool of the writer) are blended to produce a richly integrated
our thinking and are manifest in linguistic expressions. hese sense of emotional signiicance that is diicult to express in
conceptual metaphors are maintained and exploited in literary any other way. At a more macrological level, the allegorical and
texts just as in any form of language, and a thorough cognitive analogical signiications of, for example, Margaret Atwoods
poetic analysis is interested in both the conceptual signiicance novel he Handmaids Tale amount to more than the sum of a
of the underlying scheme and the textual pattern through which political manifesto, on the one hand, or a dark fantastic narra-
it is expressed (see Johnson 1987). tive, on the other (see Fauconnier and Turner 2002).
he principle of linguistic embodiment resolves the key issue Another cognitive poetic dimension with a long history is
for a theory of interpretation, which is how a single explana- comprised of the various frameworks for the consideration of
tion can account for the fact that individual readings are possible worlds in literary works. Traditional possible worlds logic
but communal readings are in practice very common. Our basic from pragmatics and the philosophy of language has been
human condition creates igurative linguistic commonalities, augmented as a means of understanding the richness of ictional
and our personal, social, and cultural tracks through these ide- projection. (Indeed, the notion of mental spaces in concep-
alizations create individual, group, and cultural distinctiveness. tual integration theory owes something to the notion of worlds in
he literary scholar coming to cognitive poetics thus has a sys- this sense). Such work has been especially fruitful in dealing with
tematic and principled means of exploring individual expression extended prose iction in which the divergence from reality (the
and sociohistorical patterns in culture. actual possible world) is most striking or thematized: science
Cognitive poetics sees reading as a natural evolutionary iction, fantasy, magical realism, dream visions, allegories, and
process, rather than as an artiice that is distinct from other fairy tales (see Ryan 1991; Ronen 1994; Semino 1997).
human capacities. So, it is particularly interested in the ways Also aiming to integrate text and context in this way is schema
everyday (that is, nonprofessional, nonacademic) readers read, poetics, which draws on the notion of psychological schemas
and it draws connections between the natural creativity of imag- from both Kantian philosophy and artiicial intelligence work.
ination in everyday language (see creativity in language Here, culturally shared knowledge frames provide a rich context
use) and the particular ways in which linguistic creativity is for linguistic input that always underdetermines the afective
manipulated in the literary setting. Meaningfulness is regarded outcome in the reader. In literary research, the notion has been
as a readerly process, rather than a inal classiication, and so used to explain mismatches in reader and character knowledge
cognitive poetics researchers have investigated how meanings and as a means of exploring the notion of literariness itself
are constructed and resolved in the process of literary reading. (see Cook 1994).
he personal experience and social circumstances of the reader Most recently developing out of the worlds tradition is text
are at least as important as the textual organization of the literary world theory, a cognitive poetic approach to discourse processing
170
Cognitive Poetics Coherence, Discourse
that has been fruitfully applied to many texts, including literary Semino, Elena, and Jonathan Culpeper, eds. 2002. Cognitive Stylistics.
ones. Text world theory ofers a means of understanding how only Amsterdam and Philadelphia: Benjamins. A collection of articles in
certain parts of readerly schematic knowledge are activated in a cognitive poetics.
reading of the literary text. It seems particularly useful for track- Stockwell, Peter. 2002. Cognitive Poetics: An Introduction. London and
New York: Routledge. he standard and comprehensive textbook of
ing readerly comprehension and understanding involvement
the ield.
and empathy (see Emmott 1997; Werth 1999; Gavins 2007).
. 2009. Texture: A Cognitive Aesthetics of Reading. Edinburgh and
Other work in cognitive poetics has dealt with detailed mat- New York: Edinburgh University Press and Columbia University Press.
ters of stylistic texture, drawing on cognitive grammar and Tsur, Reuven. 1992. Toward a heory of Cognitive Poetics.
the psycholinguistics of igure and ground (Langacker Amsterdam: North-Holland.
1987, 1991; and see van Peer 1986), on image schemas, and Turner, Mark. 1991. Reading Minds: he Study of English in the Age of
on the psychology of deictic (see deixis) maintenance and shift Cognitive Science. Princeton, NJ: Princeton University Press.
(Duchan, Bruder, and Hewitt 1995). If there is a major criticism . 1996. he Literary Mind: he Origins of hought and Language.
of cognitive poetics in addressing the nature of the connection Oxford and New York: Oxford University Press. An excellent polemic
between cognition and stylistic realization, this work seems and exempliication of cognitive poetics.
to ofer a direct response. Certainly the linguistic realization van Peer, Willie. 1986. Stylistics and Psychology: Investigations of
Foregrounding. New York: Croom Helm.
of conceptual patterns is the most urgent area for new work.
Werth, Paul. 1999. Text Worlds: Representing Conceptual Space in
Currently, the interest of cognitive poetics in readerly empathy,
Discourse. London: Longman.
emotion, and aesthetics is being developed through the notion
of texture (see Stockwell 2009), and connections are being forged
with related ields in political ethics and critical discourse
COHERENCE, DISCOURSE
analysis (see Lakof 2002; OHalloran 2003). As a young and
expanding ield that has not yet established its paradigms, cogni- What is a discourse? What makes a discourse coherent or inco-
tive poetics looks set to continue this initial expansive phase for herent? Investigation into these diicult questions has yielded so
the foreseeable future. many sophisticated proposals that a short, comprehensive sur-
vey is well out of reach.
Peter Stockwell
With regard to the irst question, it is fair to say that there is
widespread disagreement. Some researchers think of discourses
as texts, which raises questions about how texts are to be iden-
Cook, Guy. 1994. Discourse and Literature. Oxford: Oxford University tiied and individuated. Is sameness of spelling necessary and/
Press.
or suicient to textual identity? Could there be signiicant varia-
Duchan, J. F., G. A. Bruder, and L. E. Hewitt, eds. 1995. Deixis in
tions, such as diferences of spelling, among the tokens of a sin-
Narrative: A Cognitive Science Perspective. Hillsdale, NJ: Lawrence
gle text type or discourse? Under what conditions are sentence
Erlbaum.
Emmott, Catherine. 1997. Narrative Comprehension: A Discourse tokens grouped as parts of a single text? Some investigators
Perspective. Oxford: Clarendon Press. question the very choice of texts as the basic unit of analysis: A
Fauconnier, Gilles, and Mark Turner. 2002. he Way We hink. New discourse is not a text (type), they say, but a string of spoken or
York: Basic Books. written sentences in a language. Yet disagreement again resur-
Gavins, Joanna. 2007. Text World heory: An Introduction. Edinburgh faces when we ask how these strings are to be picked out. Some
and New York: Edinburgh University Press and Columbia University authors tend to think of a discourse as an utterance (construed
Press. loosely along Gricean lines as anything that is a candidate for
Gavins, Joanna, and Gerard Steen, eds. 2002. Cognitive Poetics in non-natural meaning and produced with communicative
Practice. London and New York: Routledge. his book is the compan-
intention), whereas others think that speech-acts, or groups
ion volume to Stockwell (2002) and contains good examples of cogni-
thereof, are the relevant discursive units. Yet even this is not suf-
tive poetics.
iciently holistic for some discourse analysts, who want to focus
Johnson, Mark. 1987. he Body in the Mind: he Bodily Basis of Meaning,
Imagination, and Reason. Chicago: University of Chicago Press. on larger sociocultural patterns, such as the discourse of racism
Lakof, George. 2002. Moral Politics. 2d ed. Chicago: University of Chicago (see discourse analysis [foucaultian] and discourse
Press. analysis [linguistic]). Some researchers argue for the pri-
Lakof, George, and Mark Turner. 1989. More han Cool Reason: A Field macy of face-to-face conversational interactions and contend
Guide to Poetic Metaphor. Chicago: University of Chicago Press. that the analysis of discourse coherence should ind its point of
Langacker, Ronald. 1987, 1991. Foundations of Cognitive Grammar. Vols. departure in speciic sequences of immediate communicative
1 and 2. Stanford, CA: Stanford University Press. interaction (Scheglof 2001). Another area of divergence con-
OHalloran, Kieran. 2003 Critical Discourse Analysis and Language cerns the nature and number of the participants or producers of
Cognition. Edinburgh: Edinburgh University Press.
a single discourse. Is a conversation between two or more parties
Ronen, Ruth. 1994. Possible Worlds in Literary heory.
one or several discourses?
Investigators who disagree over the very outlines of the
Ryan, Maire-Laure. 1991. Possible Worlds: Artiicial Intelligence and
Narrative heory. Bloomington and Indianapolis: Indiana University concept of discourse can often still agree that certain kinds of
Press. examples ought to be counted as discourses, and so can mean-
Semino, Elena. 1997. Language and World Creation in Poems and Other ingfully debate logically independent questions about discourse
Texts. London: Longman. coherence. One such question is whether the coherence or
171
Coherence, Discourse
incoherence of a discourse is a matter of degree and, if so, what (8) I am fed up with the telephone always ringing. Yet I do
sort of vagueness or lack of speciication explains this fact. Is get some important calls every now and then. (concession,
it just ignorance of the real coherence conditions that leads us contrast, qualiication)
to judge that a given discourse is more or less coherent or that
he appearance of divergent and at times highly elaborate lists of
another discourse is a borderline case? Is discourse coherence
coherence-constitutive relations has occasioned the generation of
by its very nature a genuinely scalar concept? To what extent are
such lists, as well as theoretical relections over their status (Hovy
judgments of coherence and incoherence relative to the contexts,
and Maier n.d.; Redeker 2000; Kehler 2002; Sanders 1997). Is the
categories, and genres of discourse? One may be tempted to con-
list of coherence-constitutive relations open-ended or inite? Are
clude that the coherence of a poem is something quite diferent
there deeper-level relations to which various other relations may
from that of an argumentative essay, but perhaps the subtending
be reduced? What would count as a successful reduction? Should
semantic relations are similar or even identical.
the list of relations include recordation, or the loosest memorial
he coherence of discourse is not just a matter of logic. he
associations? How much of a real constraint is there?
absence of logical contradictions, or the presence of logical
At least some disagreements about the lists of relations stem
coherence (see coherence, logical), is hardly suicient to
from divergent assumptions about the nature of the relata. Are
establish discursive coherence more generally, as the following
the actual intentions of the utterer a component of coherence, or
example is designed to show:
only those thoughts or moves expressed by the spoken or writ-
(1) he sap dripped. Mike pondered the cogito. ten phrases? Some analysts (e.g., Grosz and Sidner 1986) bring
Battologymeans tiresome repetition. in the speakers plans, while others want to leave them out and
contend that coherence is determined by rhetorical structures
It is logically possible that what the speaker of (1) has said is that are like the crystallized form of diverse speech-acts (Asher
entirely true, but as the speaker lits from topic to topic; the and Lascarides 2003). According to these authors, a discourse
utterance has a kind of incoherence but not the logical kind. is coherent to the extent that anaphoric expressions can be
he example may be used to suggest that coreference or, more resolved and the propositions introduced can be rhetorically
broadly, sameness of subject or topic, is a necessary condition on connected to other propositions in the discourse. he recog-
a discourses coherence. nition of coherence requires the drawing of inferences about
Compare (1) to (2): rhetorical relations on the basis of semantic content; these rhe-
(2) I am fed up with the telephone always ringing. Dont torical relations then serve as the basis for inferences about other
hesitate to call me! aspects of content. his approach is contrasted to one in which
conversational implicature and other implicit, coher-
he speaker has conjoined two sentences that both have some- ence-relevant content are recognized by inferring the speakers
thing to do with the topic of telephoning, yet this two-part dis- intentions directly from conventional linguistic meaning and
course seems incoherent unless the utterance was meant to contextual factors such as salient aspects of the conversational
convey the idea that in spite of the irritation over phone calls in situation. Another issue is whether the coherence of a conversa-
general, an exception is being made with regard to the addressee. tional exchange depends inally on complex relations between
And indeed, an intended, implicit contrast or counterpoint can the participants cooperative activities and intentions. Can a
contribute to discursive coherence. What seems to be missing in participants deliberate silence contribute to the coherence or
(2) is an explicit, metadiscursive marker, such as but, indicative incoherence of the overall discourse? Could the logical incoher-
of the intended link. (An informative survey of work on discourse ence of one participants contribution contribute to the coher-
markers, a growth market in linguistics, is ofered in Schifrin ence of the conversation of which it is a part?
2001.) Questions about the psychological status or reality of some
Investigators have described many other relations held to be of the complex coherence relations iguring in the literature lead
constitutive of discursive coherence or cohesion. For example, to a more general issue pertaining to the explanatory merits of
consider the following coherent minidiscourse schemata, with discursivity as such. Although we may sometimes attend specii-
some of the proposed names for the illustrated coherence- cally to coherence or incoherence as such, especially in scholarly
constitutive link: and argumentative contexts, in many other cases the production
and processing of words and related deeds proceed smoothly in
(3) I am fed up with the telephone always ringing. A call woke
the absence of any such focus. Speciic relations such as paral-
me up last night. (elaboration, illustration, speciication)
lelism, contrast, causeefect, and so on play their part, but not
(4) I am fed up with the telephone always ringing. You are the disjunctive property called coherence that can be theoreti-
tired of always hearing me complain. (parallelism) cally cobbled together. Consequently, the analysis of discourse
(5) I am fed up with the telephone always ringing. People are coherence runs the risk of reifying underconstrained theoretic
always bothering me and asking for something. (ampliica- constructions.
tion, generalization)
Paisley Livingston
(6) I am fed up with the telephone always ringing. Im going
to disconnect my phone. (causation, explication) WORKS CITED AND SUGGESTIONS FOR FURTHER READING
(7) I was fed up with the telephone always ringing. hen I fell Asher, Nicholas, and Alex Lascarides. 2003. Logics of Conversation.
ill. (temporal ordering, narration) Cambridge: Cambridge University Press.
172
Coherence, Logical Colonialism and Language
Clark, Herbert. 1996. Language Use. Cambridge: Cambridge University given a single contradiction, every arbitrarily chosen proposition
Press. follows validly. Yet it is now often denied that this is a good prin-
Cohen, Philip, Jerry Morgan, and Martha Pollack, eds. 1990. Intentions in ciple of reasoning, and some philosophers contend that there are
Communication. Cambridge, MA: MIT Press. paraconsistent yet nonexplosive systems. hat some proposition
Gernsbacher, Morton Ann, and Talmy Givon. 1995. Coherence in
and its negation are part of the same belief set does not imply
Spontaneous Text. Amsterdam and Philadelphia: John Benjamins.
that all other propositions belong to that set, and diferent levels
Grimes, J. 1975. he hread of Discourse. he Hague: Mouton.
Grosz, Barbara J., and Candace L. Sidner. 1986. Attention, intentions, and
of logical coherence can be delineated in semantic representa-
the structure of discourse. Computational Linguistics 12: 175204. tions of inconsistent sets (Jennings and Schotch 1984).
Halliday, M. A. K., and Ruqaiya Hasan. 1976. Cohesion in English. Some philosophers have gone so far as to contend that there
London: Longman. are true contradictions, such as the conclusions yielded by liar
Hobbs, Jerry R. 1985. On the Coherence and Structure of Discourse. and sorites paradoxes. he latter dialetheist stance is contested
Stanford, CA: Center for the Study of Language and Information. by many logicians, however, who have sought to establish that
Hovy, Eduard H., and Elisabeth Maier. N.d. Parsimonious or proli- all paradoxical arguments are invalid or unsound. For example,
gate: How many and which discourse structure relations? Available in an updating of the medieval cassatio account of logical para-
online at: http://www.isi.edu/natural-language/people/hovy/papers/ doxes, Laurence Goldstein (2000) argues that while liar sentences
93discproc.pdf (accessed February 7, 2009).
are meaningful, they lack content in the sense of failing to specify
Kehler, Andrew. 2002. Coherence, Reference, and the heory of Grammar.
truth conditions and, therefore, are neither true nor false.
Mann, William C., and Sandra A. hompson. 1988. Rhetorical struc-
Logical coherence or consistency is not equivalent to logi-
ture theory: Toward a functional theory of text organization. Text cal validity, which is often deined as a basic constraint on the
8.3: 24381. relations between the premises and conclusions of an argu-
Redeker, Gisela. 2000. Coherence and structure in text and discourse. ment: Valid arguments are those in which truth is preserved, in
In Abduction, Belief and Context in Dialogue: Studies in Computational the sense that whenever all of the premises of the argument are
Pragmatics, ed. H. Bunt and W. Black, 23363. Philadelphia: John true, its conclusion is necessarily true. (In classical logic, valid-
Benjamins. ity requires the preservation of falsehood as well; necessarily,
Sanders, Ted. 1997. Semantic and pragmatic sources of coherence: On if the conclusion is false, at least one of the premises is false.)
the categorization of coherence relations in context. Discourse
Attempts to provide a conceptual analysis of the notion of logical
Processes 24: 11947.
consequence include syntactical, model-theoretic, and proof-
Sanders, Ted, and Wilbert Spooren. 1999. Communicative intentions and
theoretic approaches.
coherence relations. In Coherence in Spoken and Written Discourse,
ed. W. Bublitz, U. Lenk, and E. Ventola, 23550. Amsterdam: John Paisley Livingston
Benjamins.
Scheglof, Emanuel A. 2001. Discourse as an interactional achieve-
ment: III: he omnirelevance of action. In Handbook of Discourse
Analysis, ed. Deborah Schifrin, D. Tannen, and H. Hamilton, 22949. Etchemendy, John. 1990. he Concept of Logical Consequence.
Oxford: Blackwell, Cambridge: Harvard University Press.
Schifrin, Deborah. 2001. Discourse markers: Language, meaning, and Goldstein, Laurence. 2000. A uniied solution to some paradoxes.
context. In Handbook of Discourse Analysis, ed. Deborah Schifrin, D. Proceedings of the Aristotlean Society 100: 5374.
Tannen, and H. Hamilton, 5475. Oxford: Blackwell. Jennings, R. E., and P. K. Schotch. 1984. he preservation of coherence.
Studia Logica 53: 89106.
Priest, Graham. 2004. In Contradiction: A Study of the Transconsistent.
COHERENCE, LOGICAL he Hague: Martinus Nijhof.
Priest, Graham, J. C. Beall, and Bradley Armour-Garb, eds. 2004. he Law
Logicians generally employ coherence and consistency as of Non-Contradiction: New Philosophical Essays. Oxford: Clarendon.
synonyms naming the absence of contradictions in a group of Sainsbury, R. M. 1995. Paradoxes. 2d ed. Cambridge: Cambridge
sentences, propositions, or beliefs, where a contradiction is University Press.
the conjunction of a proposition and its negation. In metaphysi-
cal terms, logical incoherence or contradiction is the impossible
COLONIALISM AND LANGUAGE
instantiation of a property and some other, incompatible prop-
erty, as in the circle was square. Epistemically, a contradiction One of the complications of writing on the centrality of language
is an irrational belief in both a proposition and its denial. to imperialism and colonialism is that even if the account were
Logical consistency is not a necessary feature of what people limited to the policies and practices of the European nation-
say, write, or think. Nor is the absence of contradictions a suf- states in the last 500 years, it would still leave out major historical
icient condition on discourse coherence (see coherence, events and processes. For example, it would not cover the efects
discourse), as a collection of logically consistent yet unrelated of Roman imperial linguistic strategy, one of which, at a deep
sentences does not constitute a coherent discourse. In many historical level, was the appearance of a number of the modern
contexts, however, logical consistency is a regulative norm for European vernacular languages that were, in turn, to become
speakers and interpreters. According to classical logic, a set of vehicles of imperial and colonial rule. Nor would it include the
propositions is either coherent or contradictory and trivial (in linguistic impact of earlier empires, for example, the Aztecs
the sense that it entails all propositions or explodes). In classical in Mexico or the Incas in Peru. And it would not address non-
logic, the ex falso quodlibet argument was held to establish that European modern imperialism and colonialism, for example,
173
Colonialism and Language
the consequences of the imposition of Japanese language and kind and manner of living, than the diversity that is betwixt them
culture on its Asian neighbors in the late nineteenth and early to in tongue, language, order and habit, which by the eye deceives
middle twentieth centuries. the multitude, and persuades unto them, that they should be as it
Given the complexity of this larger history or set of histories, were of sundry sorts, or rather of sundry countries. (Statutes 1786,
it would be impossible to provide any sort of sensible rendition 28 H 8. c.xv.)
of it or them in a short entry. It is proposed, therefore, to trace
he corollary to this belief that cultural speciically linguistic
the development of one major form of linguistic colonialism in
diference created division and prevented political and religious
order to demonstrate the general ideology that lay behind the
unity was the idea that a common language would forge com-
practice, and to show how, even in this single example, it worked
mon political allegiance and identity. he logical consequence,
diferently in distinct locations and points in history. hough this
therefore, was that linguistic diference had to be extirpated and
sacriices historical speciicity in one sense, it is intended that
Ireland Anglicized. Edmund Spenser, poet and colonial servant,
the example chosen the uses of English in the British imperial
noted in 1596 that it hath ever been the use of the Conqueror,
and colonial project will demonstrate the particularity and var-
to despise the language of the conquered and to force him by all
iability of the process. Antonio De Nebrija made an important
means to learn his (Spenser [1596] 1633, 47). He argued for the
point when he asserted in his Gramtica Castellana, published
eradication of Gaelic on the supposition that the speech being
in the fateful year 1492, that siempre la lengua fue compaera
Irish, the heart must needs be Irish: for out of the abundance of
del imperio (language was always the companion of empire)
the heart the tongue speaks (ibid., 48).
(de Nebrija [1492] 2006, 13), but it is necessary to pay attention to
Yet if it was the aim of linguistic colonialism in Ireland to
the ways in which this relationship was constituted within difer-
Anglicize the country in order to bring it completely under politi-
ent forms of colonialism.
cal control, then it was a goal that was not achieved until the late
An account of English (later British) colonialism in Ireland
nineteenth century (by which point Irish linguistic nationalism,
might start by noting that when the English irst invaded Ireland
the binary opposite of the colonial policy, had already started to
in 1169, they took their language with them and imposed it on the
inspire the revolutionary movement that overthrew British rule).
native population. But such a narrative would involve an anach-
It has been calculated that in the 1830s, for example, half the
ronistic oversimpliication both in terms of the national identity of
native Irish population spoke Irish, and half of that group spoke
the invaders and the languages that they spoke. It is open to ques-
only Irish. Some 80 years later, just prior to Irish independence,
tion, for example, whether the leaders of the invasion thought of
less than 14 percent of the population spoke any Irish, and no
themselves as English at all (barely a century after the Norman
more than 2 percent were Irish monoglots. How was this linguis-
Conquest, they were more likely to have considered themselves
tic shift brought about? In this speciic case there were a number
Norman or Anglo-Norman), and the languages of their mercenary
of factors: the incorporation of the country by military force into
soldiery included Flemish, Welsh, Anglo-Norman, and, of course,
the imperial political and economic order and the consequent
what passed for English. Indeed, the irst colonial legislation on
introduction of the socially centralizing processes of industrial-
language in Ireland, he Statute of Kilkenny (1366), was notable
ism and urbanization; the massive emigration that followed upon
for two reasons. First, it was directed against the colonists, rather
the widespread poverty among the rural population; the imposi-
than the colonized, and had the aim of preventing the colonizers
tion of an educational system that rejected the native language in
from adopting the native Gaelic language and culture. he indig-
favor of English; the spread of the bureaucratic state into every-
enous Irish were not included in the scope of the law since they
day life; the choice of English as the language of religion by the
could speak their own language if they wanted; the point was to
Irish Catholic Church; and the death of large numbers of Irish
stop the colonizers from going native (a process of cultural assim-
speakers in the Great Famine.
ilation that had been occurring since the irst invasion). Second,
Although a number of these causes were particular to
despite proclaiming that English should be the language of the
Ireland, others repeated in a pattern that occurred across the
colonists, the statutes were in fact written in Norman-French
British Empire though with diferences. Indeed, if there is a
one of the languages of law in England at the time. he point here
key to understanding how and why linguistic colonialism of the
is that although the general outline of the history of linguistic
modern European type operated, it lies in this variable combi-
colonialism in any given case can be traced relatively easily, the
nation of economic, cultural, educational, and religious factors
debates and practices pertaining to speciic historical conjunc-
and their efects upon the lived experience of colonial subjects.
tures are often diicult and complex to understand.
he nature, practices, and functions of colonial language policy
In the sixteenth century, some 400 years after the irst inva-
changed throughout time, were altered to suit the difering pur-
sion of Ireland, the centralizing English state determined upon a
poses of the colonizers, and were adapted when the colonized
policy of linguistic colonialism as part of its attempt to bring the
responded in various ways. he sole aim was, to coin an oxymo-
whole island under crown rule. he legislation that marked the
ron, the ruthlessly pragmatic use of language to achieve, consol-
implementation of the strategy, Henry VIIIs Act for the English
idate, and prolong colonial rule.
Order, Habit and Language (1537), revealed the belief that under-
In this regard, it should also be remembered that the discourse
pinned it. he law ordered that all of the kings subjects conform
deployed around the languages of colonialism also formed part
to English culture, especially language, on the basis that
of the colonial project. For example, Edwin Guest noted in 1838
there is again nothing which doth more contain and keep many that English is rapidly becoming the great medium of civilisation,
of [the kings] subjects of [Ireland], in a certain savage and wild the language and law to the Hindoo, of commerce to the African,
174
Colonialism and Language Color Classisiication
of religion to the scattered islands of the Paciic; its range, he the other hand, Chinua Achebe, another major writer, rejected
observed, is greater than ever was that of the Greek, the Latin, or this position and opted instead to use English, but a new form
the Arabic; and the circle widens daily (1838, 703). And in 1850, of English, linked to its national home but altered to conform to
T. Watts argued in the Transactions of the Philological Society that African realities.
it will be a splendid and novel experiment in modern society, if he range of views on this and related issues and the vehe-
a language becomes so predominant over all others as to reduce mence with which they are expressed testiies to the ongoing
them in comparison to the proportion of provincial dialects. He complexity and signiicance of the debates surrounding the leg-
had one language in mind, of course: [A]t present the prospects acy of linguistic colonialism, many of which are treated as ques-
of the English language are the most splendid the world has ever tions of language policy (see language policy).
seen. It is spreading in each of the quarters of the globe by fash-
Tony Crowley
ion, by emigration, and by conquest (Watts 1850, 214).
he imperial vision that both Guest and Watts articulated in
the mid-nineteenth century was already one that held English
to be a global language transmitted by means of economic and Achebe, Chinua. 1975. Morning Yet on Creation Day. London:
military conquest; by the emigration of English speakers as both Heinemann.
proponents and victims of British power; by cultural inluence Calvet, Louis-Jean. 1998. Language Wars and Linguistic Politics. Oxford:
Oxford University Press.
(not least through education and the fashionability that eco-
Crowley, Tony. 2005. Wars of Words: he Politics of Language in Ireland
nomic success brings with it); and by the imposition of the civi-
15372004. Oxford: Oxford University Press.
lizing inluence of religion. But it is important to note that both de Nebrija, Antonio. [1492] 2006. Gramtica Castellana. Barcelona:
in the nineteenth century and today, the phrase global language Lingua.
is signiicant in its reference to the use of English in contexts Guest, Edwin. 1838. A History of English Rhythms. London: Bell.
across the world, but also highly misleading in its suggestion that Kachru, B. J. 1986. he Alchemy of English: he Spread, Function and
it is the same form of the language used throughout the world. Models of Non-Native Englishes. Oxford: Pergamon.
English, as the vehicle of imperialism and colonialism (primarily Ngg Wa hiongo. 1986. Decolonising the Mind: he Politics of Language
British, more recently American), was and is used in enormously in African Literature. London: James Currey.
wide-ranging situations, but it isnt a world language, either in Pennycook, Alastair. 1998. English and the Discourses of Colonialism.
the sense of a single form reproduced globally or in the sense that London: Routledge.
Spenser, Edmund. [1596] 1633. View of the present state of Ireland. In
it is used by even a majority of human beings. Given the diver-
he Historie of Ireland Collected by hree Learned Authors, ed Sir James
sity of human experience, the complexity of our history, and the
Ware, 1119. Dublin.
nature of human language, it is highly implausible that a par- Statutes at Large Passed in the Parliaments Held in Ireland, he. 1786
ticular language English, Chinese, Arabic, or any other will 1801. 20 vols. Dublin.
become a true global language. Indeed, as has been seen in the Watts, T. 1850. On the probable future position of the English language.
history of colonialism and postcolonialism, what in fact happens Proceedings of the Philological Society 4: 20714.
when it is imposed in diferent places across the world is that the
language itself changes and develops. his process of the emer-
COLOR CLASSISIFICATION
gence of variant forms, sometimes recognized as new languages
in their own right, is often described as the price that imperial Color terms label categories of the hue, saturation, and bright-
languages have to pay for their historical role. ness of light relected from surfaces. Because colors vary from
If the functions of language in imperialism and colonial- one another continuously and independently on these three
ism are historically, spatially, and contextually variable, then dimensions, there is no apparent intrinsic structure to the color
the responses made by those who were subjected to these lan- space that would prevent speakers of diferent languages from
guages also difer accordingly in the colonial and postcolonial cutting up the continuum in diferent ways. In the absence of
periods. To take the example of English again, it is possible to empirical studies, psychologists and anthropologists expected
point to the distinct roles of the language in India both before color classiication to be an example of extreme cultural relativ-
and after national independence. Although English was clearly ism and that the spectrum would be segmented into categories
used under colonialism to produce domination and to exercise by diferent languages in arbitrarily diferent ways (e.g., Brown
power, it is nonetheless the case, as B. J. Kachru has shown, 1965, 31516).
that it was used as a language of Indian nationalism in the inde- B. Berlin and P. Kay (1969) refuted this relativist assumption.
pendence struggle and now functions in complicated ways as a hey asked native speakers of 20 diferent languages to identify
vehicle of control, authority and administrative cohesion not the best examples ( foci) and the boundaries of basic color terms
least in the way in which it can operate as a neutral medium in on a Munsell color chart (a grid of 320 color chips with 40 hues
particular contexts. his is not, however, to say that English is not and 8 levels of brightness, plus a 10-chip gray scale). Although
still perceived by some as a language of oppression in India, as it informants varied enormously in their placement of boundaries
is in other postcolonial locations. In the debate about the proper of color categories, they agreed considerably more on the choices
medium for African literature, for example, the Kenyan writer of the foci of the categories. Berlin and Kay found that there were
Ngg Wa hiongo identiied English as a signiicant cause of only 11 basic color categories in their sample of languages, with
colonial alienation and thus argued for his native Gky as foci in black, white, red, yellow, green, blue, brown, gray, pink,
the language best suited to express his African experience. On orange, and purple. More surprisingly, they found that the color
175
Color Classisiication
W R Y W R Y
G/B/K G B/K
W/R/Y W R/Y W R/Y W R Y W R Y Figure 1. Five trajectories of color term evolution.

G/B/K G/B/K G/B K G/B K G B K
W R K W R K
Y/G/B Y/G B
categories came in a limited number of combinations. If there Two families were opponent processes: a red-green channel
were two categories, their foci were in black and white; if three, (excited by red light and inhibited by green light, or vice versa)
they focused in black, white, and red; if four, black, white, red, and a yellow-blue channel (excited by yellow light and inhibited
and yellow or black, white, red, and green; if ive, black, white, by blue light, or vice versa). he third family was a white-black
red, yellow, and green; if six, blue was added; if seven, brown channel that responds to brightness levels independently of the
was added; and if there were eight or more categories, gray, pink, other two channels. his physiological account helped explain
orange, and purple were added, in no particular order. why there might be universals in the classiication of what
In the roughly 40 years since Berlin and Kays initial descrip- seemed a structureless domain: he universal structure is one
tion of the universals and evolution of color classiication, this imposed by the neurophysiology of color vision.
picture has been enriched and complicated by further research, Unfortunately, subsequent research has revealed that the
but not radically changed. he most thorough revision was neurophysiological opponent process system would actually put
prompted by the results of the World Color Survey (WCS) (Kay, the unique hue points in the wrong places. he true axes of the
Berlin, and Merriield 1991; Kay et al. 1997; Kay and Mai 1999; system are closer to cherry-teal and chartreuse-violet than they
Cook, Kay, and Regier 2005). he WCS investigated color nam- are to red-green and yellow-blue (Jameson and DAndrade 1997).
ing in 110 languages, with roughly 25 native speakers of each Sadly, this leaves the universals in color classiication without a
language interviewed about their color names for each of 330 clear neurophysiological explanation.
color chips, and their choices of the best examples of each basic
James Boster
color term of their language. he WCS represented an enormous
improvement in both the methods and the quantity of data over WORKS CITED AND SUGGESTIONS FOR FURTHER READING
Berlin and Kay (1969): he WCS interviewers questioned many
more speakers of each language, surveyed nearly six times as Berlin, B., and P. Kay. 1969. Basic Color Terms: heir Universality and
Evolution. Berkeley: University of California Press.
many languages, and focused on languages spoken by indig-
Brown, R. 1965. Social Psychology. New York: Free Press.
enous groups in Africa, Papua New Guinea, and Central and
Cook, R. S., P. Kay, and T. Regier. 2005. he world color survey data-
South America, as opposed to predominantly Indo-European base: History and use. In Handbook of Categorization in Cognitive
languages. Science, ed. H. Cohen and C. Lefebvre, 22342. New York: Elsevier.
he revised sequence (Kay et al. 1997; Kay and Mai 1999) DeValois, R. L., I. Abramov, and G. H. Jacobs. 1966. Analysis of the
recognizes ive evolutionary pathways (shown in Figure 1). he response patterns of LGN cells. Journal of the Optical Society of
ive trajectories were interpreted as generated by four princi- America 56: 96677.
ples: partition (lexicons tend to partition items into exhaustive Jameson, K., and R. G. DAndrade. 1997. Its not really red, green, yel-
and mutually exclusive categories); black and white (distinguish low, blue: An inquiry into perceptual color space. In Color Categories
black and white); warm and cool (distinguish warm primaries in hought and Language, ed. C. L. Hardin and L. Mai, 295319.
from the cool primaries); and red (distinguish red from other Cambridge: Cambridge University Press.
Kay, P., B. Berlin, L. Mai, and W. Merriield. 1997. Color naming across
colors).
languages. In Color Categories in hought and Language, ed. C. L.
One open question is where the universals come from. Kay
Hardin and L. Mai, 2156. Cambridge: Cambridge University Press.
and C. McDaniel (1978) argued that the six unique hue points of Kay, P., B. Berlin, and W. Merriield. 1991. Biocultural implications of
white, red, yellow, green, blue, and black are given to us by the systems of color naming. Journal of Linguistic Anthropology 1: 1225.
neurophysiology of color vision. hey based their interpretation Kay, P., and L. Mai. 1999. Color appearance and the emergence and evo-
on R. L. DeValois, I. Abramov, and G. H. Jacobss (1966) research lution of basic color lexicons. American Anthropologist 101: 74360.
on the lateral geniculate nucleii (LGN) in the hypothalamus of Kay, P., and C. McDaniel. 1978. he linguistic signiicance of the mean-
rhesus macaques, which had reported three families of neurons. ings of basic color terms. Language 54: 61046.
176
Communication
COMMUNICATION liberating. Human language was irst proposed by behaviorists

to be analyzable in terms of a inite state model, and this analy-
Explicit models of what communication is are not prominent in
sis of human language was then shown by Chomsky, from some
cognitive science. Research under the explicit banner of com-
elementary considerations about the structure of sentences, to
munication does lourish in sociology. here, the emphasis is
be evidently defective. he conclusion drawn was that the struc-
most often on phatic rather than ideational communication, a
ture of human language was not to be understood on this model,
distinction due to B. Malinowski (1923). he former creates,
but no objection was raised to it as a model of communication.
maintains, or dissolves communities of communicators. he
Language was saved but communication was fed to the behav-
latter communicates ideas. Generally, these two aspects of com-
iorists. he conclusion from Chomskys demolition might just as
munication are pursued independently, however unjustiiably.
well have been that the behaviorist model was a bad model of
he explicit and general models of language structure that dom-
human communication and that better was deserved.
inate linguistics are sometimes tacitly thought of as providing,
his skepticism about addressing communication is widely,
inter alia, models of ideational communication, though the gap
though not universally, shared by cognitive scientists it is not
between structure and function is underestimated.
just an isolated aberration of Chomskys. hose who have consid-
Of course, this avoidance of explicit study might be an ana-
ered communication worth discussion have generally pursued
logue of the lack of discussion of life in biology. Life is the subject
it within the framework of pragmatic theories, such as Paul
matter of biology and, therefore, the word is little heard therein.
Grices, which address communication as an add-on to a logical
Communication is at least a sizable part of what the social sci-
theory of sentence meaning (Sperber and Wilson 1986; Levinson
ences are about and, so the argument goes, it is not surprising
2000). hese authors appreciate the gap between structure and
that the word is little heard. Unfortunately, this analogy does not
function. We communicate more than the literal meanings of
it unproblematically or absolve us very far.
the sentences we utter (e.g., the implicatures we thereby make;
here is, of course, a huge amount of work on communica-
see conversational implicature), but this assumes, rather
tion phenomena. Linguistics is largely about the structure of
than explains, what it is to communicate a sentences meaning.
natural languages, and at least one of their functions is com-
Functional studies of information structure, many descended
munication. psycholinguistics studies the interpretation of
from Halliday (2004), are further oblique contributions to an
natural language discourses by people. Social psychology has
understanding of communication in terms of the tailoring of
much to say about both linguistic and nonlinguistic communi-
linguistic message to audience, but again they eschew a direct
cation, from tone of voice to body language. Sociology provides
account of what communication is.
extensive studies of communication in all sorts of guises, from
Perhaps it is itting that the nearest thing we have to a frontal
microdialogue to mass media. he humanities likewise. he
approach to communication is derived from philosophical and
issue here is not lack of study, or even lack of study in cognitive
logical approaches to language, and speciically to the seman-
science, but lack of theoretical frameworks for conceptualizing
tics of discourse (see discourse analysis [linguistic]). he
what human communication is. One might be happy to have lots
logical tradition can be seen as deining communication as the
of models, better still competing ones, but to have none smacks
achievement of mutual interpretation for discourses. Traditional
of carelessness.
logic had no explicit account of the process of interpreta-
One signiicant event in the recent cognitive history of the
tion, but logic was implicitly about the criteria that had to be
concept of communication was Noam Chomskys demolition
met to achieve mutually shared interpretation between two
of the pretense of behaviorist psychologists and linguists to
participants in an argument or proof. If the parties shared all
analyze language (and communication) in terms of inite state
assumptions, then they should also share the deductive closure
machine models. his computational model is closely related to
of the assumptions and conclusions. So if they difered on con-
C. Shannon and W. Weavers model of communication, one of
clusions (such as P vs. not P) then that must be because there
the few inluential abstract models of communication, in which
was some divergence of assumptions or interpretation of the lan-
a sender issues signals from a inite code book through a channel
guage fragment that appeared in the argument. For example, two
to a receiver who decodes the messages from an identical code
important cases were equivocation and enthymeme.
book. he amount of information transmitted is a function of
In equivocation, a party might draw a conclusion that relied
the probability of the occurrence of these signals. Information is
on slippage in their interpretation. Socrates might be interpreted
measured by the decrease of uncertainty. he less predictable a
as, say, referring to the Greek philosopher on one occurrence
signal, the more we learn from its occurrence. Of course, Shannon
and the Brazilian footballer on a second. Or democracy might, at
and Weaver were not behaviorists. hey assumed that the send-
one point in the argument, admit of a political system in which
ers and receivers minds have general capacities for assimilating
unlimited funds could be used to campaign, while excluding
messages, though any such assimilation lay outside their model.
such systems at another point. Logic from the earliest times dis-
But the behaviorists inite state machine can be construed as a
tinguished content from form, and avoidance of equivocation
particular application of Shannon and Weavers model, and the
was the main constraint on content in the process of interpre-
behaviorists claim was that it was a general theory not just of
tation. No constraint was placed on the content attached to X,
human communication but of human behavior in general. here
other than that it must be the same content that attached to every
was no mind to assimilate the inite code of messages.
occurrence of X in the argument.
Chomskys demolition of the inite state model as a model of
Enthymeme is an inexplicitness of assumption. Since natural
human language was deservedly famous, thoroughgoing, and
argument rarely spells out all its assumptions, enthymeme is a
177
Communication
prevalent cause of misalignment. Of course, there is a ine line If the library is open, she is in the library.
between equivocation and enthymeme. If by dog you mean all
we may use our general knowledge and may withdraw the infer-
members of Canis canis in the current domain, and I mean only
ence we made before, constructing a model that can be summa-
the ones who arent bitches, this could be described as equivoca-
rized as:
tion in the interpretation of a term (here across parties) or enthy-
matic suppression of a premise by one party (i.e., all relevant dogs If she has an essay, and the library is open, she is in the library.
are male). Modern developments of logic have provided formal
theories of this process of reasoning from general knowledge and he speciication of the models that are the objects of communi-
contextual speciicities to mutual interpretations. But in logics cation on this theory is an achievement of a successful discourse.
twentieth-century detour into the foundations of mathematics Each speciication of a situation determines a set of situa-
and the possibilities of knowledge engineering, this model of tions difering from it by permutations of the speciication: She
communication has largely been lost. does/doesnt have an essay and the library is/isnt open, and
Discourses come sentence by sentence, and each sentence Because of closed-world reasoning, there are no other students
has complicated and generally nonmonotonic efects on the or libraries or essays nor indeed much else in the current model
structure of the context in which subsequent sentences will be until we hear about them. It is these models that can be thought
interpreted. Entities (people, objects, states, processes, events) of as Shannon and Weavers code books. Far from getting the
get added to, but also subtracted from, the current model of book down from the shelf at the outset of the discourse, it is only
the discourse. hese efects are functions of both linguistic and at each stage of development of the discourse that we can see
nonlinguistic long-term knowledge, current perceptual circum- what code book has been speciied. But note that this was never
stance, and much else besides. So the meaning contributed by Chomskys complaint about Shannon and Weavers model. he
each sentence to the discourse is a complex function of more complaint was always about the ininity of the code, not about its
than its own or any other sentences structure. he general con- on-line construction.
ception of discourse semantics has been developed with great Such defeasible logics formalize the implicit model of com-
sophistication by H. Kamp and U. Reyle (1993) and their col- munication dominant in psycholinguistics the ield that has
leagues under the banner of discourse representation theory and most concerned itself with the empirical study of the process of
indeed by other approaches to discourse semantics, though still interpretation. he intended models correspond to what in psy-
with very little explicit connection to communication. cholinguistics is known as the gist of the discourse, or what W.
Whereas the classical logic of proof is monotonic in the sense Kintsch (1988), for example, calls the situation model. It is this
that adding more assumptions never removes valid conclusions, gist that was shown to be rapidly extracted from discourses in
defeasible logics of interpretation are nonmonotonic as new classical studies of text comprehension.
assumptions are added, earlier conclusions may be subtracted. Notice that the most plausible kinds of Shannon and Weaver
hese defeasible logics for reasoning to interpretations model signaling within the context of a fully speciied code book have to
the process of discourse interpretation. In a monologue, hear- do with temporal changes of state: She now has an essay to write
ers attempt to construct an interpretation of a speakers utter- and is in the library. She now doesnt have an essay and she is in
ances that makes the statements true, bringing with them all the library (shes inished perhaps). She now has no essay and is
available general knowledge and contextual information. When not in the library (having left?) what one might call the mono-
things go smoothly, this returns a unique minimal model at logue of the surveillance camera. here are kinds of human com-
every stage (van Lambalgen and Hamm 2004; Stenning and van munication that are like this (e.g., the stock ticker perhaps), but
Lambalgen 2007). he reason these defeasible logics can yield one only has to consider such examples to realize what a minor
unique intended models for discourses against a background of part they play. Creating local mutual interpretations is not sig-
large bodies of general knowledge is the extensive deployment of naling within their possibilities.
closed-world reasoning. If there is no evidence for the relevance Here is the beginnings of a general abstract model of human
of facts in the model, then we can conclude that they are not rel- communication based on logical theories of discourse process-
evant and at least not yet there. here are many subtleties about ing. In this model, communication is the construction of mutual
how we close the world in constructing the intended model, but interpretations for discourses. It requires an attendant theory
they are variants on this same idea. Whereas in classical logic of the structure of a language and of the organization of general
we have to search for counterexamples in usually ininite sets of knowledge databases, which might be fully consonant with lin-
logically possible models, closed-world reasoning gets us down guistic theories. But it is not to be confused with such theories.
to single, small, intended models of the discourse at each point Its objects of communication are models of discourses (inter-
in its development. pretations that make them true), not sentences or meanings.
To adapt an example from an experiment designed to invoke he contrast between this and other sentence-based theories of
defeasible reasoning to an interpretation, when we are presented communication can be well illustrated by considering the case of
with the following discourse, soliloquy. We do indeed talk to ourselves, either audibly or not,
and intuitively we talk to ourselves for some of the same reasons
She has an essay. we talk to other people, including to help understand what we
If she has an essay, she is in the library. believe or want, to formulate a course of action, to persuade our-
we duly conclude that she is in the library. But when we then selves to follow resolutions, to understand what someone said to
encounter, as the next sentence, us, or to weigh up pros and cons. Needless to say, this process is
178
Communication Communication, Prelinguistic
extremely important in learning, as is well testiied in the empiri- Kamp, H., and U. Reyle. 1993. From Discourse to Logic: Introduction
cal literature (e.g., Chi et al. 1989). To adapt an old saying, talking to Model heoretic Semantics of Natural Language, Formal Logic
to oneself is the irst sign of sanity, or at least the search for it. and Discourse Representation heory. Part II, Vol. 42, of Studies in
here is little temptation to understand soliloquy in terms Linguistics and Philosophy. Dordrecht, the Netherlands: Kluwer
Academic Publishers.
of the reduction of uncertainty, but we can apply the same logi-
Kintsch, W. 1988. he role of knowledge in discourse comprehension: A
cal model of communication as we use for public discourse. We
construction-integration model. Psychological Review 95: 16382.
would not need to talk to ourselves if our knowledge and belief Levinson, S. C. 2000. Presumptive Meanings. Cambridge, MA: MIT Press.
were a transparent, homogeneous, consistent database of facts Malinowski, B. 1923. he problem of meaning in primitive languages.
and principles driven by unconlicted motivations. We equiv- Supplement to he Meanings of Meanings: A Study of the Inluence of
ocate and suppress our assumptions in internal argument just Language upon hought and the Science of Symbolism, ed. C. K. Ogden
as well as in public, and successful argument with ourselves and I. A. Richards, 451510. London: Routledge and Kegan Paul.
can lead to the same kinds of revision in order to gain coherent Sperber, D., and D. Wilson. 1986. Relevance: Communication and
interpretations. Cognition. Oxford: Blackwell.
Certainly there are diferences between soliloquy and public Stenning, K., and M. van Lambalgen. 2007. Human Reasoning and
dialogue, but there are also enormous overlaps. In pursuing our Cognitive Science. Cambridge, MA: MIT Press.
van Lambalgen, M., and F. Hamm. 2004. he Proper Treatment of Events.
goals, it may often be a pragmatic matter of convenience whether
Oxford and Boston: Blackwell.
we choose to talk to ourselves or a conversational partner. hese
functions of soliloquy are functions shared with dialogical com-
munication, and they are functions that have been neglected in COMMUNICATION, PRELINGUISTIC
our thinking about communication.
Over the last 30 years, the ield of developmental psychology
One could, of course, reject the notion that soliloquy is com-
has devoted considerable research to prelinguistic communi-
munication and deine away these barriers to the consignment
cation, deined, most generally, as the sharing of information
of communication to Shannon and Weaver, but deeper consid-
prior to the onset of language. Because language onset is usu-
erations indicate that to do so is to miss much of what is crucial
ally identiied with the irst spoken words, the prelinguistic
about public communication. One should observe that this
period encompasses roughly the irst 12 to 18 months. Research
model is not so incompatible with Chomskys deeper views as at
reviews have been framed both in terms of age-related changes
might at irst appear. For example, his objections to functional
in infants interest in, and behavior during, social interactions
linguists who would see language shaped only by public com-
(e.g., Reddy 1999) and in terms of milestones for behaviors spe-
munication is that language evolution may have been driven as
ciically related to communication, such as visual regard, turn-
much by the advantages of an internal medium for represen-
taking, and gesture (Dromi 1993). hese two reviews point to a
tation and reasoning as by one for public communication. But
general agreement about the infant behaviors that are relevant to
with a more adequate theory of what communication is, and by
prelinguistic communication (e.g., gesturing and visual regard)
dropping the idea that communication is automatically public,
and about the ways in which those behaviors change during the
this view is entirely consistent with our claim that communica-
irst year of life. he major controversies in this area concern how
tion is about achievement of coherent interpretation, whether by
active a role infants play in structuring early episodes of com-
public utterance or internal soliloquy.
munication and how changes in cognitive functioning relate to
Lastly, the model may help to reconnect the cognitive and
changes in prelinguistic communication.
the afective perspectives on phatic and ideational communica-
Shortly after birth, infants recognize familiar people, and
tion alluded to at the outset. One observation is that the funda-
they can recognize their mothers by voice, face, and even by
mental basis of ideational communication is the achievement of
smell (e.g., DeCasper and Fifer 1980). hese perceptual capac-
mutually aligned interpretations. he process of getting to these
ities set the stage for infants social interactions to play a spe-
is, by deinition, phatic communication it creates community
cial role in prelinguistic communication; infants are interested
through shared interpretation of language. We perhaps for-
in and responsive to their caregivers, who interpret their early
get how disturbing are our rare experiences of complete failure
social behaviors as communicative. Even if infants are not yet
to achieve this happy state and, in so doing, fail to see that our
aware that others have emotions or ideas to share or that they
cognitive theories of ideational communication contain within
themselves might have the same, the fact that the social world
them abstract speciications of just what has to be achieved and
treats them as communicative partners is viewed as a critical fea-
maintained phatically, along with abstract accounts of some of
ture of social-pragmatic theories of language acquisition (e.g.,
the processes by which this might be done.
Tomasello 2006).
Keith Stenning Social smiling emerges at roughly six to eight weeks of age
and helps to mark the beginning of face-to-face, or en face,
WORKS CITED AND SUGGESTIONS FOR FURTHER READING interactions with caregivers, which are characterized by vocal
Chi, M., M. Bassok, M. Lewis, P. Reimann, and R. Glaser. 1989. Self-
turn-taking and by the sharing of afect (see review by Adamson
explanations: How students study and use examples in learning to 2003). Although the role of the infant in holding up the structure
solve problems. Cognitive Science 13: 14582. of these early en face interactions is controversial, it is clear that
Halliday, M. A. K. 2004. An Introduction to Functional Grammar. infants take an even greater role in initiating social interactions
London: Arnold. and maintaining their structure during the middle of the irst
179
Communication, Prelinguistic Communicative Action
year. his three-to-eight-month age range has been character- DeCasper, Anthony, and W. Fifer. 1980. Of human bonding: Newborns
ized as a time when infants become increasingly interested in prefer their mothers voices. Science 12: 30517.
regularity and surprise. Conventional games such as peek-a-boo Dromi, Ester. 1993. he development of prelinguistic communica-
become prominent, and infants begin to take the lead in initiat- tion. In At-Risk Infants: Interventions, Families, and Research, ed. N.
Anastasiow, 1926. Baltimore: Brookes Publishing.
ing these games as well as their turns in them.
Hollich, George J., K. Hirsh-Pasek, and R. M. Golinkof. 2000. Breaking
Infants interest in the attention of others during the last quar-
the language barrier: An emergentist coalition model of the origins
ter of the irst year has been viewed as an important milestone of word learning. Monographs of the Society for Research in Child
in prelinguistic communication. For the irst time, there is joint Development 65.3: 1-137.
attention, which refers to episodes when infants and partners are Piaget, Jean. 1983. Piagets theory. In Handbook of Child Psychology.
both engaged with the same object-in-the-world. Infants readily Vol. 1: History, heory, and Methods, ed. P. Mussen, 10326. New
follow the gaze of their partner and attempt to engage them in York: Wiley.
attending to the object of interest. Infants use expanded means Reddy, Vasudevi. 1999. Prelinguistic communication. In he
to garner the attention of others, including giving objects, show- Development of Language, ed. M. Barrett, 2550. Hove, East Sussex,
ing, and pointing. Gestures, especially pointing, have been the UK: Psychology Press.
subjects of intense study in the prelinguistic period because ges- Tomasello, Michael. 2006. Acquiring linguistic constructions.
In Handbook of Child Psychology. Vol 2: Cognition, Perception,
tures may be used to share ones focus of attention with another
and Language, 6th ed. Ed. D. Kuhn and R. Siegler, 25598. New
or to direct the attention of another (e.g., Bates 1979).
York: Wiley.
Also of great interest in late infancy is how new achieve-
ments in cognition might relate to changes in communication.
According to Jean Piagets theory of cognitive development (e.g., COMMUNICATIVE ACTION
1983), infants begin, in the latter half of their irst year, to under-
Communicative action is a term introduced by Jrgen Habermas
stand that objects exist when out of sight and that objects exist
as part of his attempt to develop a general theory of action for
independent of our actions on them. his achievement, termed
the social sciences. Among social theorists, it is widely believed
object permanence, is theorized to be a critical part of the devel-
that a purely instrumental (or economic) model of rational action
opment of symbolic functioning. he words of language are
is unable to account for the orderliness and stability of human
symbols, that is, arbitrarily spoken or written units that stand for
social interaction (Parsons 1968). Classical sociological theorists,
other objects and events. hus, the achievement of object per-
from Max Weber (1978) to Talcott Parsons (1951), tried to remedy
manence and symbolic functioning are important milestones
this defect by positing some additional category of value-oriented
setting the stage for the onset of formal language.
or norm-governed action that imposed constraints on the range
It is probably important, however, to view the transition
of strategically optimizing behavior. Absent from this analysis,
between prelinguistic communication and formal language as
however, was any precise speciication of the role that language
neither abrupt nor all-or-none. Clearly, there continue to be
played in mediating social interaction. Indeed, in many cases, it
important relations between cognitive development and lan-
was unclear how speech was supposed to it into the theory of
guage after infants speak their irst words; just as clearly, non-
action at all (Cicourel 1973, 21).
symbolic forms of communication continue throughout the life
Habermas took as his point of departure the observation
span (as, for example, in communication via physical actions or
that not only was a purely instrumental model of rational action
emotional expressions).
unable to explain the orderliness of social interaction, but it was
Current research in developmental psychology is focusing on
also unable to supply an adequate pragmatics for a theory of
processes that might be speciic to language learning, as well as
meaning. So instead of looking to values or norms for a speci-
on more general cognitive processes (such as categorization)
ication of the structure of noninstrumental rational action, he
that might be involved in early word learning (see review by
turned to speech-act theory. In particular, he looked to the
Hollich, Hirsh-Pasek, and Golinkof 2000). For example, infants
notion of illocutionary force, as developed by J. L. Austin
ability to ind patterns in auditory stimuli or to group objects
and John Searle (Habermas 1984, I: 293; Austin 1975; Searle 1969).
together if they share similar attributes are general cognitive pro-
His central intuition was that the limitations of Gricean (Grice
cesses that relate to the problem of learning a language (Hollich,
1989), or intentionalist (see communicative intention)
Hirsch-Pasek, and Golinkof 2000; Tomasello 2006). Delays in
these milestones of prelinguistic communication, and in the others
semantics might both reveal the limitations of a strictly instru-
mental approach to understanding the illocutionary dimension
noted here, have been the subject of early intervention programs,
of speech-acts and provide some indication of the structural
and deicits in prelinguistic communication skills have even been
features that a noninstrumental theory of rational action should
linked to speciic developmental disorders, such as autism.
exhibit. Once an account of the rationality of speech acts was
James A. Green and Gwen E. Gustafson developed, his thought was that this could be extended to pro-
vide a more general account of the rationality of linguistically
WORKS CITED AND SUGGESTIONS FOR FURTHER READING mediated interactions. It is the latter category of action that he
Adamson, Lauren B. 2003. he still face: A history of a shared experi- refers to as communicative action.
mental paradigm. Infancy 4: 45173. Although this theory is of primary relevance to social scien-
Bates, Elizabeth. 1979. he emergence of symbols: Cognition and commu- tists, it is also important to the study of language. Because of the
nication in infancy. New York: Academic. constraints imposed by the compositionality requirement,
180
Communicative Action
any plausible approach to the theory of meaning must incor- action. On the contrary, in producing an utterance, Habermas
porate some sort of division of labor between the semantics argues, speakers always associate a validity claim with its con-
and pragmatics (with the former taken to have a compositional tent, essentially extending a warrant to the efect that the rele-
structure, the latter typically not). heorists of language have, vant norms governing its production have been satisied. In the
however, sometimes been naive when it comes to understanding case of assertions, this takes the form of a truth claim (which is
the constraints that the theory of action imposes upon the prag- why, Habermas claims, to assert something is to assert it as true).
matics. For example, it is often simply assumed that individuals He then generalizes this analysis to suggest that imperatives are
are capable of rule-following at the level of social action; yet produced with an associated rightness claim, which warrants
rule-following is, at the level of general action theory, a deeply that the action mandated is in fact the correct one to perform. He
contested if not entirely problematic concept (e.g., see Bicchieri argues also that expressives are produced with an associated sin-
1993). Habermass concept of communicative action is impor- cerity claim, which he analyzes in an analogous manner (Heath
tant for showing not only how action theorists might learn from 1998). Appealing to Michael Dummetts (1993) assertability-
contemporary developments in the study of language but also conditional semantics, he then argues that grasping the condi-
how theorists interested in language might proit from greater tions under which these validity claims are satisied constitutes
attention to the structure of social action. an understanding of the meaning of an utterance. hus speech-
acts, insofar as they are meaningful, are necessarily governed by
Speech Acts a noninstrumental pragmatics.
Standard noncooperative game theory (or rational choice theory;
see games and language), which provides the canonical Communicative Action
modern formulation of the instrumental conception of rational Habermas concludes, on this basis, that instrumental action and
action, is the most widely adopted model of rational action in the speech constitute two elementary forms of action (1998, 118).
social sciences. It is, however, not a candidate for adoption as a he former is oriented toward success in the attainment of some
general theory of rational action because the model explicitly objective; the latter is oriented toward mutual understanding in a
excludes any communication between the parties to an interac- process of communication. Naturally, the term elementary form
tion (Nash 1951) and prohibits any action from having semantic should not be taken to suggest that language is a presocial phe-
content (see Farrell 1993). Furthermore, when these restrictions nomenon. he point is simply to identify two orientations that
are lifted, the standard equilibrium solution concepts are no lon- the agent is able to assume toward his or her environment before
ger valid, and no theorist has yet succeeded in developing cor- considering the implications of introducing a second rational
relates that exhibit the same stability properties (Farrell 1993; agent into the frame of reference. he introduction of a sec-
Heath 1996). In other words, linguistic communication so far ond agent, in Habermass terms, generates social action. Social
does not it into the model of action favored by rational choice action, in this view, is a complex phenomenon constructed out of
theorists. the interaction of the two elementary forms. he most immedi-
It is against this background that Habermass theory of ate consequence of introducing a second agent is that it places
communicative action must be assessed. he central question them both in the position that Parsons referred to as double
is: What properties do speech-acts possess that make them contingency what the irst agent wants to do will depend upon
unsuitable for purely instrumental use? he most obvious what he or she expects the second to do and vice versa (Parsons
answer, in the case of assertions, is that they are subject to a 1951, 1011). hus, agents engaged in interaction are always in a
norm of veridicality (and produced, in the standard run of cases, position where they must coordinate their action-plans, even if
with at least the pretense of satisfying that norm). Among phi- this means simply developing a stable set of expectations against
losophers of language, this norm is commonly regarded not as a which they can each proceed to pursue their private objectives
convention that happens to govern the production of assertions (Habermas 1998, 221).
but as a norm that is internally connected to the meaningful- In Habermass view, this problem of interdependent expecta-
ness of these expressions (see truth conditional seman- tions can be resolved by drawing upon the resources of either
tics). Absent such a norm, not only would no utterance be elementary action type. When instrumental action is assigned
credible, but it is not clear that the language itself would even priority, social action takes the form of strategic action, in the
be learnable. If speakers simply claimed whatever happened to standard game-theoretic sense. In this context, the resources of
be in their interest at the time to claim, the connection between language are used only to supply the content of the intentional
semantic conventions and patterns of use would essentially be states beliefs and preferences that serve as parameters of the
scrambled. strategic optimization problem. However, when the resources
Habermas articulates this idea by claiming that in order to of language are used to resolve the coordination problem, this
produce a meaningful utterance, speakers must adopt what he use generates the form of action that Habermas refers to as
calls the performative stance, whereby they bracket the more communicative action. he diference is that communicative
mundane instrumental objectives that they may be pursuing action draws upon the commitments made, in the form of valid-
and adopt the standard intracommunicative objective of reach- ity claims, in order to limit the range of action alternatives that
ing mutual understanding. his is essentially a cooperative are available (thus, the consensus achieving force of linguistic
undertaking, and so even though it may be pursued as a means processes of reaching understanding the binding and bonding
to securing other, extracommunicative objectives (indeed, this energies of language itself becomes efective for the coordina-
is almost always the care), it is not itself a system of instrumental tion of action [Habermas 1998, 221]).
181
Communicative Action Communicative Intention
It is important to note that communicative action is not the Farrell, Joseph. 1993. Meaning and credibility in cheap-talk games.
same as speech. It is a form of teleological action, in the sense Games and Economic Behavior 5: 51431.
that agents continue to pursue extralinguistic objectives. he Grice, H. P. 1989. Studies in the Ways of Words. Cambridge: Harvard
distinguishing characteristic is that they use language in order University Press.
Habermas, Jrgen. 1984. he heory of Communicative Action. 2 vols.
to solve the problem of double contingency by establishing a
Trans. homas McCarthy. Boston: Beacon Press.
set of shared goals and norms (rather than simply using lan-
. 1996. Between Facts and Norms. Trans. William Rehg. Cambridge,
guage to identify background beliefs and preferences). his use MA: MIT Press.
of language as an explicit coordination mechanism imposes . 1998. On the Pragmatics of Communication. Ed. Maeve Cooke.
constraints on the type of goals that agents can pursue and the Cambridge, MA: MIT Press.
means they can employ. hus, communicative action, despite . 2001. On the Pragmatics of Social Interaction. Trans. Barbara
being teleological in form, is not merely a species of instrumental Fultner. Cambridge, MA: MIT Press.
action. At most, it represents a type of constrained instrumental Heath, Joseph. 1996. Is language a game? Canadian Journal of
action. hus, it is the use of language to coordinate social inter- Philosophy 26: 128.
action, Habermas claims, that provides the explanation for the . 1998. What is a validity claim? Philosophy and Social Criticism
orderliness and stability of human social interaction. he central 24: 2341.
. 2001. Communicative Action and Rational Choice. Cambridge,
error of the classical sociological action theorists, according to
MA: MIT Press.
this view, rested with their focus upon a practical, rather than a
Nash, John. 1951. Noncooperative games. Annals of Mathematics
communicative, conception of rationality (Habermas 1996, 3). 54: 28995.
Parsons, Talcott. 1951. he Social System. New York: Free Press.
Practical Discourse . 1968. he Structure of Social Action. 2 vols. New York: Free Press.
Finally, it is worth mentioning a further distinction in Habermass Parsons, Talcott, and Edward Shils, eds. 1951. Towards a General heory
work, between communicative action and practical discourse. of Action. New York: Harper and Row.
he orderliness of linguistically mediated interactions (i.e., com- Searle, John. 1969. Speech Acts. Cambridge: Cambridge University Press.
municative action) is achieved by the binding-bonding efects Weber, Max. 1978. Economy and Society. 2 vols. Ed. G. Roth and C.
of the validity claims raised within speech-acts. It is the rational Wittich. Berkeley: University of California Press.
acceptance of these claims by listeners that makes it rational, in
turn, for them to accept any constraints on their conduct that
may arise as a consequence. However, this process of acceptance COMMUNICATIVE INTENTION
is usually only tacit and in many cases relies merely upon the Late twentieth-century discussion of the nature of commu-
speakers warrant. his means that should the listener suddenly nicative intention was dominated by the theories of British
experience doubts during the course of the subsequent interac- philosopher Herbert Paul Grice. Grice initially (1957) argued
tion, it is always legitimate for him or her to go back and demand that the primary intended efect of an indicative utterance
further justiication (i.e., request that the speaker redeem some was to get the hearer to believe the proposition expressed; an
validity claim that was associated with the speech-acts). As a essential component of this communicative intention was the
result, the potential for critical scrutiny of social practices is intention to have this efect be achieved through the hearers
always present, in every society and culture, even if not explicitly recognition of that intention. He eventually acknowledged that
institutionalized. Such a demand for justiication interrupts the there were counterexamples to this analysis and subsequently
sequence of communicative action and shifts the participants (1968; 1969, 1712) proposed that the primary communica-
into discourse, where contested validity claims are relexively tive intention must be that the hearer should at least come to
thematized and debated. Contested rightness claims are discur- believe that the utterer has some particular thought or belief.
sively tested in a forum that Habermas refers to as practical dis- Grice also allowed that speakers need not intend to change
course, which is governed by a set of distinctive inference rules, the attitudes of some speciic, actual audience; instead, this
in particular a universalization rule that serves as the foundation part of the communicative intention concerns what is meant
for the theory of discourse ethics. he distinction between prac- to happen should there be an audience having such-and-such
tical discourse and communicative action is important, in this characteristics.
regard, because it is only the former that is directly governed by Setting aside some of the many reinements (1989, 86116),
the universalization constraint. Grices characterization of communicative intention runs as
Joseph Heath follows:
An utterance, U, is made with a communicative intention if and

WORKS CITED AND SUGGESTIONS FOR FURTHER READING only if the utterer, S, utters U with an intention comprised of three
Austin, J. L. 1975. How to Do hings with Words. 2d ed. Cambridge: Harvard subintentions:
University Press.
Bicchieri, Cristina. 1993. Rationality and Coordination. Cambridge: (1) Ss utterance U is to produce a certain response, R, should
Cambridge University Press. there be an audience, A, having characteristics, C;
Cicourel, Aaron. 1973. Cognitive Sociology. Harmondsworth, UK:
Penguin.
(2) A is to recognize Ss intention (1);
Dummett, Michael. 1993. What is a theory of meaning? (II) In he Seas (3) As recognition of Ss intention (1) is to function as at
of Language, 3493. Oxford: Clarendon Press. least part of As reason for having response R.
182
Communicative Intention Comparative Method
It is assumed here that S communicates a belief to some audi- expression is the performance of an observable action as an
ence, A, just in case As recognition of Ss communicative inten- indication of some attitude, where some x indicates some y
tion yields R, where R is the formation of the relevant belief in whenever x provides some (possibly unreliable) evidence that
A. For example, Sally said Congratulations! with communi- y is the case.
cative intent just in case she meant her saying to congratulate
Paisley Livingston
the person to whom she was speaking and meant for this inten-
tion to be recognized by that person; she also had to intend for
that very recognition to be a reason for the recognition of the
congratulation. Bennett, Jonathan. 1976. Linguistic Behavior. Cambridge: Cambridge
Peter F. Strawson (1964) challenged the suiciency of the University Press.
loop or mechanism constituted by subintention (3). Suppose that Davis, Wayne A. 2003. Meaning, Expression, and hought.
Karen thinks that if her tennis racket is lying on the kitchen table,
Grice, Herbert Paul. 1957. Meaning. Philosophical Review 66: 37788.
her friend Laura will think Karen plans to play tennis that day.
. 1968. Intentions and speech acts. Analysis 29: 10912.
Karen knows that Laura is watching her, and she also knows that . 1969. Utterers meaning and intentions. Philosophical Review
Laura does not know that Karen knows Laura is watching. Karen 78: 14777.
then puts the racket on the table with the intention of getting . 1989. Studies in the Way of Words. Cambridge: Harvard University
Laura to believe Karen plans to play tennis. Karen also intends Press.
that Lauras recognition of the latter intention will give Laura Holdcroft, David. 1978. Words and Deeds: Problems in the heory of
reason to believe that Karen in fact means to play tennis. hus, Speech Acts. Oxford: Clarendon.
all three clauses in Grices deinition have been satisied. Yet in Recanati, Franois. 1986. On deining communicative intentions. Mind
such a situation, Strawson argues, Karen has not communicated, and Language 1: 21342.
at least in Grices sense, to Laura that she plans to play tennis. So Schifer, Stephen. 1972. Meaning. Oxford: Clarendon Press.
Strawson, Peter F. 1964. Intention and convention in speech acts.
what is missing? Karen must intend not only that Laura recog-
Philosophical Review 73: 43960.
nize her intention to get Laura to think she plans to play tennis
but also that Laura recognize her intention to get Laura to recog-
nize her intention to get Laura to think so.
COMPARATIVE METHOD
Grice and other philosophers (e.g., Schifer 1972; Holdcroft
1978; Recanati 1986) have explored various responses to the Genetic Relatedness and Common History
problem raised by Strawson. Grices own preferred response to Languages sharing a period of common history in a single ances-
the problem was to allow that meaning requires an ininite set of tral language are genetically related. he comparative methods
intentions. Yet this condition is to be understood as deining the (henceforth CM) goal is to demonstrate genetic relatedness by
optimal state in relation to which actual communicative states identifying similarities attributable to retention from common
are measured. He contends, then, that strictly speaking, no history. Demonstrations of genetic relatedness are presented as
speaker actually means that p in the sense of actually having the reconstructions (or proto- forms) in the hypothetical ancestral
set of ininite intentions required for ideal, non-natural meaning, system. Genetically related languages comprise a language
but he adds that the speaker is in a situation which is such that family, relationships within which are identiied through
it is legitimate, or perhaps even mandatory, for us to deem him female kin terms. A form inherited in a daughter is a relex of the
to satisfy this unfulillable condition (1989, 302). Grices justi- mothers form. Relexes of the same form in diferent languages
ication for this move inds its roots in his views concerning the are cognates.
status of the normative rationality assumptions relied upon in Languages sharing a period of common history independent
the entrenched and self-justifying system of both everyday and of the rest of the family constitute a subgroup. Subgrouping is rep-
philosophical psychology. He evokes the diference between the resented as a family tree. (Wave models are an alternative permit-
titular and factual character of an utterance, where the former is ting representation of overlapping shared innovations. Textbooks
its idealized, rational character, never actually present in toto, like Hock [1986, Chap.15] compare the two.) In determining
and where the latter could be a matter of a pre-rational coun- genetic relatedness, one seeks evidence of shared retention from
terpart of meaning (1989, 856). Yet it seems unsatisfactory a proto-language; in subgrouping, one seeks evidence of shared
to conclude that Karen could never, strictly speaking, actually innovation after the family began to diverge. In subgrouping, one
communicate to Laura that she wants to play tennis! must identify unique events common to the history of the sub-
Wayne A. Davis (2003) develops an alternative approach to group. he features deining the subgroup must demonstrably
the relation between communication and semantic intentions. not be retentions from an earlier period of common history. (On
Not all instances of communication are intentional, and cases subgrouping arguments, see Harrison 2003, 3.2).
of intentionally communicating something to someone are ana- Because a language is not an organism passing on genetic
lyzed as doing something that expresses a mental state, where material but behaviors and underlying knowledge, common
this action is the basis of an audiences recognition that the history is a property not of languages but the constructions con-
mental state is expressed. What is expressed depends on what stituting them (see construction grammars). A borrowing
is intended, but that does not mean that the hearer has to rec- and its source also share common history, and so genetic related-
ognize the speakers communicative intention for intentional ness privileges common history involving transmission between
communication to take place. he intentional component of individuals speaking the same language. For mixed languages
183
Comparative Method
Table 1.
plant paddle needleish thin forehead pandanus fold
Trukese ftuk-i ftun taak mlii-lif chaamw faach n-num
Mokilese poadok padil doak manip-nip soamw -par lim
Gilbertese arok-a arina raku m-manii ramwa ara- num
PMC *faSok- *faSla *[sS]aku *ma-nii(nii) *camwa *faca *lumi
Table 2.
Trukese f t k n ch m mw aa u u i
Mokilese p d k l r m mw oa a oa a i o i i
Gilbertese r k n r m mw a a a a a u u o i i
PMC *f *S *k *l *c *m *mw *a *u *o * *i
(homason 2001, 70 f), speaking the same language is diicult Mokilese, raisings conditioned by the vowel in the following
to deine. Genetic relatedness may require redeinition to incor- PMC syllable, and lengthenings, by PMC syllable structure. Each
porate language mixing. correspondence set, or set of conditioned sets, is reconstructed
as a phoneme of the proto-language (see historical recon-
Deining the Comparative Method struction or textbooks like Campbell 2004, Chap. 5).
he CM emerged from nineteenth-century research largely on he regularity assumption is vacuous unless the sound
Indo-European. It demonstrates genetic relatedness by distin- changes yielding conditioned correspondences are constrained,
guishing cross-linguistic similarities due to retention from those since any correspondence is regular if its conditioning environ-
due to chance, borrowing, or the nature of language. Four (not ment is suiciently narrow (for example, a single morpheme).
necessarily mutually exclusive) approaches to identifying shared he neogrammarians restricted conditioning to purely phonetic
retentions are now considered. environments (see Hale 2003, 343).
Chance similarities like Mokilese padil paddle are iden-
NEGATIVE SIEVING. Similarities due to borrowing or the nature of tiiable because there is no regular correspondence between
language must be identiied. One expects similarity in onomato- Mokilese /p/ and English /p/, as there is between Mokilese /p/,
poetic and schematic constructions, where the form is to a degree Trukese /f/, and Gilbertese . Of course, distinguishing regular
iconic of the meaning. Such natural similarities are eliminable by from chance correspondences is a function of token frequency.
restricting comparison to symbols, whose form/meaning relation Lexical replacement over suicient time or in some contact
is arbitrary. (On restrictions on the CM, see Harrison 2003.) here is situations may reduce the number of tokens exhibiting regular
less consensus regarding what is (not) likely to be borrowed. Some correspondence so that regular correspondence is indistinguish-
have argued for a core lexicon resistant to borrowing. For an assess- able statistically from chance. (Statistical methods are seldom
ment, see homason (2001, 71 f). Although constructions with used in the standard CM. Quantitative methods proposed in
grammatical meaning (like conjunctions or adpositions) are less subgrouping include lexicostatistics and, more recently, compu-
often borrowed than those with lexical meaning (like most nouns tational cladistics see McMahon and McMahon 2005.)
or verbs), Sarah Grey homason and Terrance Kaufman (1988) Borrowings may be identiiable similarly if their number is
have demonstrated that, in principle, anything can be borrowed. small. Large-scale borrowing is often recognizable as a parallel
set of apparently regular correspondences, as in Rotuman. Bruce
REGULAR SOUND CORRESPONDENCE AND THE STANDARD CM. Biggs (1965) was able to associate one set of Rotuman correspon-
Most comparativists identify demonstrations of genetic related- dences with native vocabulary and another with Polynesian bor-
ness through regular sound correspondence as the standard CM. rowings. Latinate borrowings into English might be identiiable
To rule out chance and, to some extent, borrowing as accounts of similarly if we did not already know their history.
similarity, the standard CM exploits the neogrammarian move- he identiiability of borrowings as irregular or parallel cor-
ments regularity assumption that every sound change, inas- respondences is inhibited when:
much as it occurs mechanically, takes place according to laws
that admit no exception (Osthof and Brugmann 1967, 204). (i) the source language cannot be identiied, or:
he Micronesian data and Proto-Micronesian (PMC) recon- (ii) the source is a related language spoken by many target
structions (largely from Bender et al. 2003) in Table 1 exemplify language speakers, who apply their knowledge of source/tar-
regular sound correspondence (where - indicates a morpheme get correspondences to nativize borrowings, or:
boundary) from which emerge the correspondences in Table 2. (iii) later phonological changes mask earlier borrowings.
Multiple correspondences with a single reconstructed pho-
neme relect context-dependent sound changes. In Tables 1 In cases of massive borrowing across many languages, as
and 2, they include the loss of short inal vowels in Trukese and reported in Grace (1996) for New Caledonia, the number of
184
Comparative Method
regular correspondences proliferate to the point that one must chance also qualify as individual identiiers. Nichols (1996, 50)
reconstruct a proto-language phonemic inventory far larger than cites Proto-Indo-European *widhew-a widow. It is crucial that
that of any of its daughters and possibly larger than one would her example is a reconstruction. he existence of that word may
consider natural. In such cases, the standard CM fails. be statistically unattributable to chance, but one must have coni-
It is vital in comparison that there be some measure of simi- dence that its relexes (including Sanskrit vidhav, Greek itheos,
larity to show that we are comparing likes with likes. he CM Latin vidua, Old Irish febd Russian , OldEnglish widuwe)
says little about similarity in meaning. Comparativist practice are suiciently similar to be instances of the same word. Regular
favors meaning identity. Since semantic similarity remains ill- sound correspondences are necessary to give that conidence.
deined, one is guided by experience and common sense. For
phonetic similarity, we can appeal to phonetic theories. What MASS COMPARISON. he logic of mass comparison (as in
is seldom appreciated is that the standard CM does not need a Greenberg 1987) is that by identifying a very large number of
theory of phonetic similarity because the regularity assumption similar constructions in many languages, one statistically rules
is a stand-in. We dont need to know that two sounds are similar, out similarity due to chance or to borrowing. Most comparativ-
only that there is a regular correspondence between them. Much ists do not regard mass comparison as an instance of the CM. he
of the data for modern theories of phonetic similarity undoubt- volume of criticism leveled against it has been vast. (For a short
edly came from regular correspondences identiied by the CM. review, see McMahon and McMahon 2005, 1926.)
he empirical validity of the regularity assumption has been he fundamental objection is the same as that just raised to
controversial from the outset. hose opposed to the neogram- the independence of shared aberrancies. Mass comparison pro-
marian position asserted that each word has its [own] history. vides no measure of similarity. here is no statistical reason to
A current manifestation of this opposition is lexical diffusion consider signiicant the identiication of numerous vague simi-
(Chen and Wang 1975), the view that sound changes move larities in many languages unless attested in the same forms in
through the lexicon, afecting diferent words at diferent times. most of the languages compared. hat does not seem to be true
Words yet unafected will appear to be exceptions to the change. in those cases in which mass comparison has been used.
For example, the shortening of Early Modern English (ENE) /u:/,
as in good (ENE /gu:d/, English /gd/), has yet to afect food, and Summary
has afected roof only in some dialects. Any method for determining genetic relatedness must provide a
Examples of nonphonetic conditioning are less often cited. he similarity measure and a means of distinguishing shared reten-
Micronesian [aa]-[oa]-[a] correspondence set in Table 2 might be tions from other sources of similarity. None is without laws and
such a case. In Trukese and some other Micronesian languages, limitations. he standard CM is unique in deining similarity
lengthening afects only the V1 of PMC (C1)V1(C2)V2 nouns. If, as through regularity. In some cases it will fail, but less often in prin-
other evidence suggests, these nouns are the residue of a process ciple than the search for shared aberrancies, which depends on
afecting all prosodic phrases, phonetic conditioning is preserved. the existence of data of a restricted sort.
Since the regularity assumption is crucial to the CM, it may he quantitative methods remain to be tested, but there is
be problematic were it proven false. hough William Labov reason to doubt that they can replace the standard CM, or even
(1981) argues that the regularity assumption holds for one class supplement it where the latter fails. hese methods have been
of sound changes while others difuse through the lexicon, we applied to the subgrouping problem, not the genetic related-
need not rely on his assessment to save regularity. Sound change ness problem, and the two difer crucially. he subgrouping
might begin variably, but given enough time, it moves toward problem is the search for the best tree for a set of languages
regularity. (See Durie and Ross 1996 for a range of perspectives already assumed to be genetically related. hese mathematical
on the regularity assumption.) techniques can determine genetic relatedness only if they fail to
incorporate unrelated languages into the trees they generate by
SHARED ABERRANCIES. A shared aberrancy is a correspondence failing to identify cognates. And as long as a measure of similarity
between lexically or morphologically related forms so unusual as is required to identify cognacy, something equivalent to the reg-
to be unattributable to chance or borrowing. An oft-cited exam- ularity assumption of the standard CM remains essential.
ple is the 3s/3p alternations in the present tense of to be in Indo-
S. P. Harrison
European:
Sanskrit sti snti
Latin est sunt Bender, Byron W., Ward H. Goodenough, Frederick H. Jackson, Jefrey C.
Marck, Kenneth L. Rehg, and Ho-min Sohn. 2003. Proto-micronesian
Old High German ist sind
reconstructions 1. Oceanic Linguistics 42.1:1110.
Biggs, Bruce. 1965. Direct and indirect inheritance in Rotuman. Lingua
For many comparativists, like Lyle Campbell (2003, 268 f),
14: 383445.
shared aberrancies are an alternative to regular correspondences
Campbell, Lyle. 2003. How to show languages are related: Methods for
in demonstrating genetic relatedness. Others, like Johanna distant genetic relatedness. In Joseph and Janda 2003, 26282.
Nichols (1996), insist that only shared aberrancies or other indi- . 2004. Historical Linguistics: An Introduction. Cambridge, MA: MIT
vidual identiiers demonstrate genetic relatedness. She con- Press.
signs regular correspondence a subsidiary role in subgrouping. Chen, M., and W. S-Y Wang. 1975. Sound change: Actuation and imple-
Similarities in words suiciently long to be unattributable to mentation. Language 51: 25581.
185
Competence
Durie, Mark, and Malcolm Ross, eds. 1996. he Comparative Method rules and principles, unlike Saussures langue, which was essen-
Reviewed: Regularity and Irregularity in Language Change. tially a taxonomic inventory of grammatical elements.
Oxford: Oxford University Press Chomsky has always considered competence a psychological
Grace, G. W. 1996. Regularity of change in what? In Durie and Ross construct, deining it as the speaker-hearers knowledge of his
1996, 15779.
language (1965, 4). Hence, support for the notion tends to be
Greenberg, Joseph H. 1987. Language in the Americas. Stanford,
derived from the apparent disparity between our mental repre-
CA: Stanford University Press.
Hale, Mark. 2003. Neogrammarian sound change. In Joseph and Janda
sentations of grammatical patterning and the actual use of lan-
2003, 34368. guage in communication. So it is frequently pointed out that
Harrison, S. P. 2003. On the limits of the comparative method. In Joseph the structural principles that characterize grammars are far from
and Janda 2003, 21343. being in a one-to-one relation with the principles and conven-
Hock, Hans Heinrich. 1986. Principles of Historical Linguistics. tions governing use (Newmeyer 1998). More direct psychological
Berlin: Mouton de Gruyter. evidence for competence has been adduced from observations
Joseph, B., and R. Janda, eds. 2003. he Handbook of Historical Lingusitics. about child language learning. Experimentation has shown that
Oxford: Blackwell. even very young children exhibit subtle grammatical knowledge
Labov, William. 1981. Resolving the neogrammarian controversy. that points to their possessing a cognitive system encoding strictly
Language 57: 267308.
grammatical facts. For example, one-word speakers between 13
McMahon, April, and Robert McMahon. 2005. Language Classiication by
and 15 months know that words presented in strings are not iso-
Numbers. Oxford: Oxford University Press.
Nichols, Johanna. 1996. he comparative method as heuristic. In Durie
lated units but are part of larger constituents; one-word speakers
and Ross 1996, 3971. between 16 and 19 months recognize the signiicance of word
Osthof, Hermann, and Karl Brugmann. 1967. Preface to morphological order in the sentences that they hear; and 28-month-old chil-
investigations in the sphere of the Indo-European languages I. In A dren who have productive vocabularies of approximately 315
Reader in Nineteenth Century Historical Indo-European Linguistics, ed. words and who are speaking in four-word sentences can use a
W. P. Lehmann, 197209. Bloomington: Indiana University Press. verbs argument structure to predict verb meaning (Hirsh-Pasek
homason, Sarah Grey. 2001. An Introduction to Language Contact. and Golinkof 1996). here also appears to be neurological evi-
Edinburgh: Edinburgh University Press. dence for the competence-performance dichotomy. Numerous
homason, Sarah Grey, and Terrance Kaufman. 1988. Language Contact,
pathological cases have been observed in which grammatical
Creolization, and Genetic Linguistics. Berkeley: University of California
abilities are lost while other cognitive faculties are preserved,
Press.
and vice versa (Pinker 1994).
Some linguists have applied the notion of competence to a far
COMPETENCE broader range of abilities than the sort of grammatical knowledge
outlined here. For example, Dell Hymes coined the term com-
he competence-performance dichotomy lies at the center of
municative competence as the most general term for the speak-
transformational grammar, the linguistic theory introduced
ing and hearing capacities of a person (1971, 16). A broadened
by Noam Chomsky in the late 1950s (Chomsky 1957). Virtually all
notion of competence was soon applied to such capacities as the
current approaches to grammatical theory that descended from
ability of bilinguals to switch languages appropriately (Gumperz
Chomskys original work take the dichotomy as their starting point.
1972), the proper control of stylistic registers (White 1974), the
In brief, competence represents the system of abstract structural
ability of readers to fathom aspects of literature properly (Culler
relationships that characterize grammars, and performance the
1975), and even the use of language by doctors in emergency
faculties involved in putting that knowledge to use. It is generally
wards (Candlin, Leather, and Bruton 1976). he all-too-easy met-
assumed that performance is determined in part by competence,
aphorical extension of the ordinary English word competence has
but is also a function of physiology, the communicative and social
led Chomsky and others to avoid use of the term in recent years.
aspects of language, and general cognitive architecture.
Rather, it has become standard to use the term i-language
Competence and performance are modern reinterpretations
(short for internalized language). In this usage, I-language con-
of the dichotomy between language and speech, which was
trasts not with performance but with E(xternalized)-language.
bequeathed to the ield about a century ago by the great Swiss
Finally, it should be mentioned that some linguists have
linguist Ferdinand de Saussure ([1916] 1966). he French words
questioned the existence of the competence-performance
that Saussure used for language and speech, langue and
dichotomy on the basis of the belief that grammatical structure
parole respectively, are still encountered today: For Saussure,
is an emergent property of language use (see, for example,
langue represents the structural system at the heart of language
Langacker 1987 and Bybee and Hopper 2001).
a system shared by all members of the speech community; parole
is the individual act of speaking. Saussure compared language Frederick J. Newmeyer
to a symphony. Langue represents the unvarying score, parole
the actual performance, no two of which are identical. Rather WORKS CITED AND SUGGESTIONS FOR FURTHER READING
than sticking with langue and parole, Chomsky coined the new
Bybee, Joan L., and Paul Hopper, eds. 2001. Frequency and the Emergence
terms competence and performance since he wished to of Linguistic Structure. Vol. 45. of Typological Studies in Language.
underscore two important diferences between competence and Amsterdam: John Benjamins.
langue: Competence for Chomsky encompasses syntactic rela- Candlin, Christopher N., Jonathan H. Leather, and Clive J. Bruton. 1976.
tionships, despite Saussures consignment of much of syntax Doctors in casualty: Applying communicative competence to compo-
to parole; and competence is characterized by a set of generative nents of specialist course design. IRAL 14: 24572.
186
Competence and Performance, Literary
Chomsky, Noam. 1957. Syntactic Structures. Vol 4. of Janua Linguarum that just as the goal of the analysis of a language is not the descrip-
Series Minor. he Hague: Mouton. tion of a corpus of utterances but an explicit account of the lin-
. 1965. Aspects of the heory of Syntax. Cambridge, MA: MIT Press. guistic competence of speakers of the language, so ought the goal
. 1988. Language and Problems of Knowledge: he Managua of poetics and quite possibly of literary study generally not be the
Lectures. Vol. 16 of Current Studies in Linguistics. Cambridge, MA: MIT
analysis and interpretation of literary works but an account of the
Press. Chomskys most readable defense of the notion competence.
rules, conventions, and procedures that enable readers to make
Culler, Jonathan. 1975. Structuralist Poetics. Ithaca, NY: Cornell University
Press.
sense of literary works as they do (1975, viii, 215, 301, 11330).
Gumperz, John. 1972. he communicative competence of bilin- His account stresses, for example, the shared knowledge and
guals: Some hypotheses and suggestions for research. Language in processing techniques that enable readers to grasp the plot of a
Society 1: 14354. narrative (a matter on which considerable agreement usually can
Hirsh-Pasek, Kathy, and Roberta Golinkof. 1996. he Origins of be reached) and to construct characters from the implicit and
Grammar: Evidence from Early Language Comprehension. Cambridge, explicit information scattered through a text, as well as to engage
MA: MIT Press. in the thematic and symbolic interpretation that the institution
Hymes, Dell. 1971. Competence and performance in linguistic theory. of literature encourages (ibid., 189238). He also stresses the dis-
In Language Acquisition: Models and Methods, ed. Renira Huxley and tinctive assumptions and operations involved in making sense of
Elisabeth Ingram, 324. New York: Academic Press.
a lyric poem, such as a presumption of signiicance, the relevance
Langacker, Ronald W. 1987. Foundations of Cognitive Grammar. Vol.
of sound patterning, and so on (ibid., 13188).
1: heoretical Prerequisites. Stanford, CA: Stanford University Press.
Newmeyer, Frederick J. 1998. Language Form and Language Function.
Culler presents literary competence as a revision of the
Cambridge, MA: MIT Press. framework and goals of literary studies, an attempt to integrate
Pinker, Steven. 1994. he Language Instinct: How the Mind Creates the accomplishments of structuralism and narratology
Language. New York: Morrow. An entertaining, but still serious, dis- in literary studies with the program of a generative linguistics,
cussion of competence and performance, bringing in evidence from but others have suggested that taking the concept and the model
many diferent areas of investigation. of generative grammar seriously would lead to a genera-
Saussure, Ferdinand de. [1916] 1966. Course in General Linguistics. New tive poetics. As a description of competence, a fully adequate
York: McGraw-Hill. Translation of Cours de linguistique gnrale. grammar must assign to each of an ininite range of sentences
Paris: Payot.
a structural description indicating how the sentence is under-
Smith, Neil, and Ianthi-Maria Tsimpli. 1995. he Mind of a
stood by the ideal speaker-hearer (Chomksy 1965, 45). Ellen
Savant: Language Learning and Modularity. Oxford: Blackwell.
Schauber and Ellen Spolsky maintain that [a] generative poet-
Support for competence from Christopher, a savant who is severely
impaired cognitively but can learn a language virtually overnight. ics, therefore, will need to describe the derivation of competing
White, Ronald. 1974. Communicative competence, registers, and sec- well-formed interpretations and to distinguish them from inade-
ond language teaching. IRAL 12: 12741. quately derived interpretations (1981, 397). Calling Cullers con-
ception of literary competence focused on literary conventions
and distinctive interpretive operations intolerably restrictive,
COMPETENCE AND PERFORMANCE, LITERARY
Schauber and Spolsky propose that a generative poetics should
Literary competence, by analogy with Noam Chomskys concepts integrate three competencies: linguistic competence, commu-
of linguistic competence and performance, is the implicit nicative competence, and literary competence, on the principle
knowledge that enables readers to process literary works as they that literary competence in Cullers sense could never lead to the
do, connecting elements and deriving meaning; performance derivation of well-formed interpretations (ibid., 398; 1986).
would be their actual engagements with literary works. he con- Critiques of the concept of literary competence have sug-
cept of literary competence works to highlight the importance in gested that Chomskys speciication of the competence of an
literary studies of a poetics that describes the conventions and ideal speaker-hearer makes the concept of competence inher-
interpretive operations that make possible the intelligibility of ently elitist. Joseph Dane, while disputing the parallel between
literary works, as opposed to a hermeneutics (see philology linguistic and literary competence, contrasts a technical sense of
and hermeneutics) that seeks to develop new interpreta- competence as knowledge that makes any literary performance
tions. It is also a claim about the relation between linguistics and (including interpretation) possible with the everyday sense
literary study: Rather than apply techniques of linguistic analy- where competence is a matter of qualiications and credentials
sis directly to the language of literary works, it is more fruitful to (1986, 53, 59). Despite Cullers argument that literary competence
attempt to take from linguistics the methodological model for does not involve a supposition that readers will agree upon an
the construction of a poetics. interpretation but only that there are literary conventions that
Chomsky makes a fundamental distinction between compe- guide interpretation and make possible some conclusions and
tence (the speaker-hearers knowledge of his language) and perfor- not others, Dane argues that a principle of stability must remain.
mance (the actual use of language in concrete situations) (1965, Some of us possess this competence; others of us must go to the
4). he notion of literary competence is introduced, on the anal- university to learn how to be perceptive and competent (ibid.,
ogy with linguistic competence, in Jonathan Cullers Structuralist 60); [c]ompetence is simply that which is possessed by the most
Poetics (1975). Rejecting corpus-based versions of descriptive powerful leaders of the literary community (ibid., 72).
linguistics, Chomsky argues that the task of linguistics is not the he prestige of interpretation in literary studies, where the
discovery of regularities in a corpus but a modeling or rendering task of the critic is to produce a more powerful interpretation,
explicit of the speaker-hearers implicit knowledge. Culler argues has blocked the program of the study of literary competence as
187
Compositionality
something shared by readers, though it is implicit in any account compositional procedure. herefore, compositionality is a nec-
of narratology, for example, or of literary interpretation generally. essary property of any semantics of natural language that claims
he cognitivist turn in literary studies (Turner 1996) provides an complete coverage. he result, however, leaves open what the
opportunity for returning to aspects of literary competence and lexical expressions of natural language are and how many com-
the key question raised by the Chomskian model of the extent position principles there are. Often, words can be assigned a
to which such competence involves kinds of knowledge speciic compositional meaning; for example, the meaning of slept is the
to literature. If our ability to make sense of the world is deined result of sleep combined with past tense. In other cases, how-
in terms of perceiving stories, organizing perceptions according ever, syntactically complex phrases seem to have a noncomposi-
to metaphorical ields, and so on, it may be possible to go on to tional meaning. For example, that kick the bucket is synonymous
identify interpretive moves that are speciic to the reading and with die does not follow naturally from the meanings of kick
appreciation of literary works. and the bucket (cf. idioms). In the history of language, complex
expressions often take on a noncompositional meaning over
Jonathan Culler
time (cf. grammaticalization.)
he composition principles are closely tied to a particular
semantic theory. Compositionality plays a central role in formal
Chomsky, Noam. 1965. Aspects of the heory of Syntax. Cambridge, semantics and truth conditional semantics of natural
MA: MIT Press. he classic theorization of a grammar as a description language, while other theories of language meaning have not
of linguistic competence. addressed compositionality (cf. construction grammars
Culler, Jonathan. 1975. Structuralist Poetics: Structuralism, Linguistics,
and cognitive grammar). he textbook by Irene Heim and
and the Study of Literature. London: Routledge. Assesses structuralist
Angelika Kratzer (1998) provides one inluential account. his
work and attempts to show that description of literary competence is a
fruitful program for literary studies.
account assumes that humans construct a syntactic representa-
Dane, Joseph. 1986. he defense of the incompetent reader. tion of a sentence, the logical form, which is then mapped
Comparative Literature 38.1: 5372. A critique of the analogy with lin- at the syntax-semantics interface to a meaning. his mapping
guistic competence and of an implicit elitism. is a recursive procedure (see recursion, iteration, and
Schauber, Ellen, and Ellen Spolsky. 1981. Stalking a generative poetics. metarepresentation).
New Literary History 12.3: 397413. Starting with literary competence, In addition to the meanings of a inite set of lexical items,
lays out a broader program. general composition rules determine the meaning of complex
. 1986. he Bounds of Interpretation: Linguistic heory and Literary phrases. Of the meanings of lexical items, only some aspects
Text. Stanford, CA: Stanford University Press. Develops the conception are important for composition. In Heim and Kratzers analysis,
of a generative poetics at greater length.
these aspects are captured by the semantic type. For example,
Turner, Mark. 1996. he Literary Mind. New York: Oxford University Press.
the meaning of proper names like Kai and Berlin have the type
An important instance of the cognitivist program in literary studies.
of individuals, and the meanings of both to like and to hate are
of the type of two-place functions. he parts of a complex phrase
COMPOSITIONALITY can be either lexical items or complex phrases themselves.
herefore, only one composition rule is required: a rule that
he principle of compositionality was irst formulated by the
combines the meanings of two subphrases into one. Heim and
German philosopher Gottlob Frege (1892) and is also referred
Kratzers analysis makes use of three composition rules: function
as the Frege principle. It states that the meaning of a complex
application, predicate modiication, and predicate abstraction.
expression is a function of the meaning of its parts. A mapping
Which composition rule is applied is determined by the types of
from expressions to meanings that satisies this principle is
the meanings of the two parts of the complex phrase. he simple
called compositional. Frege identiied compositionality as a basic
example Kai likes Berlin illustrates only function application. We
requirement for an account of the meaning of natural language
assume that the sentence consists of only three lexical items: Kai,
(see language, natural and symbolic), and all serious
likes, and Berlin, though a full analysis would contain at least
accounts of sentence meaning are compositional. herefore,
present tense as well. he lexical meanings of Kai and Berlin
current research seeks to ind more restrictive notions of compo-
are the individual concepts kai and berlin. he lexical entry for
sitionality that can be used to assign a degree of compositionality
likes is the function like which applied to one individual yields
to a semantic analysis, as discussed in this entry. he question
another function that, when applied to another individual, yields
of compositionality has also been asked for nonlinguistic com-
a sentence meaning. he logical form of the sentence shown
munication systems among humans and other species, which I
in (1) determines the order in which like is composed with its
mention towards the end.
arguments.
For a semantics of natural language, compositionality is
a basic requirement because humans can generate ininitely
many sentences (see discrete infinity) and associate them
with one from an ininite set of meanings. Since human memory (1) like(berlin)(kai)
is a inite resource, there can only be a inite set of memorized
lexical meanings (see lexical semantics). It follows that Kai like(berlin)
natural language must contain nonlexical expressions and that
likes Berlin
the meaning of such nonlexical expressions is determined by a
188
Compositionality Computational Linguistics
While necessarily abstract, the analysis captures two important COMPUTATIONAL LINGUISTICS
aspects: 1) the commonality between the meaning of the sen-
Computational linguists develop working models of various
tence and that of structurally similar sentences such, as Jan hates
aspects of languages in the form of computer programs. hese
the capital of Germany, and 2) the incompleteness of examples
models fall under three main headings: analysis, generation, and
like *Kai likes.
learning. Analysis models take in (usually typewritten) texts and
Recent work has pointed out a need to develop a stricter for-
igure out the details of their linguistic structure, possibly pro-
mal notion of compositionality. One motivation is the following
ducing a meaning representation. Starting from an abstract rep-
theorem of Wlodek Zadrozny (1994): If there is a function that
resentation of a meaning, generation models compose text (e.g.,
assigns a meaning to each complete expression of a language,
a sentence) expressing that meaning in a particular language.
a compositional meaning function can also be given. his result
Some systems combine analysis and generation with other tasks.
relies on an extension of function beyond its natural domain.
For example, a database enquiry system analyzes queries in order
For example, we might construct a compositional semantics for
to igure out what information is sought, retrieves the requested
the idiom kick the bucket in the following way: For one, stipu-
information from a database, and uses a generation system to
late that the bucket has in addition to its ordinary meaning also
express that information in natural-language output. Machine
the special symbol X as its meaning. Secondly, deine the mean-
translation systems analyze input in one language and generate
ing of kick applied to X as the meaning of die, thereby composi-
corresponding expressions in a diferent language. Most systems
tionally deining the meaning of kick the bucket. However, this
rely on grammar rules and resources such as text corpora and
analysis strikes most researchers as less desirable than a formally
dictionaries. Machine learning researchers build models that
noncompositional one. For this reason, current research tries
learn the relevant information from training examples to avoid
to formulate notions of compositionality stricter than Freges
hand-crafted rules.
(Kazmi and Pelletier 1998; Szab 2000). In particular, Ali Kazmi
and Francis J. Pelletier suggest restricting the use of functions as
meanings, but it is still an open question how exactly to do this. Grammatical and Lexical Analysis
Looking beyond human language, compositionality has Computational analysis of linguistic input using a parsing
emerged as an important property to classify communication algorithm quickly reveals the ambiguity of syntax, due in
systems. Tim Horton (2001) investigates the compositionality part to the fact that many words have several word classes.
of music (see music, language and). Even more interesting Parsing is greatly assisted if the lexical category can be resolved
is the case of animal communication and human evolution by looking at each words neighbors, a task performed by a
(Bickerton 1990). Elizabeth Spelke (2003) proposes that compo- part-of-speech tagging (i.e., labeling) program. he two main
sitional semantics is crucial for human intelligence. She argues approaches are i) rule-based versus ii) probabilistic. he rule-
that humans and higher animals possess a similar ability to form based approach exploits rules such as the can precede a noun or
basic concepts. Only humans, however, via the compositional adjective but never a verb to rule out contextually inappropriate
semantics of language have the ability to combine these basic part-of-speech tags. hese rules must be written and tested by an
concepts into an ininite array of derived meanings. expert and need to be debugged by trial and error since no one
can discover all the correct rules immediately. he probabilistic
Uli Sauerland
approach determines the most likely sequence of tags, calculated
using probability theory according to the frequency with which
one tag follows another in a training corpus. For example, Daniel
Bickerton, Derek. 1990. Language and Species. Chicago: University of Jurafsky and James H. Martin (2000, 305) estimate the probabil-
Chicago Press. ity that race is a noun as P = 0.000007 if the previous word is to; it
Fodor, Jerry, and Ernest Lepore. 2002. he Compositionality Papers. is more likely to be a verb (P = 0.00001) in that context.
Both approaches face diiculties with new words and new
Frege, Gottlob. 1892. ber Sinn und Bedeutung. Zeitschrift fr
uses of old words. Web pages, for example, contain so many
Philosophie und philosophische Kritik, NF 100: 2550.
names, new technical terms (especially new compounds), and
Heim, Irene, and Angelika Kratzer. 1998. Semantics in Generative
Grammar. Oxford: Blackwell. misspellings that about 15 percent of words are not listed in
Horton, Tim. 2001. he compositionality of tonal structures: A gen- dictionaries. In such cases, morphological analysis of word
erative approach to the notion of musical meaning. Music Scienti structure (perhaps using probabilistic methods) may help. In
5.2: 13156. agglutinative languages (such as Finnish) and in languages with
Kazmi, Ali, and Francis J. Pelletier. 1998. Is compositionality formally complex morphological patterns (such as Arabic), it is almost
vacuous? Linguistics and Philosophy 23: 62933. essential. No dictionary contains the word Shamoization
Partee, Barbara. 2006. Compositionality in Formal Semantics. (coined for this article), but we can infer the stem Shamo and
Oxford: Blackwell. be fairly conident that it is a proper noun, because -ization com-
Spelke, Elizabeth. 2003. What makes us smart? Core knowledge and
bines with nouns and it begins with a capital letter. With pro-
natural language. In Language in Mind, ed. Dedre Gentner and Susan
ductive morphemes like -ability in English, decomposition of
Goldin-Meadow, 279311. Cambridge, MA: MIT Press.
Szab, Zoltn Gendler. 2000. Compositionality as supervenience.
words into stems and aixes can reduce the dictionary size and
Linguistics and Philosophy 23: 475505. may also help us deal with new words (crushability, etc).
Zadrozny, Wlodek. 1994. From compositional to systematic semantics. he commonest methods for morphological analysis use
Linguistics and Philosophy 17: 32942. inite-state automata, corresponding to the least powerful kind
189
Computational Linguistics
of grammar in the Chomsky hierarchy. Despite their limitations, pairs) than to develop translation rules and bilingual dictionar-
inite-state automata are eicient and adequate for lexical pre- ies for all 210 pairs. One technical problem with this approach
processing, including decomposition of words into morphemes is that logical formulae are not unique representations of mean-
(e.g., geese goose + Nplu) and normalization of spelling (e.g., ing: for example, if A then B is equivalent to not (A and not
driver [drive/V + er]Nsing). For text-to-speech conversion, B). But an overseas booking clerk might be confused if your
interpretation of abbreviations and symbols may be necessary, statement If meals arent served in economy class, I want a irst
too, for example, Mr. Mister, 4.36 four pounds thirty-six class ticket were translated as I dont want no meals served
(not pound four point three six). in economy class and a irst class ticket, even though this is
logically correct. Consequently, real machine translation sys-
Semantic Analysis tems combine transfer methods, which map structures of one
In real-world applications of computational linguistics, syntac- language to the other, direct methods that use word and phrase
tic and morphological analysis are merely a means to an end. correspondences with as little linguistic manipulation as pos-
In database enquiry systems or machine translation, we may sible, and some statistics to help choose the most likely ways of
compute a representation of the meaning of the input. One of expressing the output.
the diiculties facing the computational treatment of meaning Machine translation and information retrieval requires
is presented by groups of words with similar or related mean- the generation of linguistic output, a task with its own particu-
ings. For example, in Whats the irst class fare? and Whats lar challenges. When there are many equivalent ways of saying
the price of a irst class ticket?, fare and price have almost the the same thing (e.g., John drove the car, the car was driven by
same meaning (i.e., the answer would be found in the same entry John), the most appropriate variant must be chosen, observing
in the ticket-suppliers database). In order to recognize lexical pragmatic conventions, such as putting given information
relations, many systems use computationally implemented before new, the conventional order of words (big, red bus,
thesauruses, such as WordNet, Princeton Universitys lexical not red, big bus), and the time sequence they suggest: he
database for English. accused broke his leg and fell out of the window does not mean
To represent sentence meanings, computational linguis- the same as the accused fell out of the window and broke his
tics often employs formal logic. A question, for example, can be leg. Sentence generation often uses a slot-illing technique: he
translated into a logical proposition with some information miss- agent of an action is placed in subject position; the undergoer
ing. Experts in formal semantics may express the meaning of is the object, and so on. focus might prompt a particular sen-
I would like the cheapest light from Washington to Atlanta as tence pattern, for example, it was the policeman who broke his
the predicate calculus formula: leg. Outputs are also generated in dialogue systems and in text
A.light(A) & from(A,washington) & to(A,atlanta) &

summarization. Dialogue systems collect information from the
(1)
user and provide information that the user requires according
cheapest(A) & like(i,A)
to the accepted conventions of dialogue sequence. he dialogue
that is, here exists a light A and A is from Washington and A is may be managed via a script that successively prompts the user
to Atlanta and A is cheapest and I like A. In order to answer the for gobbets of information. his is akin to form illing, as when
question, an information retrieval system (e.g., a ticketing sys- purchasing products on the Internet. he order in which the
tem) could search its light information database to ind one with user gives the information may not matter so long as all required
the desired origin, destination, and so on. Formulae like (1) can ields are eventually illed in. To navigate its script, the system
be automatically converted to statements of a database query takes the lead in the conversation.
language, or used in programming languages such as Prolog In text summarization, documents are analyzed in order to
(Clocksin and Mellish 2003). extract the most important pieces of information according to
various criteria, such as discourse structure and word frequency.
Machine Translation his information is then used to generate a summary of the origi-
Conversion from natural languages to logical formulae and vice nal, to a required length. Often, the summary simply consists of
versa suggests one method of machine translation: the most relevant extracts of the original.
(2) Language 1 input Logical representation of its meaning

(same for all languages) Output in language 2.
Probabilistic Methods
Computational linguists employ a wide range of probabilistic
In this scheme, logic is used as an interlingua. Rather than going methods that are helpful in various problems, especially sentence
directly from one language to the other, an interlingua is attrac- and word-sense disambiguation. For example, the girl saw the
tive because diferent languages can express similar ideas in dog with the telescope has at least two structures and meanings,
quite diferent ways. For example, the English verb like trans- depending on whether the girl or the dog has the telescope. Both
lates in Japanese as suki desu an adjective + to be, is likeable. these structures and meanings are legitimate, but in real-world
In Irish, I can X is expressed X is possible for me, using an applications such as machine translation, we may need to deter-
adjective. Using an intermediate meaning representation might mine which structure and meaning is intended. Parsers quickly
overcome such grammatical diferences between languages. reveal that average-length sentences have many possible struc-
Also, to translate between a large number of diferent languages tures, some quite implausible and unwanted. It is impractical to
(e.g., the 21 oicial languages of the European Union), it seems model a speakers world knowledge, such as the fact that dogs
simpler to translate them all into an interlingua (21 language cannot have telescopes. But it is feasible to use the statistics of
190
Concepts
word combinations, such as the fact that telescope occurs with see Objects and events (from household items to emotions to gender
more often than dog, to select more likely analyses. to democracy), although unique, are acted toward as members
Probabilities can also help with word-sense disambigua- of classes. Without this ability to categorize, it would be impos-
tion: For example, in she joined the club, it is not hard to work sible to learn from experience. Since at least the nineteenth
out that club is more likely to be association of persons than century, it has been common to refer to the mental or cogni-
heavy staf of wood or suit of cards, simply on the basis of the tive aspect of categories as concepts. Philosophy, psychology,
frequency of the collocation join club. Consequently, lexical computer science, and linguistics have all made contributions
semantic analysis often combines probabilistic methods and to conceptual theory and research. At present, there are seven
symbolic resources such as thesauruses. major views of the nature of concepts that form the basis for
inquiry and debate: 1) the classical view, 2) the prototype and
Learning graded structure account, 3) the theory theory, 4) neoclassi-
In order to determine probabilities of rules, word senses, and so cal combination models, 5) connectionist computer models,
on, a system must be trained on a corpus of language data, in 6) conceptual atomism, and 7) nonrepresentational ecological
efect learning them. We would like computers to do more of the approaches.
hard work of inding the best grammar for a language. Consider
the pairs of rules that, according to X-BAR THEORY, deine the The Classical View
structure of noun phrases in all languages: he classical view is the approach to concepts derived from the
history of Western philosophy. When humans begin to look at
(3) a) NP Det N1 (as in English) or b) NP N1 Det (as in
their experience by means of reason, questions about the reli-
Norwegian)
ability of the senses and the bases for knowledge arise, as do
(4) a) N1 Adj N1 (as in English) or b) N1 N1 Adj (as in French) more speciic questions about how categories can have gen-
erality (called the problem of universals), how words can have
(5) a) N1 PP N (as in German or Japanese) or b) N1 N PP (as
meaning, and how concepts in the mind can relate to categories
in English)
in the world. he Greeks and most Western philosophers ever
Given these predetermined possible rules, a language learner since have agreed that experience of particulars, as it comes
(computer or human) only needs to count the number of times moment by moment through the senses, is unreliable; therefore,
each rule is applicable. In English, rule (4a) will be applicable only stable, abstract, logical categories can function as objects
every time an adjective precedes a noun, whereas in French, rule of knowledge and objects of reference for the meaning of words.
(4b) will be applicable where adjectives follow nouns. By count- To fulill these functions: a) conceptual categories had to be
ing the frequencies with which these rules apply, a learner can exact, not vague (i.e., have clearly deined boundaries), and b)
soon work out that (4a) is more suitable for English than (4b) and their members had to have attributes in common, which were
vice versa for French. Probabilistic methods are now so common the necessary and sufficient conditions for member-
in computational linguistics that a detailed review is impossible ship in the category. It follows that c) all members of the con-
here. One framework for computation and learning, however, ceptual category were equally good with regard to membership;
connectionism, is much used in some areas and has attracted either they had the necessary common features or they didnt.
considerable interest from psycholinguists. Categories were thus seen as logical sets. It is on this basis that
conceptual categories could be the basis for logical inferences
John Coleman
as in the familiar All men are mortal; Socrates is a man; Socrates
is mortal (see verbal reasoning). his is also the basis of the
way in which words are deined by genus and diferentia in our
Clocksin, William F., and Christopher S. Mellish. 2003. Programming in dictionaries.
Prolog: Using the ISO Standard. Berlin: Springer. An authoritative yet In psychology, the irst body of research on concept learning
readable textbook. mirrored the philosophers view of conceptual categories. Led by
Coleman, John. 2005. Introducing Speech and Language Processing.
the work of Jerome Bruner and his associates (Bruner, Goodnow,
Cambridge: Cambridge University Press. An elementary introduction
and Austin 1956), subjects were asked to learn categories that
to the ield, aimed at readers with a less technical background (espe-
cially linguists).
were logical sets deined by explicit attributes, such as red and
Jurafsky, Daniel, and James H. Martin. 2000. Speech and Language square, combined by logical rules, such as and. heoretical inter-
Processing. Upper Saddle River, NJ: Prentice Hall. A highly respected est was focused on how subjects learned the attributes that were
and compendious textbook. relevant and the rules that combined them. In developmental
Manning, Christopher D., and Hinrich Schtze. 1999. Foundations of psychology, the theories of Jean Piaget and Lev Vygotsky were
Statistical Natural Language Processing. Cambridge, MA: MIT Press. combined with the concept-learning paradigm to study how
Enormous and comprehensive. childrens ill-structured, often thematic, concepts developed
into the logical adult mode. Artiicial stimuli were typically used
in research at all levels, structured into microworlds in which
CONCEPTS
the prevailing beliefs about the nature of categories were already
Concepts are the central constructs in most modern theories of built in.
the mind. Humans (and arguably other organisms) are seen as liv- In linguistics, most mainstream twentieth-century work
ing in a conceptually categorized world (see categorization). in phonology, semantics, and syntax rested on the
191
Concepts
assumptions of the classical view. phonemes were analyzed as meaningful, or interesting. he classical view of concepts cannot
sets of universal, abstract, binary features (Chomsky and Halle deal with any of this.
1968). word meaning, the province of semantics, was like- he prototype view has spread beyond psychology to many
wise represented by a componential analysis of features (see ields, including linguistics and narratology. Gradients of
feature analysis); for example, bachelor was rendered as the exemplariness are ubiquitous in linguistic phenomena, even
features +human, +male, +adult, +never married (Katz and Postal in phonology where actual speech is less clear-cut than would
1964). A complex concept such as bald bachelor was considered appear in an abstract componential analysis. In semantic and
the unproblematic intersection of the features of bachelor with syntactic analyses (particularly in cognitive grammar and
those of bald. Synonymy, contradiction, and other relational the understanding of metaphor), prototype efects, in addition
aspects of word meaning were accounted for in a similar fashion. to providing speciic case studies, are often used as evidence that
Syntax was analyzed by formal systems such as transforma- formal analysis is insuicient of itself and that world knowledge
tional grammar (Chomsky 1965) that also relied on decom- must be part of ones theory (Lakof 1987a, 1987b; Langacker
position into features (see Taylor 2003). Such an understanding 1990; Taylor 2003).
of language was adopted with enthusiasm by computer science
because meaning could be divorced from world knowledge and Theories
readily represented by the substitutable strings of symbols on he theories approach to concepts takes advantage of peoples
which computers work. intuitions that life activities and the concepts that map them take
place in a context larger than is ofered by either formal descrip-
Prototypes and Graded Structure tion or laboratory experiments. he basic claim is that concepts
Consider the color red: Is red hair as good an example of your get their meaning through mental theories. here are actually
idea or image of red as a red ire engine? Is a dentists chair as two groups of theory theorists: cognitivist-oriented cognitive
good an example of chair as a dining room chair? Are you imme- psychologists, who primarily address categorization issues, and
diately sure how to classify and name every color and object you developmental psychologists of the theory theory school, who
see? From its inception as a discipline separate from philosophy, address conceptual change. he irst group (Medin 1989; Medin
psychology has investigated types of learning and behavior that and Wattenmaker 1987; Murphy and Medin 1985) has used the
show graded efects. For example, Ivan Pavlovs dogs produced idea of theories primarily as criticism of previous categorization
decreasing amounts of saliva as tones grew farther from the tone research. hese theorists point out that previous accounts of
originally combined with meat powder. his is called stimulus concepts cannot properly deine or explain either attributes or
generalization. Note how diferent it is from the classical view of similarity and that previous experiments on conceptual catego-
conceptual categories. ries are all subject to context efects; for example, judgments of
he irst programmatic, empirically based challenge to the the prototypicality of animals changes if a zoo context is speci-
classical view came from Eleanor Roschs work on prototypes ied. hey do not, however, themselves give an account of attri-
and graded structure (Rosch 1978, 1999; Rosch and Lloyd 1978). butes or similarity, nor do they specify what a theory is or give
A wide variety of conceptual categories were shown to have any concrete examples of a theory deining a concept.
gradients of membership; that is, subjects can easily, rapidly, In contrast, the theory-theory school of developmental psy-
and meaningfully rate how well a particular item its their idea chology (see Gopnik and Meltzof 1997) explicitly deines theory
or image of its category. his is true for perceptual, semantic, as analogous to scientiic theories, much like Kuhnian paradigms,
social, biological, and formal concepts. In contrast, subjects can- and argues that cognitive development should be viewed as the
not list criterial attributes for most categories (Rosch and Mervis successive replacement of one paradigm theory held by the child
1975). More importantly, the psychological import of gradients by another. Interest in concepts tends to be from the point of
of membership was demonstrated by their efect in a series of view of change in the childs (rather than the researchers) theory
experiments on virtually every major method of study and mea- of what a concept is. When speciic concepts are studied (such
surement used in psychological research: learning, association, as biological types Carey 1985; Keil 1979), the thrust is to show
speed of processing, expectation, inference, probability judg- them as parts of larger theoretical units.
ments, and judgments of similarity. Rosch suggested a model In linguistics, the discussion tends to be formulated in terms
in which categories formed around perceptually, imaginally, or of the relation of word meaning to general knowledge in a vari-
conceptually salient stimuli, which she called prototypes, then, ety of speciic contexts. Such contexts have been characterized as
by stimulus generalization, spread to other similar stimuli with- schemas, frames, scripts, image schemas, domains, and
out necessarily any analyzable criterial attributes, formalizable perspectivization. For example, Ronald Langacker (1990) talks
deinitions, or deinite boundaries. It is the prototype that was of the seven-day week as the semantic domain within which
claimed to mentally represent the category for most purposes. A Monday is understood, and George Lakof (1987a) points to ive
profusion of factors have been found to create prototypes: phys- frames needed to explain our use of the word mother (genetic,
iological saliency, statistical frequencies (including central ten- birth, nurturance, genealogical, and marital). Computer science
dencies and family resemblances), social structure, formal has worked on similar formulations in the design of the type of
structure, extremes of attribute dimensions, cultural ideals, program known as story understanders. Such work tends to be
causal beliefs, and particular stimuli (exemplar theories classiied under theories despite its lack of general explanatory
are based on these) that are the irst learned, or most recently hypotheses because, lacking speciication of what is to count
encountered, or the most emotionally charged, vivid, concrete, as a theory, virtually any demonstration of the embedding of
192
Concepts
individual concepts in larger semantic complexes or in world Laurence and Margolis 1999). Jerry Fodors (1990, 1998) concep-
knowledge has been argued as support for the theories view a tual atomism attempts to sidestep such issues by arguing that
difuseness that has also been used as a critique of that view. the concept BIRD (Fodors notation) simply expresses the single
atomic property bird. he concept derives that meaning from its
Neoclassical Combination Models causal history (as in Kripke [1972] 1980 and Putnam 1975 see
Such approaches are called neoclassical because they incorpo- also essentialism and meaning). Concepts have no struc-
rate elements of the classical view, considering it a necessary ture and are not decomposable into any kind of properties, inter-
basis, but add elements from other approaches. Psychology nal or external. his view has not been adopted by psychologists
and linguistics treat the issue diferently. Psychology ofers dual who presume (despite Fodors denial) that atomic concepts
models. hese typically begin with criticisms of prototype theory, would need to be innate (how could they be learned?) and that
some of which seem based on misinterpretation (e.g., taking it seems highly unreasonable to propose biologically innate con-
graded structure as a probability distribution [Smith and Medin cepts for everything in the universe, including televisions, peni-
1981] or limiting prototypes to only one type of prototype), but cillin, and so on. More broadly, conceptual atomism has not so
most seem to be based on the philosophical intuition that the far appeared generative of psychological empirical research.
real meaning of a concept, that to which the concept refers, must
be the identiiable necessary attributes of a classical deinition. Nonrepresentational Ecological Approaches
Two main types of evidence are ofered for this account. One is his is the approach that deals with the use of concepts. Ludwig
that prototype and graded structure efects can be found for con- Wittgenstein (1953) is much cited in concept research for fam-
ceptual categories that have a formal classical deinition, such ily resemblances, but that observation was a small fragment of
as odd number (Armstrong, Gleitman, and Gleitman 1983), the his argument that concepts and words should be understood
other that prototypes do not form componential combinations not referentially but as embedded interactive parts of forms of
as do the elements of classical deinitions (e.g., a good example life. Since forms of life show up in a succession of situations,
of pet ish is neither a prototypical pet nor a prototypical ish one way to approach this issue is to study concepts in their natu-
[Osherson and Smith 1981]). Both indings are taken to indicate ral situational environments (for example, Cantor, Mischel, and
that prototypes are something other than and irrelevant to a con- Schwartz 1982). Another way is to analyze concepts as mental
cepts meaning. he solution is a dual model in which prototypes simulators of situations (Kahneman, Slovic, and Tversky 1982),
are assigned the function of rapid recognition of conceptual and another is to show the contextual sensitivity of concepts
referents, whereas the true meaning is provided by a classically (Barsalou 1987; see also ad hoc categories). Yet a diferent
deined core (Osherson and Smith 1981; Smith, Shoben, and point of entry into conceptual use may be to ask how the basic
Rips 1974). level of abstraction, the default level at which conceptual cat-
Linguistic models are more complex. All wish to include struc- egories appear to be named and understood (chair rather than
tured real-world knowledge in some form, along with at least a furniture or oice chair), maps onto the forms of human activities
minimum of necessary deining classical attributes. Some forgo (Rosch et al. 1976). General theoretical ecological accounts are
a complete characterization of the concept to concentrate on provided in Rosch (1999) and Gabora, Rosch, and Aarts (2007).
grammatically relevant structure (Pinker 1989). Perhaps the most
Conclusion
complete attempt to cover all bases is Ray Jackendofs (1983)
Concepts occur in use only in particular moments in particular
account of the conditions needed to specify word meaning; these
situations. From the perspective of their use, one can see the
include partial deinitions (red must at least include color), gra-
aspects of mental and interpersonal activities in which each of the
dients of relevant attributes (such as hue for colors), and sensory
seven accounts of concepts ofers insight. For example, one might
speciications, such as a model of what the referent looks like.
be actively seeking to ind the attributes for a classical deinition
by means of thoughts that use mental prototypes. his would be
Connectionist Computer Models
done against a background of loosely organized frames, scripts,
he previous accounts of concepts are all, to greater or lesser
and so on the sort of knowledge structures toward which the
extent, based on the idea of mental representations and are
theories view points. he concepts being used could be atomic
formulated at the symbolic level of mental functioning, the the-
and inherent to that moment, given that recognition of items had
ories view being the most top-down. In sharp contrast, connec-
already been performed and was now inherited from previous
tionist semantics seeks to derive apparently symbolic functions
moments. In short, each of the views maps a particular intuition
from subsymbolic mechanistic neuron-like processes, such
that humans seem to hold about concepts: he classical view
as weighted connections among units, the strengths of which
ofers essences of a sort; the prototype view highlights concrete,
are gradually adjusted on the basis of feedback to the program
holistic mental representations; the theories view points toward
(Rogers and McClelland 2004). A question for this approach,
a background of conceptually structured world knowledge; con-
as for much present psychology, is the extent to which indings
nectionism points to a subsymbolic neuronal substrate; atomism
about biological (or pseudobiological) substrates are to preclude
brings in history (and simplicity); combination models attempt
feedback and explanation at the higher symbolic levels.
to bring it all together; and the ecological approach attempts to
bring it all together in terms of the ways in which concepts par-
Conceptual Atomism
ticipate in real-world uses.
All of the previous approaches are subject to philosophical criti-
cisms, even the classical views (for reviews, see Fodor 1998 and Eleanor Rosch
193
Concepts Conceptual Blending
WORKS CITED AND SUGGESTIONS FOR FURTHER READING . 1978. Principles of categorization. In Cognition and
Categorization, ed. Eleanor Rosch and Barbara Lloyd, 2748. Hillsdale,
Armstrong, Sharon, Lila Gleitman, and Henry Gleitman. 1983. What
some concepts might not be. Cognition 13: 263308.
. 1996. he environment of minds: Towards a noetic and hedonic
Barsalou, Lawrence. 1987. he instability of graded structure: Implications
ecology. In Cognitive Ecology (Handbook of Perception and Cognition).
for the nature of concepts. In Neisser 1987, 10140.
2d ed. Ed. Morton Friedman and Edward Carterette, 524. San Diego,
Bruner, Jerome, Jacqueline Goodnow, and George Austin. 1956. A Study
CA: Academic Press.
of hinking. New York: Wiley.
. 1999. Reclaiming concepts. Journal of Consciousness Studies
Cantor, Nancy, Walter Mischel, and J. C. Schwartz. 1982. A prototype
6.11/12: 6177.
analysis of psychological situations. Cognitive Psychology 14: 4577
Rosch, Eleanor, and Barbara Lloyd, eds. 1978. Cognition and
Carey, Susan. 1985. Conceptual Change in Childhood. Cambridge,
Categorization. Hillsdale, NJ: Erlbaum.
MA: MIT Press.
Rosch, Eleanor, and Carolyn Mervis. 1975. Family resemblances: Studies
Chomsky, Noam. 1965. Aspects of a heory of Syntax. Cambridge, MA: MIT
in the internal structure of categories. Cognitive Psychology
Press.
7: 573605.
Chomsky, Noam, and Morris Halle.1968. he Sound Pattern of English.
Rosch, Eleanor, Carolyn Mervis, Wayne Gray, David Johnson, and
New York: Harper and Row.
Penelope Boyes-Braem. 1976. Basic objects in natural categories.
Fodor, Jerry. 1990. Information and representation. In
Cognitive Psychology 8: 382439.
Information, Language, and Cognition, ed. Philip Hanson, 17590.
Smith, Edward, and Douglas Medin. 1981. Categories and Concepts.
Vancouver: University of British Columbia Press.
Cambridge: Harvard University Press.
. 1998. Concepts: Where Cognitive Science Went Wrong. New
Smith, Edward, Edward Shoben, and Lance Rips. 1974. Structure and
process in semantic memory: A featural model for semantic deci-
Gabora, Liane, Eleanor Rosch, and Diederik Aerts. 2007. Toward an
sions. Psychological Review 81: 21441.
Ecological heory of Concepts. Ecological Psychology 20.1: 84116.
Taylor, John. 2003. Linguistic Categorization. Oxford: Oxford University
Gopnik, Allison, and Andrew Meltzof. 1997. Words, houghts, and
Press.
heories. Cambridge, MA: MIT Press.
Wittgenstein, Ludwig. 1953. Philosophical Investigations. New
Jackendof, Ray. 1983. Semantics and Cognition. Cambridge MA: MIT
York: Macmillan.
Press.
Kahneman, Daniel, Paul Slovic, and Amos Tversky, eds. 1982. Judgment
under Uncertainty: Heuristics and Biases. New York: Cambridge
CONCEPTUAL BLENDING
University Press.
Katz, Jerrold, and Paul Postal. 1964. An Integrated heory of Linguistic Conceptual blending is a basic mental operation that has been
Descriptions. Cambridge, MA: MIT Press. explored as a central mechanism indispensable to grammar and
Keil, Frank. 1979. Semantic and Conceptual Development: An Ontological language. Conceptual blending leads to new meaning, global
Perspective. Cambridge: Harvard University Press.
insight, and conceptual compressions useful for memory and
Kripke, Saul. [1972] 1980. Naming and Necessity. Cambridge: Harvard
manipulation of otherwise difuse ranges of meaning. It plays a
University Press.
fundamental role in the construction of meaning in everyday life,
Lakof, George. 1987a. Cognitive models and prototype theory. In
Neisser 1987, 63100. in the arts and mathematics, in the natural sciences, and in the
. 1987b. Women, Fire, and Dangerous hings: What Categories social and behavioral sciences. he essence of the operation of
Reveal about the Mind. Chicago: University of Chicago Press. conceptual blending is to construct a partial match between two
Langacker, Ronald. 1990. Concept, Image, and Symbol: he Cognitive inputs and to project (see projection [blending theory])
Basis of Grammar. Berlin: Mouton de Gruyter. selectively from those inputs into a blended mental space,
Laurence, Stephen, and Eric Margolis. 1999. Concepts and cognitive which dynamically develops emergent structure.
science. In Concepts: Core readings, ed. Eric Margolis and Stephen It has been suggested that the capacity for complex concep-
Laurence, 381. Cambridge, MA: MIT Press. tual blending (double-scope integration) is the crucial capacity
Medin, Douglas. 1989. Concepts and conceptual structure. American
needed for language and for higher-order cognition of the sort
Psychologist 44: 146981.
that characterizes cognitively modern human beings.
Medin, Douglas, and William Wattenmaker. 1987. Cognitive cohesive-
ness, theories, and cognitive archeology. In Neisser 1987, 2562.
A systematic study of conceptual blending was initiated in
Murphy, George, and Douglas Medin. 1985. he role of theories in con- 1993 by Gilles Fauconnier and Mark Turner, who discovered the
ceptual coherence. Psychological Review 92: 289316. structural uniformity and wide applications. he central intro-
Neisser, Ulric, ed. 1987. Concepts and Conceptual Development: Ecological ductory statement of the ield is Fauconnier and Turner 2002.
and Intellectual Factors in Categorization. Cambridge: Cambridge (See also Turner 1996 and 2001; Fauconnier 1997; Fauconnier
University Press. and Turner 1996 and 1998; and Turner and Fauconnier 1999.)
Osherson, Daniel, and Edward Smith. 1981. On the adequacy of proto- he blending Web site at http://blending.stanford.edu presents
type theory as a theory of concepts. Cognition 9: 3558. an extensive body of work done since then by many research-
Pinker, Steven. 1989. Learnability and Cognition: he Acquisition of ers in various ields on the theory of conceptual blending and its
Argument Structure. Cambridge, MA: MIT Press.
empirical manifestations in language and grammar, mathemat-
Putnam, Hillary. 1975. Mind, Language and Reality. New York: Cambridge
ics, art, natural and social science, literature, social pragmatics,
University Press.
Rogers, Timothy, and James McClelland. 2004. Semantic Cognition: A
and music. Additional research considers mathematical and
Parallel Distributed Processing Approach. Cambridge, MA: MIT Press. computational modeling of conceptual blending and experi-
Rosch, Eleanor. 1973. Natural categories. Cognitive Psychology mental investigation in the cognitive neuroscience of neural and
4: 32850. cognitive processes.
194
Conceptual Blending
Some Simple Examples Generic space

RITUAL OF THE NEWBORN BABY. In a European ritual, the new-
born baby is carried up the stairs of the parents house as part of
a public event. he ritual is meant, symbolically, to promote the
childs chances of rising in life. One input is the ordinary action
of carrying a baby up the stairs. he other input is the schematic
space of life, already structured so that living a life is metaphori-
cally moving along a path, such that good fortune is up and mis- cross-space mapping
fortune is down. In a partial match between these inputs, the
path up the stairs corresponds to the course of life, the baby is the Input space 1 Input space 2
person who will live this life, the manner of motion up the stairs selective projection
corresponds to how the person goes through life, and so on. In
the symbolic ritual, the two inputs are blended, so that the ascent
of the stairs is the course of life, an easy ascent is an easy rise in
life for the person that the baby will become, and stumbling or
falling might take on extraordinary signiicance. Blended space
Figure 1.
BOAT RACE. A famous example of blending is the boat race or
regatta. A modern catamaran is sailing from San Francisco to
Boston in 1993, trying to go faster than a clipper that sailed the
same course in 1853. A sailing magazine reports: he boat race example is a simple case of blending. Two
inputs share structure. hey get linked by a cross-space map-
As we went to press, Rich Wilson and Bill Biewenga were barely
ping and projected selectively to a blended space. he projection
maintaining a 4.5 day lead over the ghost of the clipper Northern
allows emergent structure to develop on the basis of composi-
Light, whose record run from San Francisco to Boston theyre try-
tion, pattern completion (based on background models), and
ing to beat. In 1853, the clipper made the passage in 76 days, 8
elaboration (running the blend).
hours. (Great American II 1993, 100)
Informally, there are two distinct events in this story, the run CLINTON AND ROOSEVELT. he type of conceptual blend just dis-
by the clipper in 1853 and the run by the catamaran in 1993 on cussed (technically called a mirror network because the same
(approximately) the same course. In the magazine quotation, the frame organizes both inputs) is very general. For example, a
two runs are merged into a single event, a race between the cat- political comment on Bill Clintons presidency after he had been
amaran and the clippers ghost. he two distinct events corre- in oice a few months might have been:
spond to two input mental spaces, which relect salient aspects
By this point, Roosevelt was far ahead of Clinton.
of each event: the voyage, the departure and arrival points, the
period and time of travel, the boat, and its positions at various he two inputs are Roosevelts and Clintons presidencies.
times. he two events share a more schematic frame of sail- hey are mapped onto each other in a natural way: Starting
ing from San Francisco to Boston; this is a generic space, which points, midpoints, and so on are matched. In the blend, Roosevelt
connects them. Blending consists in partially matching the two and Clinton are brought together within the same time frame so
inputs and projecting selectively from these two input spaces into that they are competing against each other. Blends of this sort are
a fourth mental space, the blended space, as shown in Figure 1. routinely elaborated for reasoning purposes in political analysis.
In the blended space, we have two boats on the same course
that left the starting point, San Francisco, on the same day. Computer Interfaces
Pattern completion allows us to construe this situation as a race A nice example of conceptual blending in action and design is
(by importing the familiar background frame of racing and the the desktop interface, in which the computer user moves icons
emotions that go with it). his construal is emergent in the blend. around on a simulated desktop, gives alphanumeric com-
he motion of the boats is structurally constrained by the map- mands, and makes selections by pointing at options on menus.
pings. Language signals the blend explicitly in this case by using Users recruit from their knowledge of oice work, interpersonal
the expression ghost-ship. By running the blend imaginatively commands, pointing, and choosing from lists. All of these are
and dynamically by unfolding the race through time we have inputs to the imaginative invention of a blended scenario that
the relative positions of the boats and their dynamics. serves as the basis for integrated performance. Once this blend
Crucially, the blended space remains connected to the inputs is achieved, it delivers an amazing number of multiple bind-
by the mappings, so that real inferences can be computed in the ings across quite diferent elements, bindings that seem, in ret-
inputs from the imaginary situation in the blended space. For rospect, entirely obvious. A coniguration of continuous pixels
example, we can deduce that the catamaran is going faster overall on the screen is bound to the concept folder, no matter where
in 1993 than the clipper did in 1853, and, more precisely, we have that coniguration occurs on the screen. Folders have identities,
some idea (four and a half days) of their relative performances. which are preserved. he label at the bottom of the folder in one
We can also interpret the emotions of the catamaran crew in view of the desktop corresponds to a set of words in a menu in
terms of the familiar emotions linked to the frame of racing. another view. Pushing a button twice corresponds to opening.
195
Conceptual Blending
Pushing a button once when an arrow on the screen is superim- Emergent structure arises routinely, as in his surgeon is a lum-
posed on a folder corresponds to lifting into view. Of course, in berjack, which suggests that the surgeon is incompetent, though
the technological device that makes the blend possible, namely, incompetence is a feature of neither surgeon nor lumberjack.
the computer interface, there is no ordinary lifting, moving, or here are opposing pressures within an integration network
opening happening at all, only variations in the illumination of a to maximize topology matching, integration, unpacking of the
inite and arranged number of pixels on the screen. he blend is blend, Web connections, compression, and intentionality. More
not the screen; the blend is an imaginative mental creation that complex integration networks (multiple blends) allow multiple
lets us use the computer hardware and software efectively. In input spaces, and successive blending in which blends at one
the blend, there is lifting, moving, opening, and so on happen- level can be inputs at another.
ing, imported not from the technological device at hand, which
is only a medium, but from another input, namely, our mental Compression
conception of work we do on a real desktop. Blending is a remarkable tool of compression over vital relations
like time, space, causeefect, identity, and change. In the new-
The Network Model born ritual, time is compressed: An entire lifetime becomes, in
Conceptual blending is described and studied scientiically in the blend, the short time it takes to climb the stairs; in the desktop
terms of integration networks. In its most basic form, a con- interface, the complex sequence of events that move the mouse
ceptual integration network consists of four connected mental horizontally and cause an apparent vertical motion of the arrow
spaces: two partially matched input spaces, a generic space (and other objects) on the screen is compressed and integrated
constituted by structure common to the inputs, and the blended into a single action moving the arrow. his is a compression of
space. he blended space is constructed through selective space, causeefect, and change.
projection from the inputs, pattern completion, and dynamic
elaboration. he blend has emergent dynamics. It can be run
Language Science
he role of conceptual blending in language has been investi-
while its connections to the other spaces remain in place.
gated in many areas.
Neurobiologically, it has been suggested that elements in men-
tal spaces correspond to activated neural assemblies and that
FICTIVE MOTION. Languages have means of describing static
linking between elements corresponds to neurobiological bind-
scenes in terms of ictive motion:
ing (e.g., coactivation). On this view, mental spaces are built up,
interconnected, and blended in working memory by activat- he fence runs all the way down to the river.
ing structures available from long-term memory. Mental spaces his works by having an imaginary trajector move along the rele-
can be modiied dynamically as thought and discourse unfold. vant dimension of an object, in this case the fence or along some
Products of integration networks can become entrenched to imaginary path linking two objects. his is a remarkable mode
constitute grammatical constructions, basic metaphors, new of expression: It conveys motion and immobility at the same time.
frames, and other elements of the conceptual repertoire. Objective immobility is expressed along with perceptual or con-
Four main types of integration networks have been distin- ceptual motion. his apparent contradiction is a consequence of
guished: simplex, mirror, single-scope, and double-scope. In conceptual blending, which allows several connected, but hetero-
simplexes, one input consists of a frame and the other consists of geneous, mental spaces to be maintained simultaneously within
speciic elements. A frame is a conventional and schematic orga- a single mental construction. An input space containing a static
nization of knowledge, such as buying gasoline. In mirrors, a scene of a fence and a river is blended with an input space that
common organizing frame is shared by all spaces in the network. contributes a moving trajector on a path with a reference point.
In single-scopes, the organizing frames of the inputs are diferent,
and the blend inherits only one of those frames. In double-scopes, COUNTERFACTUALS. Human thought depends heavily on the
essential frame and identity properties are brought in from both capacity for counterfactual thought, and counterfactuals are
inputs. Double-scope blending can resolve clashes between complex blends. Most of us can efortlessly understand state-
inputs that difer fundamentally in content and topology. his is ments like In France, Watergate would not have hurt Nixon. his
a powerful source of human creativity. counterfactual is intended to highlight some diferences between
he main types of networks just mentioned are actually the American and French cultural and political systems. It is a
prototypes along a continuum that anchors our intuitive every- blend that brings in aspects of the French system from one input
day notions about meaning to a uniied understanding of the and the Watergate scandal and President Richard Nixon from the
unconscious processes at work. Varieties of meaning traditionally other. In the blend, we have a Watergate-like situation in France.
considered unequal or even incommensurable categoriza- Running this blend delivers attitudes quite diferent from those
tions, analogies, counterfactuals, metaphors, rituals, logical in the American input, and so in the blend, the president is not
framing, grammatical constructions can all be situated on this harmed. Counterfactuals can blend frames and identities in pow-
continuum. Conceptual blending has been shown to operate in erful ways (If I were you ). Such blends have been shown to
the same way at the highest levels of scientiic, artistic, and literary play a major role for reasoning in the natural and social sciences.
thought and at the supposedly lower levels of elementary under-
standing and sentence meaning. Elaborate blending is at work in THE ORIGIN OF LANGUAGE. he central problem in the origins
supericially simple expressions like safe gun versus safe child, of language is that conceptual structure is vast relative to
guilty pleasures, cafeine headache, or money problem. expressive structure. he central problem of expression is that we
196
Conceptual Blending Conceptual Development and Change
and perhaps other mammals have a vast, open-ended number of Liddell, Scott K. 2003. Grammar, Gesture, and Meaning in American Sign
frames and provisional conceptual assemblies that we manipu- Language. Cambridge: Cambridge University Press.
late. Even if we had only one word per frame, the result would be Mandelblit, Nili. 2000a. Conceptual blending and the interpreta-
too many words to manage. Double-scope integration permits tion of relatives: A case study from Greek. Cognitive Linguistics
11.3/4: 197252.
us to use vocabulary and grammar for one frame or domain or
Mandelblit, Nili.. 2000b. he grammatical marking of conceptual inte-
conceptual assembly to say things about others. It brings a level
gration: From syntax to morphology. In Coulson and Oakley 2000,
of eiciency and generality that suddenly makes the challenging 197252.
mental logistics of expression tractable. he forms of language Nikiforidou, Kiki. 2005. Conceptual blending and the interpreta-
work not because we have managed to encode in them these tion of relatives: A case study from Greek. Cognitive Linguistics
vast and open-ended ranges of meaning but because they make 16.1: 169206.
it possible to prompt for high-level blends over conceptual arrays Sweetser, Eve. 1999. Compositionality and blending: Semantic
we already command. Neither the conceptual operations nor the composition in a cognitively realistic framework. In Cognitive
conceptual arrays are encoded, carried, contained, or otherwise Linguistics: Foundations, Scope, and Methodology, ed. heo Janssen and
captured by the forms of language. he forms need not and can- Gisela Redeker, 12962. Berlin and New York: Mouton de Gruyter.
not carry the full construal of the speciic situation but, instead, Tobin, Vera. 2006. Ways of reading Sherlock Holmes: he entrenchment
of discourse blends. In Dancygier 2006, 7390.
consist of prompts for thinking about situations in the appro-
Turner, Mark. 1996. he Literary Mind: he Origins of hought and
priate way to arrive at a construal. Blended spaces can have as
Language. New York: Oxford University Press.
projections grammatical and lexical forms that come from the . 19992007. he Blending Web site. Available online at: http://
input spaces. Accordingly, meaning that is special to the blend blending.stanford.edu
can be expressed through forms that are already available from . 2001. Cognitive Dimensions of Social Science: he Way We hink
the inputs. In virtue of double-scope blending, new or contextu- About Politics, Economics, Law, and Society. New York: Oxford
ally dependent meaning does not require new expressive forms. University Press.
Double-scope blending is, accordingly, the indispensable opera- Turner, Mark, and Gilles Fauconnier. 1999. A mechanism of creativity.
tion that makes cognitively modern human language possible. Poetics Today 20.3: 397418.
OTHER WORK. Other analyses of the role of conceptual blending in

CONCEPTUAL DEVELOPMENT AND CHANGE
language consider nominal compounds (Fauconnier and Turner
2002); relative clauses (Nikiforidou 2005); semantic extensions Most researchers allow for some kind of change in concepts
(Coulson 2001); sign languages (Liddell 2003); discourse con- over the course of development and as adults go from lay knowl-
structions (Tobin 2006); syntax and morphology (Mandelblit edge to expert knowledge. he main exception to this view would
2000b); polysemy (Fauconnier and Turner 2003); semantic be those who maintain that concepts have no discernible inter-
change and composition (Sweetser 1999); and many other areas. nal mental structure and, therefore, lack suicient substrate to
change (Fodor 1998). For present purposes, it is assumed that
Mark Turner
concepts do have internal structure that then can be used to
characterize change. For those more minimalist accounts of
structure, conceptual change must be understood as a proxy for
Coulson, Seana. 2001. Semantic Leaps: Frame-shifting and changes in how stable concepts are accessed, used, and mentally
Conceptual Blending in Meaning Construction. New York and manipulated.
Cambridge: Cambridge University Press he study of conceptual change has often been hampered
Coulson, Seana, and Oakley, Todd, eds. 2000. Cognitive Linguistics 11.3/4. by quite diferent senses of what is meant by the phrase, with
Special issue on conceptual blending.
the consequences that scholars who seem to be disagreeing are
Coulson, Seana, and Oakley, Todd. 2005. Journal of Pragmatics 37.10.
often talking past one another. It is therefore useful to consider
Special issue on conceptual blending.
Dancygier, Barbara, ed. 2006. Language and Literature 15.1. Special issue
several distinct senses of conceptual change as well as patterns
on conceptual blending. of developmental change that do not relect alterations in the
Fauconnier, Gilles. 1997. Mappings in hought and Language. concepts proper.
Cambridge: Cambridge University Press. he most minimal forms of conceptual change involve elabo-
Fauconnier, Gilles, and Mark Turner. 1996. Blending as a central pro- rations on concept structures in ways that do not cause changes
cess of grammar. In Conceptual Structure, Discourse, and Language, in any other concepts and that do not cause a restructuring of the
ed. Adele Goldberg, 11330. Stanford, CA: Center for the Study of concept in which the elaboration occurs. For example, a child
Language and Information (CSLI). might learn that chairs can be subdivided into kitchen chairs
. 1998. Conceptual integration networks. Cognitive Science
and living room chairs. Such a subdivision might not appreciably
22.2: 13387.
change the concept of the superordinate category of chair (see
. 2002. he Way We hink: Conceptual Blending and the Minds
categorization). One can also learn more details about mem-
Hidden Complexities. New York: Basic Books.
. 2003. Polysemy and conceptual blending. In Polysemy: Flexible
bers of a category by simply adding more features to all mem-
Patterns of Meaning in Mind and Language, ed. Brigitte Nerlich, bers. hus, a child might initially know that cars have wheels and
Vimala Herman, Zazie Todd, and David Clarke, 7994. Berlin and New carry people and only later add the additional features of having
York: Mouton de Gruyter. windshield wipers, brakes, and seatbelts. Concepts can therefore
Great American II. 1993. Latitude 38 (April): 100. go from relatively feature-sparse representations of members of
197
Conceptual Development and Change
a category to feature-rich representations. Some call this con- weight or heaviness that later splits into two concepts of weight
ceptual enrichment, arguing that it shouldnt really count as true as physical quantity and of density (Smith, Carey, and Wiser
conceptual change at all (Carey 1991). 1985). Conceptual diferentiation is diferent from the case of
Some changes are in the kinds of features that make up subdivision described earlier because it makes the original par-
concepts. Others are in the ways those features are mentally ent concept nonviable and, therefore, creates incommensurabil-
represented and used. For example, young children might ity between the diferentiated conceptual structures and their
favor perceptual features over functional ones (Nelson 1973; parent (Kuhn 1970; Carey 1988).
Tomikawa and Dodd 1980), as well as concrete features over Conceptual coalescences often occur in the same system
abstract ones (Werner and Kaplan 1963). Instances of a concept where diferentiations occur. hus, as children split apart one
category would then initially be picked out on the basis of one concept, they merge two other concepts together in a manner
type of feature (e.g., perceptual features of chairs) and later on that makes the original two incoherent. Young children may see
the basis of another kind of feature (e.g., functional features of solids and liquids as of one kind of stuf and air as something
chairs). A diferent approach would see changes not in the fea- entirely diferent. Later, however, they may see all three phases
ture types but rather in how those features are used to make deci- of matter as just variants on the same stuf (Smith, Solomon, and
sions about members of a category. For example, children have Carey 2005). Similarly, young children may see stars and the sun
been described as moving from holistic representations to more as very diferent kinds of things, only to realize later that they are
analytic ones (Vygotksy [1934] 1986; Kemler and Smith 1978). all of the same kind (Nussbaum 1979).
hus, they might initially use a broad, roughly equal weighting Deeper levels of explanation occur when a whole new realm
of all features that typically occur with members of the category of causal regularities get added to the theory. For example, a
and then later switch to a focus on a few critical deining or cen- child might initially understand the bodys functions in terms of
tral features. A child might initially identify an uncle as a friendly gross macroscopic events, such as the chewing of food and motor
adult about the age of ones parents and who is present around movement, and then later sense causal patterns at work at the
holidays and has close bonds with one or both parents. Later, microscopic level. Although such additions can occur without
that same child might focus exclusively on the features having to changing the high level of explanation, new insights at a lower
do with whether a person is a male blood relative of one parent level can often feed back and inluence the higher level.
(Keil 1989). In this characteristic-to-deining shift, feature sam- Finally, conceptual revolutions can involve a dramatic reor-
pling can change with development, even if feature types do not ganization of all the elements in a domain. For example, young
(Keil and Batterman 1984). children might only understand animals, plants, and people as
Changes in feature types or distributions often seem to hap- either psychological entities or mechanical physical ones (Carey
pen in ways that are related across concepts, leading to the idea 1985). hey would not understand plants and animals as living
that concepts often change in clusters or domains. his pattern things. At some point, however, in a manner analogous to con-
suggests that concepts might get their meanings not just from ceptual revolutions in the history of science (e.g., Kuhn 1970;
their constituent features but also from the ways in which they hagard 1992), a dramatic reorganization of an entire belief
relate to other concepts in the same domain. Experimentally, system (see meaning and belief) occurs and a new category
such efects have been shown in cases where shifts in feature of living things emerges (Chinn and Brewer 1993). It is unclear
usage, such as for uncle, are closely linked in time to shifts for how often such dramatic revolutions really occur (Inagaki and
other kinship terms (Keil 1989). Similarly, once a child learns how Hatano 2002; Keil 2003).
to extend one term in a domain to a new domain, such as the tex-
ture term rough to personality, the child is likely to extend at the Imposters
same time all other texture terms to personality, such as smooth A major problem in the study of conceptual change concerns
and slippery (Lehrer 1978; Keil 1986). hese semantic field cases where other patterns of cognitive developmental change
efects suggest that conceptual change occurs in a larger frame- may give an appearance of conceptual change when none is
work that then inluences all concepts within that framework. actually happening. Two common cases involve increasing
One view of concepts as parts of larger structures is known access and shifting relevance.
as the theory theory, in which conceptual change is understood Increasing access refers to cases where cognitive limitations
as part of a process of theory change (Carey 1985; Gopnik and having nothing to do with the concept per se limit its use (Rozin
Wellman 1994). hus, having a concept such as bird involves not 1976). hus, younger children might difer from older ones in
only knowing the features associated with birds but also having terms of memorial or attentional capacities that make them
a sense of why those features co-occur. Birds have wings, feath- unable to access a concept in a certain set of tasks. For example,
ers, and hollow bones because all those structures work together children might fail to engage in transitive reasoning in a wide
causally to support light (Murphy and Medin 1985). Several range of tasks, not because the children lack the concept of tran-
forms of restructuring have been proposed to model theory sitivity but because of the memory burdens imposed by having to
change: conceptual diferentiation, conceptual coalescences, keep several inequalities in mind at the same time. When those
addition of new deeper levels of explanation, and complete reor- memory burdens are reduced by intensive practice with the
ganizations or revolutionary changes. inequalities, the concept of transitivity is easily accessed, even
Conceptual diferentiation occurs when a single concept dif- as the learning of the inequalities might be quite diicult (Bryant
ferentiates into two new concepts that make the earlier concept and Trabasso 1971). One way of thinking about increasing access
obsolete. hus, children might initially have a concept of felt can be seen in the metaphor of a young child learning to use a
198
Conceptual Development and Change Conceptual Metaphor
heavy hammer. We might note at irst that the child cannot use Kuhn, homas. 1970. he Structure of Scientiic Revolutions.
the hammer at all and think that he or she has a hammer dei- Chicago: University of Chicago Press.
cit, only to ind out that with a much lighter hammer, the child Lehrer, A. 1978. Structures of the lexicon and transfer of meaning.
reveals a full understanding of hammers. In other cases, there Lingua 4: 95123.
Murphy, Gregory, and Douglas Medin. 1985. he role of theories in con-
may be a real deicit in the form of the missing concept.
ceptual coherence. Psychological Review 92: 289316.
Shifting relevance refers to changes in which several possi-
Nakhleh, M. B., and A. Samarapungavan. 1999. Elementary school chil-
ble conceptual interpretations irst come to mind in a task. For drens beliefs about matter. Journal of Research in Science Teaching
example, when young children are asked if worms eat, they 36: 777805.
might initially say that worms do not because they interpret eat Nelson, Katherine. 1973. Some evidence for the cognitive primacy of
in a psychological manner involving feelings of satiation, hunger, categorization and its functional basis. Merrill-Palmer Quarterly
and pleasant tastes. Older children who interpret eat in a biolog- 19: 2139.
ical sense of providing nutrition might judge that worms do it. Nussbaum, J. 1979. Childrens conception of the earth as a cosmic
Such changes have been interpreted as evidence for the emer- body: A cross age study. Science Education 63: 83.
gence of the new conceptual domain of biology (Carey 1985). Yet Rozin, Paul. 1976. he evolution of intelligence and access to the cog-
younger children may also be able to access the biological sense nitive unconscious. In Progress in Psychobiology and Physiological
Psychology. Vol. 6. Ed. J. M. Sprague and A. N. Epstein, 24528. New
of eat when shown that such an interpretation is appropriate
York: Academic Press.
(Gutheil, Vera, and Keil 1998).
Smith, Carol, Susan Carey, and Marianne Wiser. 1985. On diferen-
In summary, there are several distinct varieties of conceptual tiation: A case study of the development of size, weight, and density.
change as well as other patterns of cognitive change that can Cognition 21: 177237.
masquerade as conceptual change. It is critical in discussions of Smith, Carol, Gregory Solomon, and Susan Carey. 2005. Never getting to
conceptual change and of the relations of conceptual change to zero: Elementary school students understanding of the ininite divis-
other topics, such as word meaning, to know which senses are ibility of matter and number. Cognitive Psychology 51: 10140.
in play. hagard, Paul. 1992. Conceptual Revolutions. Princeton, NJ: Princeton
University Press.
Frank Keil Tomikawa, S. A., and D. H. Dodd. 1980. Early word meanings: Perceptually
or functionally based? Child Development 51: 11039.
WORKS CITED AND SUGGESTIONS FOR FURTHER READING Vygotsky, L. S. [1934] 1986. hought and Language. Cambridge, MA: MIT
Press.
Bryant, Peter E., and homas Trabasso. 1971. Transitive inferences and
Werner, H., and B. Kaplan. 1963. Symbol Formation: An Organismic-
memory in young children. Nature 232: 4568
Developmental Approach to Language and the Expression of hought.
Carey, Susan. 1985. Conceptual Change in Childhood. Cambridge,
New York: Wiley.
MA: MIT Press.
. 1988. Conceptual diferences between children and adults.
Mind and Language 3: 16781. CONCEPTUAL METAPHOR
. 1991. Knowledge acquisition: Enrichment or conceptual
change? In he Epigenesis of Mind: Essays on Biology and Cognition, According to proponents of conceptual metaphor theory, con-
ed. S. Carey and R. Gelman, 25791. Hillsdale, NJ: Erlbaum. ceptual metaphors are metaphors that we have in our minds
Chinn, C., and W. Brewer. 1993. he role of anomalous data in knowl- that allow us to produce and understand abstract concepts. he
edge acquisition: A theoretical framework and implications for science theory was irst expounded by George Lakof and Mark Johnson
instruction. Review of Educational Research 63: 149. (1980), who argued that conceptual metaphors structure how
Fodor, Jerry. 1998. Concepts: Where Cognitive Science Went Wrong. he people perceive, how they think, and what they do. According
1996 John Locke Lectures. Oxford: Oxford University Press. to Lakof (1993), conceptual metaphors represent habitual ways
Gopnik, Allison, and Henry Wellman. 1994. he theory-theory. In of thinking, in which people metaphorically construe abstract
Mapping the Mind: Domain Speciicity in Cognition and Culture, ed.
concepts such as time, emotions, and feelings, in terms of more
L. Hirschfeld and S. Gelman, 25793. New York: Cambridge University
concrete entities.
Press.
Gutheil, G., A. Vera, and F. Keil. 1998. Houselies dont think: Patterns of
induction and biological beliefs in development. Cognition 66: 3349. Some Conceptual Metaphors and Mappings
Inagaki, Kayoko, and Giyoo Hatano. 2002. Young Childrens Naive Conceptual metaphors are usually expressed in an A IS B format,
hinking about the Biological World. New York: Psychology Press. using capital letters. For example, in the conceptual metaphor
Keil, Frank. C. 1986. Conceptual domains and the acquisition of meta- THEORIES ARE BUILDINGS, theories (an abstract concept)
phor. Cognitive Development 1: 7396. are viewed metaphorically as buildings (a concrete entity).
. 1989. Concepts, Kinds, and Cognitive Development. Cambridge, Conceptual metaphors consist of a source domain and a tar-
MA: MIT Press. get domain (see source and target ). So in the conceptual
. 2003. hats life: Coming to understand biology. Human metaphor THEORIES ARE BUILDINGS, buildings constitute
the source domain, and theories constitute the target domain.
Keil, Frank C., and Nancy Batterman. 1984. A characteristic-to-deining
THEORIES are thus viewed as if they were BUILDINGS (exam-
shift in the acquisition of word meaning. Journal of Verbal Learning
and Verbal Behavior 23: 22136.
ples follow). Lakof (1993) describes the relationship between
Kemler, Deborah G., and Linda B. Smith. 1978. Is there a developmen- the two domains of a conceptual metaphor as a function,
tal trend from integrality to separability in perception? Journal of where speciic properties of the source domain are mapped
Experimental Child Psychology 26: 498507. onto the target domain (see mapping). So, in the conceptual
199
Conceptual Metaphor
Conceptual Metaphors Linguistic Metaphors

e.g., ARGUMENT IS WARFARE e.g., Mr. Marshall had the knives out
for Mr. Manning
They involve the drawing together of They involve the drawing together of
incongruous domains. incongruous words.
Figure 1. The main differences between conceptual

They are structures that are deeply They are surface-level linguistic features. and linguistic metaphors.
embedded in the collective
subconscious of a speech community.
They are thought to constitute a They are usually used to get a particular
structured system upon which much point across, or to perform a particular
abstract thought is based. function.
metaphor THEORIES ARE BUILDINGS, properties of the back in the 60s and to move on. In the same way, ARGUMENT
source domain, BUILDINGS, such as needing a foundation or is often thought of in terms of WARFARE, UNDERSTANDING is
being built from component parts, are mapped onto the target often expressed in terms of SEEING, LOVE is often thought of in
domain of THEORIES, allowing us to talk about theories being terms of a PHYSICAL FORCE, and IDEAS are often thought of in
built on assumptions and axioms or put together by connect- terms of OBJECTS. Conceptual metaphors are thought to exist
ing smaller ideas. he relationship is thus one way: heories are for every abstract concept that we have, although there is no one-
treated as buildings, but buildings are not treated as theories. to-one mapping: A single abstract concept can be understood
Source domains are thus broad, often complex, cluster-like cat- through several conceptual metaphors, and a single conceptual
egories that can provide a rich source of mappings (Littlemore metaphor can be used to explain several abstract concepts. Some
and Low 2006). hey are sometimes described as image sche- conceptual metaphors are universal, whereas others vary from
mas, as they can be represented in highly abstracted simple language to language (cf. metaphor, universals of).
diagrams. Conceptual metaphors are often very complex, and one
Conceptual metaphors are thought to be acquired in infancy, conceptual metaphor will frequently give rise to a series of
through our physical interaction with the world, by the way in mappings. For example, the conceptual metaphor THINKING
which we perceive the environment, move our bodies, and exert IS PERCEIVING gives rise to mappings such as IDEAS ARE
and experience force. Other peoples habitual ways of selecting THINGS PERCEIVED (its quite clear to me); ATTEMPTING TO
and using image schemas will also be inluential. GAIN KNOWLEDGE IS SEARCHING (Im still looking for a solu-
he conceptual metaphor THEORIES ARE BUILDINGS mani- tion); and BEING IGNORANT IS BEING UNABLE TO SEE (you
fests itself in expressions such as: have allowed yourself to be blinded to the truth) (Gibbs 2006).
Conceptual metaphors difer from conceptual metonymies in
You have to construct your argument carefully.
that they involve mappings between diferent domains, whereas
but they now have a solid weight of scientiic evidence. in conceptual metonymies, one part of the single domain is used
he pecking order theory rests on sticky dividend policy. to refer to another, related part of that domain.
his theory is totally without foundation.
Conceptual and Linguistic Metaphor
in which case, the entire theory would have no support.
It is useful to distinguish between conceptual metaphor and
He has done his best to undermine the theory. linguistic metaphor. Conceptual metaphors are cognitive struc-
In an attempt to build a formal theory of underdevelopment tures that are deeply embedded in our subconscious minds,
whereas linguistic metaphors are surface-level linguistic phe-
he value of a scholarly theory should stand or fall on the nomena. It is important to note that the precise words used to
character of the evidence. describe the two domains in a conceptual metaphor (like TIME
and MONEY) are not important or at least not crucial. his is very
One of the most productive conceptual metaphors is the diferent from the situation with linguistic metaphors, where it
conduit metaphor in which communication is seen as trans- is the exact words that constitute the metaphor (Littlemore and
fer from one person to another, allowing us to talk, for example, Low 2006). Indeed, the whole point of a conceptual metaphor is
about conveying information, and getting the message across. that it stands apart from actual exemplars. Figure 1 shows the
Another conceptual metaphor, PROGRESS THROUGH TIME IS main diferences between conceptual metaphors and linguistic
FORWARD MOTION, results in expressions such as plan ahead, metaphors.
200
Conceptual Metaphor
At times, the ability to understand linguistic metaphors concept of synaesthesia, where one sensory stimulus evokes a
(when they are irst encountered) may rely on the success- stimulus in a diferent sensory organ (see Ramachandran and
ful identiication of a relevant conceptual metaphor; at other Hubbard 2001). For example, the fact that the color red often
times, it may not. However, the ability to identify an appropri- denotes heat is due to our ability to make synaesthesic map-
ate conceptual metaphor in itself is rarely suicient to allow a pings between the senses of sight and touch. he synaesthetic
complete understanding of a linguistic metaphor. Additional relationship between sound and vision is relected in the fact
metaphoric thinking is usually required, which takes into that dark or heavy music is likely to involve low notes and
account the context in which the metaphor appears and the minor keys, whereas light music is more likely to involve high
function that it is intended to perform. For example, in order notes and major keys. Primary metaphors thus constitute a
to understand the metaphor slavery was well on the road to more clearly delimited, cognitive, embodied phenomenon and
extinction, it may be helpful (but not necessary) to think in lend themselves much more readily to rigorous empirical test-
terms of the conceptual metaphor PROGRESS IS FORWARD ing than do conceptual metaphors.
MOTION. However, further metaphoric thinking is required to Another criticism of conceptual metaphors is that they often
understand that considerable progress has already been made give only a partial explanation of more creative linguistic meta-
and that there is likely to be no turning back. hus, concep- phors, and the relationship between the two is unclear. In order
tual metaphors sometimes help us to understand linguistic to address this criticism, Andrew Goatly (1997) has extended
metaphors, but they are not always a necessary prerequisite conceptual metaphor theory to take account of the more cre-
nor a suicient condition (see necessary and sufficient ative extensions of conceptual metaphors. Instead of conceptual
conditions ). metaphors, he refers to root analogies. He uses this term to
relect the fact that the original analogy often remains hidden
Developments in Conceptual Metaphor Theory and its relationship to the creative expression is not always clear.
Although conceptual metaphor theory has been hugely inluen- To illustrate his point, Goatly cites the expression the algebra
tial, it has come in for a certain amount of criticism, which has led was the glue they were stuck in. his novel metaphorical expres-
to the theory itself being developed and reined. he main criti- sion is a creative extension of the root analogy DEVELOPMENT
cisms of conceptual metaphor theory include the following: that IS FORWARD MOVEMENT, but the relationship is complex and
the number of conceptual metaphors has had a tendency to pro- not immediately apparent. he root is there, but it cannot actu-
liferate; that they vary signiicantly in the extent to which they ally be seen.
are employed and elaborated; and that there is a huge amount of A inal criticism of conceptual metaphor theory has been that
overlap between them. Moreover, as Graham Low (1999; 2003) the examples used to illustrate conceptual metaphors are not
points out, although it may be tempting, for example, to identify always taken from real language data. Signiicant eforts are now
a single conceptual metaphor of A THEORY IS A BUILDING in a being made to address this issue, most notably by Alice Deignan
text containing phrases such as those listed previously, the ana- (2005) and Anatol Stefanowitsch and Stefan Gries (2006), all of
lyst has no proof that buildings were ever present in the writers whom have used language corpora not only to identify examples
mind when he or she was producing the text (cf. metaphor, of conceptual metaphors but also to reine and develop concep-
information transfer in). If the conceptual metaphor isnt tual metaphor theory itself (see corpus linguistics). his
in the writers mind, then where is it? Could it be that it exists approach allows for a more systematic assessment of the types of
only in the analysts mind? source domains that feature in diferent genres and of the com-
In a partial response to criticisms such as these, Joseph Grady plex interplay between conceptual and linguistic metaphor. An
(1997) suggests that conceptual metaphors do not in fact consti- interesting insight to have come from their research is the fact
tute the most basic level of mapping. Instead, he proposes the that the phraseological patterns surrounding the metaphorical
idea of primary metaphors, which constitute a more fundamen- senses of a word often difer from those that surround its more
tal type of metaphor (Grady and Johnson 2002). Primary meta- literal senses. Phraseological patterning is thus likely to make an
phors arise out of our embodied functioning in the world (Gibbs important contribution to the creation of meaning, and this must
2006) and, as such, are more basic than conceptual metaphors. be taken into account when studying conceptual metaphors.
hey include very basic concepts, such as CHANGE IS MOTION,
Jeannette Littlemore
HELP IS SUPPORT, and CAUSES ARE PHYSICAL SOURCES. One
primary metaphor can often underlie several conceptual meta-
phors. For example, the primary metaphor EXPERIENCE IS A
VALUED POSSESSION is held to underlie the conceptual meta- Cameron, Lynne, and Graham Low, eds. 1999. Researching and Applying
phors DEATH IS A THIEF, A LOVED ONE IS A POSSESSION, and Metaphor. Cambridge: Cambridge University Press.
OPPORTUNITIES ARE VALUABLE OBJECTS. Deignan, Alice. 2005. Metaphor and Corpus Linguistics. London: John
Benjamins.
Primary metaphors are experiential, in that they result from
Gibbs, Raymond. 2006. Embodiment and Cognitive Science.
a projection of basic bodily experiences onto abstract domains.
As such, they are representative of a wider view of human cog- Goatly, Andrew. 1997. he Language of Metaphors. London: Routledge.
nition that gives a central role to embodiment. Proponents Grady, Joseph. 1997. heories are buildings revisited. Cognitive
of embodiment argue that we understand abstract concepts Linguistics 8: 26790.
in terms of our physical experiences with the world and that Grady, Joseph, and Christopher Johnson. 2002. Converging evidence for
the two are impossible to separate. his is closely linked to the the notions of subscene and primary scene. In Metaphor and Metonymy
201
Conduit Metaphor Connectionism and Grammar
in Comparison and Contrast, ed. Rene Dirven and Ralf Prings, 53354. capture this by specifying a set of constraints on the way that
Berlin: Mouton de Gruyter. words are put together to form diferent types of constituents,
Kovecses, Zoltan. 2002. Metaphor: A Practical Introduction. Oxford: Oxford such as noun phrases and verb phrases, as well as the way these
University Press. phrases may be combined to produce well-formed sentences.
Lakof, George. 1987. Women, Fire and Dangerous hings: What
Connectionist models have begun to show how constituent struc-
Categories Reveal About the Mind. Chicago and London: University of
ture may be learned from the input. J. L. Elman (1990) trained a
Chicago Press.
. 1993. he contemporary theory of metaphor. In Metaphor and
simple recurrent network (which has a copy-back loop providing
hought 2d ed. Ed. Andrew Ortony, 20251. Cambridge: Cambridge it with a memory for past inputs) on a small context-free gram-
University Press. mar and was able to show that the network could acquire aspects
Lakof, George, and Mark Johnson. 1980. Metaphors We Live By. of constituent structure. In related work, M. H. Christiansen and
Chicago: University of Chicago Press. N. Chater (1994) demonstrated that this kind of model is capable
Littlemore, Jeannette, and Graham Low. 2006. Figurative hinking and of generalizing to novel syntactic constructions involving long-
Foreign Language Learning. Basingstoke, UK: Palgrave MacMillan. distance dependencies across constituents, suggesting that it is
Low, Graham. 1999. Validating metaphor research projects. In able to exploit linguistic regularities that are deined across con-
Researching and Applying Metaphor, ed. Lynne Cameron and Graham stituents. A subsequent model by D. L. T. Rohde (2002) has fur-
Low, 4865. Cambridge: Cambridge University Press.
ther shown that constituent structure can be learned from more
. 2003. Validating models in applied linguistics. Metaphor and
natural language-like input than that used by previous models,
Symbol 18.4: 23954.
Ramachandran, V. S., and E. M. Hubbard. 2001. Synaesthesia a win-
indicating that this approach may scale up well to deal with full-
dow into perception, thought and language. Journal of Consciousness blown language.
Studies 8.12: 334. he notion of constituency that emerges in these models is
Stefanowitsch, Anatol, and Stefan Gries, eds. 2006. Corpus-based not the same as what is found in standard models of grammar.
Approaches to Metaphor and Metonymy. Berlin: Mouton de Gruyter. Rather, connectionist models suggest a more context-sensitive
notion of constituency, dividing words and phrases into clus-
ters without categorical boundaries and treating them difer-
CONDUIT METAPHOR ently depending on the linguistic context in which they occur.
he conduit metaphor (Reddy [1979] 1993) models communi- For example, Elmans (1990) model was able to learn context-
cation as a process in which the speaker puts information into sensitive animacy constraints from word co-occurrence infor-
words and gets it across to a receiver, who tries to ind the mean- mation, thus allowing it to distinguish semantically meaningful
ing in the words. Words are understood as containers, mean- sentences (e.g., he boy broke the plate) from nonsensical ones
ings as objects that can be put into words. Reddy was concerned (e.g., he plate broke the boy).
with the biasing inluence this model has on our thinking about he generative power of grammars derives from recursion, the
communication. notion that constituents can be embedded within one another
and even within themselves. he model by Elman (1991) was
Jrg Zinken perhaps the irst to demonstrate the acquisition of a limited abil-
ity to process recursive structure in the form of right-branching
WORK CITED relative clauses (e.g., he cat chased the mouse that bit the dog),
Reddy, Michael J. [1979] 1993. he conduit metaphor: A case of frame as well as center-embedded constructions (e.g., he mouse that
conlict in our language about language. In Metaphor and hought, the cat chased bit the dog). Christiansen and Chater (1994), as
ed. A. Ortony, 164201. Cambridge: Cambridge University Press. well as Rohde (2002), extended this initial work by incorporating
several additional types of recursive structure, including sen-
tential complements (e.g., Mary thinks that John says that ),
CONNECTIONISM AND GRAMMAR
possessive genitives (e.g., Johns brothers friend ), and prep-
Connectionist approaches to language employ artiicial neural ositional phrases (e.g., he house on the hill near the lake ).
networks to model psycholinguistic phenomena (see connec- Additionally, Christiansen and Chater (1999) demonstrated
tionist models, language structure, and represen- that the performance of connectionist models closely match
tation). Although a few connectionist models have been used to human data from German and Dutch that relates to complex
directly implement traditional types of grammar (e.g., Fanty 1986), sentences involving recursive center embeddings (with the fol-
most aim to ofer new ways of capturing key properties of grammar, lowing dependency relationship between nouns and verbs N1 N2
such as constituent structure and recursion (see recur- N3 V3 V2 V1) and cross-serial dependencies (N1 N2 N3 V1 V2 V3),
sion, iteration, and metarepresentation). In particular, respectively. Speciically, people ind doubly center-embedded
the latter models seek to demonstrate how important aspects of constructions in German much harder to process than com-
grammar may emerge through learning, rather than being built parable levels of cross-serial dependency embedding in Dutch
into the language system. his entry, therefore, focuses on the (controlling for semantic factors across the two languages), and
radical connectionist models as they promise to provide new ways this pattern of processing diiculty was mirrored closely by the
of thinking about grammar and, as such, potentially could provide model. As with the connectionist notion of constituency, the
the most substantial contribution to the language sciences. recursive abilities of connectionist models deviate from standard
Words in sentences are not merely strung together as beads conceptions of recursion. Speciically, connectionist models are
on a string but are combined in a hierarchical fashion. Grammars unable to accommodate unlimited recursion; it is important
202
Connectionism, Language Science, and Meaning
to note, however, that they are able to capture recursion at the he constituents are parallel to neurons, and the operations are
level of human abilities, as evidenced by psycholinguistic parallel to the iring of neurons. However, connectionist mod-
experimentation. els are not strictly neurobiological and may be implemented in
Connectionist approaches to grammar are still very much various materials (e.g., computers). More exactly, a connection-
in their infancy and currently do not have the kind of coverage ist architecture has nodes as its basic constituents. hese nodes
and grammatical sophistication as seen in more traditional are linked to one another, forming circuits. he nodes may have
computational models of syntax. Moreover, the question diferent degrees of activation, and they receive activation from
remains as to whether the initial encouraging results described other nodes in the circuit. When a node is activated in some
here can be scaled up to deal with the full complexities of real models, when it reaches a particular level of activation, a thresh-
language in a psychologically realistic way. If successful, how- old it ires, transmitting its activation to subsequent nodes in
ever, then the conception of grammar may need to be radically the circuit.
rethought, including notions of constituency and recursion. he individual connections among nodes are commonly
Already, connectionist models have suggested that the idea understood to have diferent degrees of strength. Strength is typi-
of an ininite linguistic competence, as typically prescribed cally a multiplicative relation, such that the activation of the ir-
by generative grammar, may not be required for capturing or input node is multiplied by the connection strength to yield
ing human language performance. In this regard, the kind the amount of activation transmitted to the recipient node (e.g.,
of grammatical framework hinted at by connectionist models a node iring at level 1 delivers a level of activation to a second
more closely resemble those of construction grammars node of .5 if the strength of the connection between the nodes
and the usage-based theory of language than the tradi- is .5). hese connection strengths may be altered by activation
tional generative grammar approaches. Whatever the future sequences (e.g., in many models, when nodes activate together,
outcome of the connectionist approach to grammar may be, it the strength of their connection increases). he connections may
is likely to stimulate much debate over the nature of grammar be excitatory or inhibitory that is, a irst node my increase or
and language itself as it has done in the past and this, in decrease the activation of a second node. Connectionist circuits
the long run, may be where connectionism will have the larg- or neural networks commonly have a set of input nodes, a set
est impact on the way we think about grammar within the lan- of output nodes, and layers of hidden nodes. Connectionist
guage sciences. models also incorporate some way that errors may be detected
and corrected. In a connectionist model, correction is a mat-
Morten H. Christiansen
ter of readjusting connection strengths among the nodes in the
circuit.
Connectionist modeling has two broad purposes. One relates
Christiansen, M. H., and N. Chater. 1994. Generalization and connec- to artiicial intelligence. he other relates to actual human cog-
tionist language learning. Mind and Language 9: 27387. nition. Insofar as connectionist models are designed to explain
. 1999. Toward a connectionist model of recursion in human lin- human cognition, the models are constrained by properties of
guistic performance. Cognitive Science 23: 157205.
human behavior. Take, for example, a connectionist model
Elman, J. L. 1990. Finding structure in time. Cognitive Science 14:
of plural formation in English. If a connectionist is merely set-
179211.
. 1991. Distributed representation, simple recurrent networks, and
ting out to create a program that generates plurals, he or she
grammatical structure. Machine Learning 7: 195225. does not need to worry about the precise sorts of errors actual
Fanty, M. A. 1986. Context-free parsing with connectionist networks. human beings make with plurals, the way plural usage develops
In Neural Networks for Computing, ed. J. S. Denker, 14045. New in childhood, and so on. However, a connectionist who is mod-
York: American Institute of Physics. eling actual human language will wish to design a system that
Onnis, L., M. H. Christiansen, and N. Chater. 2005. Cognitive sci- produces the same curve of correct plurals and errors that we
ence: Connectionist models of human language processing. ind among real people.
In Encyclopedia of Language and Linguistics, ed. K. Brown.
Oxford: Elsevier. his review article provides a more detailed treatment Connectionism and Neuroscience
of the issues discussed here.
he artiicial intelligence value of connectionism seems clear.
Rohde, D. L. T. 2002. A connectionist model of sentence comprehension
But with respect to human language, one might ask why bother
and production. Ph.D. diss., Carnegie Mellon University, Department
of Computer Science, Pittsburgh, PA.
with connectionist modeling at all? Why dont we simply do
Rohde, D. L. T., and D. C. Plaut. 2003. Connectionist models of language neuroscience? After all, connectionism takes up the basic prin-
processing. Cognitive studies 10: 1028. Another review of connec- ciples of neurobiology neuronal units, iring thresholds, cir-
tionist models of language. cuits. However, it tends to eschew the ine-grained, empirically
based assignment of specialized neuronal or regional functions.
CONNECTIONISM, LANGUAGE SCIENCE, Moreover, it simply leaves out such important components of
AND MEANING neurobiology as neurochemistry.
Certainly, connectionist modeling of human cognitive archi-
Connectionism, or parallel distributed processing, is a general tecture cannot replace neuroscience. Moreover, it does seem
term for a set of particular cognitive architectures. With some vari- clear that such modeling should follow the basic principles of
ations, these architectures model mental processes on a shared neuroscience (e.g., in modeling human language, it should not
set of constituents and operations, drawn from neurobiology. posit processes that have no correlate in the brain). However,
203
connectionist analyses serve two purposes beyond empirical otherwise. Second, there is a parallel element in symbolic archi-
neuroscience. he irst is the general purpose of abstract mod- tectures, at least when they are fully elaborated. For example,
eling. Empirical neuroscientists are rightly concerned with try- in minimalism, the production of a sentence involves at least
ing to deine just what particular neurons, circuits, and regions some phonological, logical, and syntactic parallelism. Moreover,
of the brain do. Connectionist modeling is concerned with just connectionist networks clearly involve rules (even rules that
what kinds of things an architecture of this general sort can do. constitute operations over variables, the key set of rules stressed
Clearly, the actual human brain does not do everything that a by Gary Marcus [2001; see particularly 3583]). he rules are
similarly structured connectionist model can do. However, the embodied in the ways activation operates (e.g., through summa-
actual human brain does some subset of those things. hus, tion of inputs to thresholds).
abstract modeling tells us what we should not be investigating Finally, it is not entirely clear that distributed versus uniied
in terms of neural operation (those things that no system of this is really an opposition. For example, lexical items have compo-
sort could ever do). It also tells us what sorts of things we should nents in symbolic accounts (e.g., semantic features). Writers in
not ignore in investigating neural operation (those things that we the tradition of symbolic architecture do seem to envision those
may have thought were impossible for such a system, but which components as occurring in one place, much like items in a dic-
connectionist models indicate are not impossible). For example, tionary entry occur on a single page. However, in terms of the
it may be argued that connectionist networks have been used theories themselves, that only means that the components are
successfully to model the emergence of semantic categories conceptually related, accessed together, and so on. A symbolic
in infancy, even when they have incorporated little in the way architecture need not be committed to localism in the neural
of innate structure (see Rogers and McClelland 2004, 12173; substrate. he meaning of a given symbol may be realized in a
innateness and innatism). In this way, connectionist mod- pattern of activation across diferent areas of the brain.
eling may serve as an important guide for the empirical investi- his is not to say that there are no diferences between con-
gation of the neural substrate of language. nectionist and symbolic accounts. But many of these are just the
he second function of connectionism is related. Despite general sort of diferences that arise when one moves through
genuine advances in neurolinguistics in recent years (see brain distinct levels of structure. For example, connectionist models
and language), there is considerable divergence of opinion in stress dispersal. Symbolic accounts stress unities. But this need
the ield and relatively little is well established. Apparently, irm not constitute a contradiction anymore than diferences between
ideas (e.g., about the precise location and function of brocas physical and social accounts or macroscopic and microscopic
area and wernickes area) are continually being revised. It accounts need constitute a contradiction. We dont stop speaking
seems highly unlikely that our fundamental understanding of of trees and discussing their biology or their gross physical prop-
neurons, neuronal circuits, and other basic functional properties erties just because we discover that they are composed of atoms.
will change very much. But particular neuroscientiic accounts Nor do we say that a railway system is not reasonably treated as a
of language operation are far less stable. Neuroscientists rightly single thing just because it is dispersed in space. A similar point
tend to follow currently promising avenues of research, pursu- holds for rules. It is indeed the case that symbolic systems tend to
ing particular empirical hypotheses in line with recent theories. involve many more rules and much more speciic rules than con-
But, as a wise man (Hilary Putnam) once remarked, one of the nectionist architectures. But these rules are reasonably thought
few things we know about our current theories is that they are of as emerging from neural networks. he existence of a neural
wrong. Given this, it is valuable to have a research program that substrate without, for example, speciic grammatical rules does
operates on the same basic, well-established principles as neu- not invalidate a linguistic discussion of grammatical principles
rolinguistics, but which is not closely tied to the vagaries of cur- anymore than the existence of a particle substrate invalidates an
rent theorization. engineers discussion of macroscopic causal laws.
Consider, for example, the head-directionality parameter
Connectionism and Symbolic Architectures in principles and parameters theory. As Mark Baker
Another diference between connectionism and neurobiology is explains in his entry on parameters, Roughly put, when a
that connectionist models move, so to speak, in the direction of word-level category X [the head] merges with a phrase Y to create
symbolic architectures (what we commonly think of as mental a phrase of type X, there are two ways that the elements can be
architectures, rather than neurobiological ones). It is common ordered: he order can be X-Y within XP [X phrase], or it can be
for connectionists to deine their models in opposition to those Y-X. In languages throughout the world, heads tend to be added
of symbolic architectures, such as generative grammar. in the same position, either irst or last (not half in each, as one
Symbolic architectures are commonly thought to operate seri- might expect from random distribution). Grossly oversimplify-
ally, rather than in parallel, and to operate locally, rather than in ing, we could imagine a connectionist model in which the net-
a distributed fashion. So a symbolic architecture would typically works for a range of processes develop some alternation between
be understood as involving some singular representation (e.g., initial position and inal position. For example, it is easy to set up
a concept or a sentence) that is put through a series of pro- a model in which some set of items triggers a directionality node,
cesses in a certain sequence following certain rules with rules which in turn activates either a beginning or an end node. Suppose
sometimes also (putatively) rejected in connectionist models. it activates the beginning node. Once that node is activated in
Conceived of in absolute terms, the dichotomy is a false one. the context of a task say, identifying a determiner it will lead
First, there is a sequential element in all connectionist mod- to a behavioral output of checking the beginning. he absence
els. here would be no sense in speaking of input and output of the determiner at the beginning will initiate the correction
204
mechanism. Given a general relation between beginning and viviparous (e.g., it is an amoeba farm) and, more importantly,
end nodes within the network, that mechanism could by default when x is a farm animal and x is both oviparous and viviparous.
lead to the activation of the end node. Computer simulations of he former case is the obvious one, since it does not rely on the
connectionist models could be used to examine whether this peculiar nature of exclusive or (or the existence of simultane-
might lead to a coordination of diferent heads as irst or last. A ously oviparous and viviparous creatures). However, real people
successful connectionist model would not imply that one should at some point come to understand the second possible falsity as
not speak of a principle here (heads are added in one direction) well. For example, they come to draw an inference very swiftly
with a parameter to be set (irst/last). Rather, it would provide a when they learn that x is oviparous (i.e., they infer that x is not
model for the substrate of an emergent principle. viviparous) or when they learn that x is viviparous (i.e., they infer
his account is, as I said, grossly oversimpliied. For example, that x is not oviparous). A connectionist model should produce
it assumes that the system collects together all heads, rather than this result as well.
treating diferent sorts of heads separately or rather than orga- To model exclusive or, we begin with input nodes. In a full
nizing grammar in ways that do not even include heads. his is model, we would need an array of nodes to represent farm,
a very small instance of the sort of problem Steven Pinker has in another array for animal, and so on. But, for simplicity, lets
mind when he questions whether or not connectionist models can assume one node per word or object. Speciically, lets take the
scale up to the level of a full grammar (2002, 79). Pinker ofers case of Flufy. Little Bufy arrives on the farm and encounters
this criticism in the context of supporting a symbolic approach to Flufy. She already has a node for farm animal. Farm is acti-
grammar. But here, too, the opposition is mistaken. If we assume vated by her presence in the barn. Animal is activated by various
that the substrate of any grammatical operation is the human properties of Flufy. Together, these activate farm animal. Flufy
brain, then something along the lines of a connectionist network is assigned a new node. he activation of the (new) Flufy node
will have to scale up to the level of a full grammar. As Marcus puts along with the farm animal node serves to link the two. Bufy
it, referring to a speciic case, he right question is not Can any already knows that animals either have eggs or have babies.
connectionist model capture the facts of inlection? but rather hus, part of her stable knowledge involves the relation of exclu-
What design features must a connectionist model that captures sive disjunction between oviparous and viviparous. (Obviously,
the facts of inlection incorporate? (2001, 83). it doesnt matter if she knows these particular words.)
Of course, Pinker has in mind something else here innatism. his may be modeled in the following way (see Figure 1). Both
It is true that symbolic approaches to language have often posited oviparous and viviparous have their own nodes (marked o and
a rich innate grammar, whereas connectionist approaches have v). here are also nodes for nonoviparous (-o) and nonvivipa-
tended to minimize innatism. In general, it is valuable to have con- rous (-v). Oviparous has an excitatory connection (marked by a
licting views on this issue driving competing research programs. pointed arrow) with nonviviparous, which, in turn, has an inhibi-
However, it seems that the association of innatism with symbolism, tory connection with viviparous (marked by an arrow with a cir-
on the one hand, and the association of noninnatist learning with cular head). Similarly, viviparous has an excitatory connection
connectionism, on the other hand, is contingent. here is no rea- with nonoviparous, which, in turn, has an inhibitory connection
son that one could not have specializations and biases in a connec- with oviparous. hus, when Bufy sees Flufy sitting on a nestful
tionist network (cf. Martindale 1995, 250). hese would, of course, of eggs, she infers not only that Flufy is oviparous but also that
not be rules in the symbolic sense but they would be equivalent she is nonviviparous. (Moreover, due to inhibition, she does not
to rules at the level of the emergent structure. Conversely, there is infer that Flufy is viviparous, even though viviparous might have
no reason that a symbolic account of grammar as such requires
rich innatism. It could begin from a view that grammar is learned
through experience, given general learning mechanisms (perhaps
of the sort currently associated with connectionism).
v
An Example from Truth Conditional Semantics
Consider a variation on a standard example exclusive or (i.e.,
o
one of two items, but not both) embedded, in this case, in a con-
ditional assertion. Conditionals and logical connectives, such
as or, are important not only in the semantics of formal
languages but also in the semantics of natural languages. For v
example, they are critical for truth conditional semantics.
Take a sentence such as If x is a farm animal, then x is ovipa- o
rous or viviparous. he semantics of this sentence are, in fact,
enormously complex. A full connectionist model would have Figure 1. A simpliied model for exclusive or in the context oviparous or
to account for many things. For the moment, lets assume that viviparous. Arrows indicate excitatory connections. Lines with solid circles
most of this is taken care of, focusing only on the truth condition indicate inhibitory connections. The symbol - stands for not; o stands
issue. In a truth conditional account, we need to map the sen- for oviparous; v for viviparous. Note that this model is not intended to
tence onto two values. hese are T (or true) and F (or false). here capture the truth conditions per se. Rather, it models the psychological
are only two conditions in which the sentence is mapped onto process idealized in the truth conditions. Thus, a disruption in a particular
F that is, when x is a farm animal and x is neither oviparous nor
connection could lead someone to hold logically contradictory beliefs.
205
Connectionism, Language Science, and Meaning Connectionist Models, Language Structure, and Representation
by denying that this complex of parallel and distributed processes

T yields principles. More importantly, this account also indicates
that the standard logical rule for exclusive or may not be our actual
c psychological rule. For example, this model partially incorporates
o
our common bias toward conirmation, rather than falsiication or
v neutrality (see, for example, Nisbett and Ross 1980, 23842). he
sorts of statistical processes manifest in fully developed connec-
o
ov tionist models would be able to suggest other deviations from the
v F logical rule as well the place of exceptions, the degree to which
our memories often revert to the assumption of truth even after dis-
conirmation, and so on. Finally, such developments may suggest
Figure 2. A simpliied model of the psychologically operative truth condi- avenues of inquiry for neurological investigations (e.g., in semantic
tions for If x is a farm animal, then x is oviparous or viviparous. The letter and episodic memory), which are not initially obvious in the study
c stands for the entire conditional. T stands for true and F for false. The of conditionals or logical connectives.
letters o and v stand for oviparous and viviparous; - stands for not; In sum, connectionism provides us with a way of modeling
and -ov stands for neither oviparous nor viviparous. Arrows indicate language processing, storage, and acquisition indeed, history,
excitatory relations. Lines with solid circles indicate inhibitory connections.
evolution, variation, and other areas of language science as well.
Solid lines indicate a connection strength of 1. Broken lines indicate a con-
It is inspired by neurobiology, but departs from this source to
nection strength of .5. Activation of a node occurs at 1. Thus, for example,
follow a more independent research program that may direct
both -o and -v would have to be activated for -ov to be activated. This is
because the .5 connection strengths reduce the activation they commu- us toward productive areas of neurobiological research, or away
nicate to .5. Note that this is not a model of logical/empirical truth condi- from unproductive areas. At the same time, it mediates between
tions but of psychologically operative truth conditions. Thus, it seeks to neurobiology and symbolic approaches, in some cases suggest-
capture an initial presumption about the truth of the conditional based, ing problems with and possibilities for the latter as well as the
in this case, on authority. This is presumably more psychologically realistic former. Although connectionist and symbolic architectures are
than a model that yields T only in cases of inductive validity. often viewed as diametrically opposed, it may be best to see the
former as modeling neurobiological substrates of language and
the latter as treating (rule-approximating) structures that emerge
received some degree of activation simply from the [unpictured] from that substrate.
activations of mother, etc., in connection with Flufy.)
Patrick Colm Hogan
Going further, we might model the entire sentence. here
are many ways in which this can be done. he following is an
attempt at a version that, while extremely simple, at least points
toward the way disjunctive or might operate psychologically Marcus, Gary. 2001. he Algebraic Mind: Integrating Connectionism and
(see Figure 2). Suppose that when the node for our oviparous/ Cognitive Science. Cambridge, MA: MIT Press.
viviparous conditional (c) is activated, the truth node (T) is also Martindale, Colin. 1995. Creativity and connectionism. In he Creative
activated. For instance, if Uncle Bob tells Bufy that farm animals Cognition Approach, ed. Steven Smith, homas Ward, and Ronald
Finke, 24068. Cambridge, MA: MIT Press.
either lay eggs or have babies, but not both, then Bufy assumes
McLeod, Peter, Kim Plunkett, and Edmund Rolls. 1998. Introduction
that Uncle Bob is right. Bufy does not require positive evidence
to Connectionist Modelling of Cognitive Processes. Oxford: Oxford
to judge the conditional to be true. However, Bufy may discover University Press.
that Uncle Bob is secretly farming amoeba, which are nonovipa- Nisbett, Richard, and Lee Ross. 1980. Human Inference: Strategies and
rous and nonviviparous, or that he is farming some mutant spe- Shortcomings of Social Judgment. Englewood Clifs, NJ: Prentice-Hall.
cies that lays some eggs and gives birth to some babies. In the Pinker, Steven. 2002. he Blank Slate: he Modern Denial of Human
latter case, the simultaneous activation of oviparous and vivipa- Nature. New York: Viking.
rous could be modeled as activating F (through connections of Rogers, Timothy, and James McClelland. 2004. Semantic Cognition: A
strength .5). In a slightly more complex sequence, positive acti- Parallel Distributed Processing Approach. Cambridge, MA: MIT Press.
vation of both nonoviparous and nonviviparous might activate a
hidden node (say, -ov) through .5 connection strength links; this
node could, in turn, activate F (through a connection strength of
CONNECTIONIST MODELS, LANGUAGE STRUCTURE,
1). Finally, in this model, F would inhibit T.
AND REPRESENTATION
his is, of course, an extremely simple model. In fact, con- Connectionist models have many desirable properties when
nectionist models of language phenomena are highly complex, it comes to understanding several very basic things about
requiring computer simulations to plot their predictions and devel- language:
opments (see, for example, Rogers and McClelland (2004); see also
Why language has the structure that it does;
connectionist models, language structure, and repre-
sentation and connectionism and grammar). Nonetheless, How knowledge of language structure is represented so as to
this model may partially illustrate some of the relations between lexibly encompass the sort of structure that it has;
connectionist models, symbolic architectures, and neurobiology How language can change over time in a direction that main-
outlined previously. For example, we do not seem to gain anything tains and accentuates this sort of structure.
206
Connectionist Models, Language Structure, and Representation
Connectionist models also have desirable properties when it related cases, such as MIND, FIND, MILD, and so on, that over-
comes to understanding language acquisition. lap with PINT and share the atypical vowel correspondence seen
in this item. he other cases have similar properties.
Quasi Regularity he point, in all of these cases, is that an inherent feature
A key, but underrecognized, aspect of all of natural languages is of language is its quasi regularity: irregular forms are nearly
the fact that they exhibit quasi regularity. hat is, linguistic expres- always partially regular. Approaches to language that relegate
sions generally have properties that are shared with many other all quasi-regular forms to the status of exceptions are, in general,
forms, while also having properties that are more idiosyncratic. eschewing an account of an important and productive aspect of
In general, they are neither completely regular nor completely language.
irregular, but instead are best understood as lying somewhere
along a continuum of regularity, with most forms somewhere Connectionist Models and Quasi Regularity
between the extremes. Languages do have a tendency to pull Connectionist models ofer ways of representing language
novel and infrequent forms into conformity with other forms, but knowledge and language learning that explain why it has the
they also have a tendency to maintain (even promote) forms with quasi-regular structure that it does and why it changes in some
item-speciic idiosyncrasies and clusters of similar items that of the ways that it does. No extant connectionist model is perfect
share such idiosyncrasies. hese idiosyncrasies coexist within in its account of any speciic aspect of language, but such models
items that also exhibit sensitivity to more general regularities. have the fundamental properties necessary for addressing this
Lets consider a range of examples from several diferent sub- very general property, its quasi-regular structure, justifying their
domains of language: further development.
I present the principles of distributed connectionist models
Inlectional morphology
within the context of a simple model of single word reading. I
keep-kept
use the model to underscore the point that the knowledge that
tell-told
governs processing in connectionist models is fundamentally
say-said
diferent from the knowledge that linguists have traditionally
have-had
attributed to speakers and hearers, but it is ideally suited to
All these cases exhibit the correct regular past tense inlec- addressing the previous points, and thus is a candidate form of
tion together with a vowel change or consonant deletion in the knowledge for all aspects of language.
stem. he pattern in keep is shared with a number of other verbs, he essence of connectionist models is that they provide a
including sleep, creep, sweep, and somewhat more distantly mechanism that is at once sensitive to both general and speciic
dream, kneel, and mean. information in ways that mirror human sensitivity to such infor-
mation. A connectionist model can behave as though it knows
Derivational morphology
very general and completely idiosyncratic information; and it
predict, prefer
can exploit both general and speciic information in processing
dirty, rosy
novel forms in ways that appear to mirror those seen in normal
idolize, replicate,
human subjects. he model I describe is a model of single word
As many have noted, and Joan L. Bybee (1985) and Luigi Burzio reading (Plaut et al. 1996). It has input units representing graph-
(2002) have considered extensively, there are many derived emes, output units representing phonemes, and one intermedi-
inlectional forms that preserve semantic characteristics of ate layer of hidden units that are initially uncommitted.
their constituents but bring in altered or additional elements of In networks such as this, the knowledge that governs process-
meaning. Treating these items as either fully compositional or ing of an item is in the strengths of the connections among the
as fully opaque misses out on capturing one or the other aspects simple processing units. Knowledge enters the system gradu-
of their meaning. ally, through a connection adjustment process, based on experi-
ence with corresponding inputs and outputs. he experience is
Idioms, constructions, and collocations
assumed to mirror human experience in reading in that frequent
I want to see a doctor.
items (words like HAVE and TAKE) occur very frequently, while
She felt the baby kick.
less frequent items (words like LINT and COPE) occur far less
Hour by hour I grow more and more lonely.
frequently.
Such expressions nearly always participate in general patterns Processing and learning take place as follows: he letters from
characteristic of other expressions with overlapping constituents a word are presented over the input units; activation is allowed
(I saw a doctor, I saw a movie, I want to see a baseball game) but to propagate forward to the output units; and adjustments are
carry idiosyncratic meaning. then made to the connections within the network to reduce the
diference between the output produced by the network and the
Spelling sound correspondences
pronunciation paired with the spelling. Note that this process is
pint
not thought of as explicit correction of overt errors in the models
clown
performance. he networks outputs can be translated into an
great
overt product, but learning is thought to occur from exposure
Consider PINT. his item is an exception, but three of the letters to spelling-sound pairs. (It is possible that once the process gets
(P, N, and T) take their regular correspondences, and there are started, many of the sounds are provided internally as a result of
207
the readers ability to correctly predict the correct word based on neighbors items like BOPE with a high degree of consis-
context. Of course, at irst this is unlikely, and learning depends tency with conventional spelling-sound rules, but shows con-
on there being a supportive context in which the child hears the siderable inconsistency in productions of forms such as:
sound associated with each word as it is being read by a parent
GROOK, PREAD, MAVE
or teacher).
Consider the learning that occurs as a result of processing an he pattern in the model, as in human readers, is to choose
ordinary pair such as MINT /mint/: It afects weights coming one of the correspondences of the verb that occurs either in
from units for M, I, N, and T and going to units for /m/, /I/, /n/, forms traditionally treated as regular or irregular, but not neces-
and /t/. Over the corpus, words with M in the input will over- sarily to choose the regular form.
whelmingly have /m/ in the output and little else will be com- he model exhibits sensitivity to the following:
mon across them, and so connections will gradually arise that First-order spelling-sound regularities:
map M reliably to /m/, and similarly for most of the other let- M /m/
ters. he result is that the weights coding for the relation between B /b/
spellings that begin with M and sounds beginning with /m/ will Local context-sensitive patterns:
largely be restricted to the weights out of the M unit and into the C /s/ when followed by {I | E} as in cell, cent
/m/ unit. But now consider the vowel. he situation here is far O oh when followed by L{L|D|K|T} as in told, colt
more complex. Depending largely on aspects of the coda, it will Partially consistent clusters, such as
tend to be pronounced either as /ai/ or as /I/: he former occurs OO{K, sometimes T,F} {vowel in put}
when there is a single consonant followed by a inal E, as in MINE, EA{often D,TH, sometimes F} {vowel in bread}
FINE, DINE, and so on, while the latter tends to occur when the I Exceptions that weakly cluster with others, such as PINT.
is followed only by consonants as in MINT, LINK, SILT, and so on. Idiosyncratic high-frequency exceptions like HAVE.
But there are several exceptions, including MIND, WIND; MILD,
CHILD; PINT. he network must learn to rely on the presence of he model also accounts for the near (but not total) indepen-
the inal E to signal the /ai/ correspondence; to generally produce dence of onsets and rhymes, and the much greater dependency
/I/ otherwise; and to overcome this tendency in the speciic con- between vowels and codas. his independence is not complete,
stellation of circumstances corresponding to the exceptions. as indicated by a few special forms like wash, warm, and so on.
As a result of the gradual learning process, the weights coding he model captures this sensitivity in these cases.
for the pronunciation of the vowel in PINT will depend not only he model produces gradient efects of the degree of con-
on the letter I but also on all of the other letters in PINT. While sistency with properties of words with similar rhyme-spellings
one way in which this could occur would be for the network to and of the items own frequency. he speciic pattern of these
carve out a separate hidden unit for it, that is not what it tends gradient efects corresponds to key features of experimental
to do; the knowledge appears to be distributed over connections data: Performance on items that are highly consistent with their
into and out of many of the hidden units. he important points neighbors is relatively insensitive to the frequency of the item
here are the following: itself (leading some to assert that such items are dealt with by
algebraic rules, but fully consistent with the properties of connec-
In general, any and all aspects of an input can be relevant to tionist models). Performance on items that are inconsistent with
any aspect of the output, but complex cases are harder for the their neighbors is highly sensitive to the items own frequency.
network to master and must work against the grain of the gen- Sensitivity to an items own frequency is a matter of degree itself
eral tendencies embodied in the training corpus. dependent on the degree of inconsistency.
For the most part, even when some aspects of the output he model does these things without having an explicit rep-
such as the vowel in an exception like PINT depend on resentation of either any speciic lexical item or of any other
all aspects of the input, many other aspects in this case, subword unit (other than the grapheme and the phoneme), and
the handling of the P, N, and T are largely componentially without having an explicit representation of any of the rules or
determined. correspondences.
Secondarily but still important, even the exceptional aspect of It also has the very important property of using general knowl-
PINT beneits to a degree from the learning that occurs with edge not only to process the fully regular cases but also, to the
related exceptions like MIND, FIND, WIND, and so on. fullest extent possible, to process those items that are partially
he model accounts for exceptional: It is only the idiosyncratic aspects of exceptions that
are processed diferently from fully regular forms. It thus cap-
main efects of regularity, frequency, and frequency by reg- tures the tendency for exceptions to be largely regular in nature,
ularity interactions in human RTs and in pattern of errors the key, underappreciated feature of all languages with which I
(under conditions where errors are likely); began this entry.
graded efects of consistency of a word with pronunciations of he same general principles have been used to capture a wide
other known words; range of phenomena in reading, language processing, and many
the pattern of performance exhibited by human adults in read- other domains.
ing pronounceable nonwords that contain bodies that vary Here, I briely related mention several related models, in part
in the consistency of their pronunciation. Speciically: he to indicate the generality of the approach and in part to address
model reads regular forms that are consistent with all of their concerns that some readers may have with some of the apparent
208
commitments made by the reading model of D. C. Plaut and commitment to structure in the representations of inputs and
colleagues. outputs.
Models of past tense formation, based on mapping Models that work from raw speech input (Keidel et al. 2003).
stem->past (e.g., Rumelhart and McClelland 1986; Plunkett
here are several models the irst was an unpublished
and Marchman 1991).
efort by Elman from the late 1980s in which the model works
hese models illustrate the same principles as the reading directly with recorded spoken language. More recent eforts
model already discussed. Both articles discuss how such models (e.g., Keidel et al. 2003) along these lines are in their infancy,
can capture sensitivity to quasi-regular patterns, and K. Plunkett but when they have matured they will eliminate a residual
and V. Marchman explain why there are so few suppletions drawback of all of the aforementioned models, namely, that
(stem-past pairs like go-went) and those that exist are only of they stipulate speciic units on both their inputs and outputs.
high frequency. Essentially, it is very hard work for a connec- Models that work directly with raw speech aford the possibil-
tionist network to learn a completely arbitrary mapping. A ten- ity of seeing phones, phonemes, and syllables as approximate
dency for weights to be shared across items forces these models descriptive conveniences similar to the other sorts of units we
to exhibit sensitivity to regularities, while the fact that they are have already been discussing. his overcomes a contradiction
trained gradually through small adjustments of connections inherent in the Plaut et al. reading models, and in most of the
makes them inherently sensitive to frequency efects. other models discussed: hese models eschew units internally,
on the one hand, but appear to depend on such units in their
Models of past tense formation, mapping from meaning to
inputs and outputs, on the other. he computational demands
sound and sound to meaning (Joanisse and Seidenberg 1999;
of working directly with raw speech are daunting, but some
Lupyan and McClelland 2003).
progress is being made in ongoing work.
he same general principles are at work here as well, but the A radical connectionist hypothesis would be that approaches
approach brings out the quasi-systematic relationship between to representing language structure and language knowledge that
semantic and syntactic properties of words, on the one hand, eschew prior commitment to units of any type will ultimately be
and their phonology, on the other. his line of work has not been the most informative and successful. It will, of course, always be
pursued fully enough. useful to summarize facts that characterize language more suc-
cinctly in the form of rules. Such rules, however, will ultimately
Models of derivational morphology (Plaut and Gonnerman
be seen only as summary descriptions that approximately cap-
2000).
ture important aspects of language, rather than as directly char-
hese authors ofer a model that maps from form to mean- acterizing the way in which language knowledge is represented,
ing, with such forms as government, predict, and so on. he acquired, and used.
model captures graded priming relations, depending on degree
of compositionality. In ongoing work, Plaut and colleagues have Rules or Connections?
considered other languages (e.g., Hebrew) that show stronger S. Pinker reacted strongly against the connectionist model of D.
morphological efects. Here, one gets morphological priming E. Rumelhart and McClelland (Pinker and Prince 1988), taking
even in cases where there is no actual morphological relation- strong exception to the notion that language exhibits the graded,
ship. Extensions of the Plaut/Gonnerman model show that when similarity-based properties characteristic of connectionist mod-
trained with a more highly systematic corpus (characteristic of els. He subsequently (Pinker 1991, 1999) accepted that language
Hebrew), networks also exhibit this property. In other words, the does indeed have some gradient-like properties similar to those
networks tendency to parse morphologically complex forms into of neural networks, but maintained that such properties are
components (see the entries on parsing) relects both the spe- restricted to operating within the lexicon. He made the case that
ciic properties of individual items and the prevailing tendency there is a separate, pure, rule-based system, operating accord-
found among items in the language. ing to principles quite distinct from those characteristic of con-
nectionist networks. He made his case on the basis of a series
Models of sentence processing (Elman 1990; McClelland,
of empirical claims, arguing for the special status of categori-
St. John, and Taraban 1989).
cal, structure- but not content-sensitive rules in performance,
Both models are trained with word sequences generated acquisition, and breakdown under brain damage, also claim-
according to a simple grammar. J. L. Elmans model simply pre- ing support from cross-linguistic evidence. McClelland and K.
dicts the next word in the sequence after training with gram- Patterson (2002) evaluated all of these claims and found instead
matical sequences. he model in J. L. McClelland, M. St. John, that properties of performance, acquisition, breakdown, and
and R. Taraban maps from the word stream to a representation cross-language variation do not support the view that language is
of the set of role-iller pairs characterizing the event described based on categorical rules; rather, the evidence supports the idea
in a given input sentence. Both of these models illustrate many that language knowledge is graded, semantic- and phonological-
of the properties already considered, and operate strictly of of a content sensitive, and that there is no separate system for regular
linear sequence of inputs. here are also word reading models as opposed to irregular aspects of language.
that read the sequence of sounds appropriate to a spelled word he fact that connectionist models can capture both system-
sequentially. his eliminates the concern that arises with some atic and idiosyncratic properties of linguistic forms leads to the
of the more parallel models that theyre really hiding a speciic following suggestions:
209
Connectionist Models, Language Structure, and Representation Consciousness and Language
here are no lexical entries in the mechanisms of language McClelland, J. L., and K. Patterson. 2002. Rules or connections in past-
processing as such, only sensitivity to the idiosyncratic prop- tense inlections: What does the evidence rule out? Trends in Cognitive
erties of particular items. Sciences 6.11: 46572.
he rules that characterize regularities in the relations McClelland, J. L., M. St. John, and R. Taraban. 1989. Sentence compre-
hension: A parallel distributed processing approach. Language and
between, for example, sound to meaning (past tenses tend
Cognitive Processes 4: 287335.
to end in a variant of the dental stop) have no special status
Pinker, S. 1991. Rules of language. Science 253: 5305.
either; however, they do capture the tendency that connec- . 1999. Words and Rules. New York: Basic Books.
tionist systems have to extend consistent relations among Pinker, S., and A. Prince. 1988. On language and connectionism: Analysis
constituent parts of expressions to new expressions contain- of a parallel distributed processing model of language acquisition.
ing the same constituents. Cognition 28: 73193.
Similarly, other units besides the word, including subword Plaut, D. C., and L. M. Gonnerman. 2000. Are non-semantic morpho-
units such as phone, onset, rhyme, nucleus, coda, sylla- logical efects incompatible with a distributed connectionist approach
ble, and morpheme, as well as supraword units, including to lexical processing? Language and Cognitive Processes 15: 44585.
phrases, collocations, idioms, and constructions, may not Plaut, D. C., J. L. McClelland, M. S. Seidenberg, and K. Patterson. 1996.
be stored as such; however, listing such units and noting the Understanding normal and impaired word reading: Computational
principles in quasi-regular domains. Psychological Review
regularities that describe their relations to other units may be
103: 56115.
useful descriptively in characterizing aspects of the emergent
Plunkett, K., and V. Marchman. 1991. U-shaped learning and frequency
behavior of the system in which they need not be represented efects in a multi-layered perceptron: Implications for child language
as such. acquisition. Cognition 38: 43102.
Rumelhart D. E., and J. L. McClelland. 1986. On learning past tenses of
A recent critique of connectionist approaches by Ray Jackendof
English verbs. In Parallel Distributed Processing. Vol 2: Psychological
(2007) raises a number of concerns, centering around a perceived
and Biological Models. Ed. D. E. Rummelhart and J. L. McClelland.
failure of connectionist models to be as systematic as Jackendof
Cambridge, MA: MIT Press.
takes natural language to be. In a reply, McClelland and J. Bybee
(2007) address these concerns, emphasizing that connectionist
models do impose a tendency toward systematicity and regular- CONSCIOUSNESS AND LANGUAGE
ity, while they yet exploit quasi regularity in exceptions, some-
John Locke, one of the founders of the study of meaning, took it
thing that the approaches of Pinker and Jackendof are not able
for granted that consciousness has to play a role in our under-
to do.
standing of language. Ultimately, linguistic signs have their
J. L. McClelland meanings in virtue of their connections to perception. So, for
example, the meanings of the color words can only be grasped by
WORKS CITED AND SUGGESTIONS FOR FURTHER READING someone who has had experience of the colors. his is the gen-
eral case. Although the meanings of many words are grasped by
Bird, H., M. A. Lambon Ralph, M. S. Seidenberg, J. L. McClelland, and
grasping their connections to other words, ultimately the whole
K. Patterson. 2003. Deicits in phonology and past-tense morphol-
ogy: Whats the connection? Journal of Memory and Language system has to bottom out in connections to perception. he gen-
48: 50226. eral picture was that our perceptual experiences are the contact
Burzio, Luigi. 2002. Missing players: Phonology and the past-tense point between us and the world. When the world afects us caus-
debate. Lingua 112: 15799. ally in such a way that we can think and talk about it, it does so by
Bybee, Joan L. 1985. Morphology: A Study of the Relation between Meaning afecting our perceptual experiences. hese experiences, there-
and Form. Philadelphia: John Benjamins. fore, can serve as signs of the external phenomena that cause
Bybee, J., and J. L. McClelland. 2005. Alternatives to the combinatorial them (somewhat as smoke can serve as a sign of ire), and their
paradigm of linguistic theory based on domain general principles of status as signs is inherited by any further linguistic signs that we
human cognition. Linguistic Review 22: 381410.
use as a consequence of having those experiences (Locke [1690]
Elman J. L. 1990. Finding structure in time. Cognitive Science
1975). Something like this general picture is still popular today,
14: 179211.
Jackendof, Ray. 2007. Linguistics in cognitive science: he state of the
partly as a result of Saul Kripkes (1980) reinvigoration of a causal
art. Linguistic Review 24: 347401. view of reference. he principal modiication is that the role of
Joanisse, M. F., and M. S. Seidenberg. 1999. Impairments in verb mor- experience in perception is given little weight; consciousness is
phology following brain injury: A connectionist model. Proceedings of regarded as perhaps an inessential epiphenomenon in the pro-
the National Academy of Sciences 96: 75927. cess and as having little to do with our grasp of language.
Keidel, J. L., J. D. Zevin, K. R. Kluender, and M. S. Seidenberg. 2003. here are two lines of argument behind this elimination of
Modeling the role of native language knowledge in perceiving nonna- consciousness from the study of language. One is the direction
tive speech contrasts. Proceedings of the 15th International Congress taken by the scientiic study of vision. On the cognitive science
of Phonetic Sciences: 22214. approach exempliied by David Marr (1982), vision has to be
Lupyan, G., and J. L. McClelland. 2003. Did, made, had, said: Capturing
understood at three levels: the level of computation, at which we
quasi-regularity in exceptions. Proceedings of the 25th Annual Meeting
specify the task being performed by a visual system; the level of
of the Cognitive Science Society. Available online at: http://csjarchive.
cogsci.rpi.edu/Proceedings/2003/mac/table.html.
algorithm, at which we specify just how the task is performed;
McClelland, J. L., and J. Bybee. 2007. Gradience of gradience: A reply to and the level of implementation, at which we specify the neural
Jackendof. Linguistic Review 24: 43755. basis of the system. It is usually assumed that there is some sense
210
Consciousness and Language
in which we ind the mechanisms of causation only at the level of reply to the argument. It is hard to vanquish the sense that no
implementation. Only at that level, the description of the biology amount of this kind of thing is going to be enough on its own to
of the system, do we ind interactions among physical units in constitute an understanding of language.
virtue of which the whole process counts as causal, even if there Searle (1990) argued that intentional states must, in principle,
is some secondary sense in which the higher-level descriptions be accessible to consciousness, that insofar as we have what he
of the system can be said to be of causal signiicance. Conscious called original intentionality rather than merely some observ-
experience as such does not enter into the characterization of er-relative ascription of intentionality, the state must be one of
any of these levels, and it is hard to see how its causal signii- which the subject could become aware. You might think this
cance could be recognized. Consciousness seems like a kind of suggests one diagnosis of what is missing in the Chinese Room,
miasma that might pervade the system, but of no signiicance for where there is no such thing as the possibility of anyone becom-
its functioning. ing aware of the content held by the messages in Chinese. So
he second line of argument for the elimination of conscious- these meanings cannot be the contents of anyones intentional
ness from the study of language is in the philosophical work of states. hat is, in Searles scenario, there is no such thing as any-
Gottlob Frege. Freges point was that no one can really know one grasping the meanings of Chinese statements.
what experiences someone else is having, and so if conscious- he problem with this diagnosis is that it is hard to see its con-
ness did play a key role in an understanding of language, com- nection with what Searle himself highlighted as the moral of the
munication would be impossible (1952). You would be trying Chinese Room: that no amount of syntactic manipulation can
to convey something whose signiicance ultimately had to do add up to a grasp of semantics, to a knowledge of what is being
with the nature of your experiences, and I would have no way of talked about and what is being said about it. It is hard to see how
knowing what you were talking about. So the shareable, public consciousness of the very intentional state itself does any work in
phenomena of communication and meaning have to be sharply providing one with knowledge of what it is about. We have so far
diferentiated from the realm of conscious experience since the no explanation of why consciousness should matter for the grasp
nature of an experience can be known only to the person hav- of semantics.
ing it. Freges point was pursued by the later Wittgenstein, who he traditional idea that consciousness plays a role in the
argued that meaning cannot be grounded in the contents of the grasp of meaning had been developed in a somewhat diferent
stream of consciousness. Ludwig Wittgenstein (1953) argued direction by Bertrand Russell (1912) and G. E. Moore (1903).
that the conception of meaning as grounded in the idiosyncratic Here, the emphasis is not on intentional states being conscious
experiences that each of us has ultimately makes no sense; the but, rather, on consciousness of the things about which one is
notion of experience that this conception uses is incoherent (see talking or thinking. hese are diferent phenomena. You might
private language argument). To characterize the meaning have a conscious belief about the numbers, without being con-
of a sign, you have to look at its role in communication, and there scious of the numbers. And you might have unconscious beliefs
is no special role here for talk about conscious states. about something of which you are conscious, for example, an
hese two lines of thought, from cognitive science and from unconscious belief that the man before you is dangerous. Russell
the work of Frege and Wittgenstein, are evidently forceful. here argued that all grasp of meaning must be provided by what he
is, nonetheless, a line of argument that asks: Suppose we do called acquaintance with the objects referred to: We must
eliminate consciousness from the study of language. Will we attach some meaning to the words we use, if we are to speak sig-
be left with enough that we can give a recognizable account of niicantly and not utter mere noise; and the meaning we attach
meaning and understanding? he argument was given its clas- to our words must be something with which we are acquainted
sic formulation by John Searle (1980). Searles question was (Russell 1912, 25). Acquaintance here is a matter of consciousness
put as a problem for computational approaches to meaning of the objects talked about: We shall say that we have acquain-
and understanding. Suppose, as a premise for reductio, that we tance with anything of which we are directly aware, without the
have a computer that understands a language, say Chinese. he intermediary of any process of inference or any knowledge of
operations of the computer can be described at Marrs levels of truth (ibid., 25). Russells idea here is evidently intuitive. he
computation, algorithm, and implementation. Fundamentally, problem is that he thought one could only be acquainted with
what we have is a machine capable of complex symbol manipu- ones own sense-data and, possibly, oneself. his greatly restricts
lation. Can this complex symbol manipulation add up to under- the possible topics of conversation and seems to run immediately
standing? Suppose we have an ordinary speaker of English into problems in explaining how meaning is communicable. he
who understands no Chinese. We put this speaker into a room natural proposal is that the range of acquaintance should be
with a massive rule book written in English that tells him what broader; it should include the physical objects around one. What
operations to perform with Chinese symbols when he encoun- is missing in the Chinese Room is any awareness of the objects
ters them. In efect, he mimics the operations of the computer. around the room.
We feed Chinese messages into the room, and he manipulates It is not immediately obvious how this way of inding a role for
the symbols and outputs fresh symbols in response. Could any consciousness is supposed to work. If consciousness is thought
amount of this kind of thing add up to an understanding of of as a matter of having sensations produced by the physical
Chinese? Well, plainly our English speaker has no understand- world, then how exactly could the having of those sensations be
ing of Chinese after all this. So where is an understanding of what enabled one to think about the objects around one? If, on
Chinese to be found? Despite an avalanche of literature written the other hand, consciousness of the objects around one is taken
in response, there has been no consensus about any particular to be a matter of thinking about those objects, then it simply
211
Consciousness and Language
presupposes capacities for thought and talk about those objects acquainted only with our own sensations. Once we have Moores
and cant play any role in explaining those capacities. picture, on which there is a generic relation of consciousness that
Still, there is something intuitive about the idea that what is each of us can stand in to ordinary objects and properties, we can
missing in the Chinese Room is some awareness of the things see that consciousness of those objects and properties may be
under discussion, that if that were provided, we might have the what makes it possible for us all to be talking and thinking about
foundation for an understanding of the language being used. In the same world. Suppose that, as Moore thought, all experiences
fact, Moore articulated the needed conception of awareness of are alike in respect of the generic relation of consciousness and
objects in a famous article, he refutation of idealism (1903). difer only in their objects. hen, whether your experiences are
Moore thought that in any analysis of sensation, we have to rec- the same as mine can be a question only about the objects or
ognize that there are two elements. here is, irst, the generic rela- properties of which we are conscious. And we can know that we
tion, consciousness of, in respect of which all sensations are are conscious of the very same ordinary physical things and their
alike. his is a relation between the subject and something else, properties.
the object of the sensation. he experiences of a single subject here is, indeed, a traditional metaphysical issue in the back-
are always alike in that it is always the same relation, conscious- ground at this point. In the seventeenth century, advocates of
ness, that is involved. he subjects experiences difer only in mathematical physics were at pains to stress that in the scien-
that the objects of diferent experiences may be diferent. So, for tiic image of the world, there were only atoms and the void; the
example, an experience of blueness and an experience of green- ordinary objects and properties about which we think and talk
ness difer in the things of which they are experiences (which had vanished. So the ordinary objects and properties could only
things they are encounters with); they are the same in that it is be projections of the mind onto the underlying scientiic reality.
the same generic relation of consciousness that is in question. Although this idea still enjoys some popularity today, its time has
So the picture is passed. We are now familiar with the alternative, that the world
can be described at many levels, and there is no particular rea-
Subject consciousness object
son to think that all but one of those levels must be projection of
he object varies, but the relation is always the same. Moores the mind. In giving the natural history of a species, for example,
point now is that there is no reason to think that the object must we are describing the world at a diferent level than the level of
always be some psychological state. He writes, I am as directly basic physics. But we are not describing a projection of the mind.
aware of the existence of material things in space as of my own Similarly, when we describe the world in terms of everyday tables
sensations (1903, 453). If we can indeed appeal to this generic and chairs and people, we are not describing the world in terms
relation of consciousness to external phenomena, doesnt this of basic physics. But we are not describing a projection of the
give us our account of what is needed to provide a semantic mind. We are describing objects that are there to be encountered
understanding of the terms we use? by us. And our awareness of them makes it possible for us to
For this approach to work, there has to be some response to think and talk about them.
the lines of argument noted earlier. First, if consciousness does Continued resistance to the idea that there is a role for con-
do any work in our understanding of language, then it must do sciousness in the study of language seems likely to stem from
some causal work. It must make some diference to what hap- something like the following idea. he causal processes in virtue
pens. But if we think of causality as a matter fundamentally of of which the words of a language constitute signs of the phenom-
the mechanistic interactions of physical particles, then how can ena around us may be mediated by perception, but they do not
consciousness be causally signiicant? At best, it will seem that thereby have to be mediated by experience. Linguistic terms are
an appeal to consciousness is an appeal to some ghostly quasi signs of the phenomena around us in something like the sense in
mechanism. However, it is arguable that the trouble here is not which smoke is a sign of ire. And there is nothing special about
the appeal to consciousness but the mechanistic conception of causation of the use of a sign that is mediated by experience; any
causation. he mechanistic conception is independently objec- kind of causation would do. he trouble with this line of thought
tionable (cf, e.g., Woodward 2003). Arguably, we should think of is that it has proven very diicult to try to explain what kind of
X causing Y as a matter of what would have happened to Y had causation is needed for language to represent the phenomena
things been diferent with X. And we can certainly make sense of around us (Millikan 1984). It may be that a causal link to the
the idea that things would have gone diferently had one not been object is needed only because the prototypical causal links are
conscious of this or that external phenomenon. So the idea that those in virtue of which we can be said to be aware of the object
the grasp of semantics is provided by experience of the things we we are talking about. It is, after all, diicult not to be struck by
talk about does not seem to face any intractable diiculty over a thought experiment even simpler than Searles. Suppose we
the possibility of a causal role for consciousness. he idea that had a robot that, though not conscious of its surroundings, had
consciousness as such must be a mere epiphenomenon has, in signiicant behavioral competencies in particular, it could pro-
any case, little intuitive appeal. duce language that sounded just like a humans production of
here were two classical objections to the idea that conscious- language and suppose it interacted freely with its environment.
ness plays a role in our understanding of language. he irst was It is impossible not to suspect that, just because this robot has no
the point about causality. he second objection was that this awareness of anything, it does not have the irst idea what it is
idea would make communication impossible; we would end up talking about. It has no grasp of the semantics of its language.
each talking about only our own sensations. But this objection
John Campbell
disappears when we leave behind Russells view that we can be
212
Consistency, Truth, and Paradox
WORKS CITED AND SUGGESTIONS FOR FURTHER READING The Meaningless Response
Frege, Gottlob. 1952. On sense and reference. In Translations from the One response to the paradox is that the italicized sentence is not
Philosophical Writings of Gottlob Frege, ed. P. T. Geach and Max Black, meaningful. Such a response, while natural, is implausible. Why
5778. Oxford: Blackwell. isnt the italicized sentence meaningful? One might point to the
Kripke, Saul. 1980. Naming and Necessity. Cambridge: Harvard University conspicuous circularity (in particular, self-reference) in the sen-
Press. tence. Self-reference, however, is hardly suicient for a meaning-
Locke, John. [1690] 1975. An Essay Concerning Human Understanding, less sentence. (Witness the sentence immediately preceding the
ed. P. H. Nidditch. Oxford: Oxford University Press. italicized sentence! Witness others, e.g., All sentences are sen-
Marr, David. 1982. Vision. San Francisco: W. H. Freeman.
tences, or etc.) Moreover, if no meaningless sentence is true, then
Millikan, Ruth. 1984. Language, hought and Other Biological Categories.
a fortiori the italicized sentence is not true. But, then, we are led to
accept that the italicized sentence is not true led to accept, appar-
Moore, G. E. 1903. he refutation of idealism. Mind 12: 43353.
Russell, Bertrand. 1912. he Problems of Philosophy. Oxford: Oxford ently, the italicized sentence itself. If we should reject any untrue
University Press. sentence, then we are now stuck. For this and other reasons, the
Searle, John. 1980. Minds, Brains and Programs. Behavioral and Brain meaningless thesis is not a plausible lesson to draw from the para-
Sciences 3: 41757 dox. (See suggestions for further reading for a note on Tarski.)
. 1990. Consciousness, explanatory inversion, and cognitive sci-
ence. Behavioral and Brain Sciences 13: 585642. Paracomplete Language?
Wittgenstein, Ludwig. 1953. Philosophical Investigations. New he principle of excluded middle (PEM, sometimes LEM for
York: Macmillan.
law) has it that, for any (declarative) sentence A, the disjunc-
Woodward, James. 2003. Making hings Happen: A heory of Causal
tion of A and its negation is logically true (or valid). his is usu-
Explanation. Oxford: Oxford University Press.
ally put by saying that, where V is disjunction and ~ negation, all
instances of AV~A are valid (in the given language). A paracom-
CONSISTENCY, TRUTH, AND PARADOX plete language is one in which PEM fails, that is, one in which
he aim of this entry is to convey a lavor of (some) logical and AV~A is not valid.
semantic issues that arise from so-called semantic paradoxes. A popular response to the Liar paradox (and related semantic
here is no aim, given space restrictions, to provide anything like paradoxes) is that it indicates the failure of PEM in English. After
a history or survey of the relevant issues. he focus, for simplic- all, premise (1) of the previous argument toward paradox relies
ity, will be on truth-theoretic paradox, although the same issue on PEM. Without PEM, the conclusion that some sentences are
the consistency of language arises with other semantic notions true and not true fails to ind a sound argument.
(denotation, satisfaction, etc.). Paracomplete responses to the paradox are the most com-
mon approaches today. One problem with them is similar to
The Liar Paradox the meaningless response: hey have trouble expressing their
he Liar paradox is one of the most familiar truth-theoretic para- position. After all, if (say) the Liar-instance of PEM fails, then,
doxes. It arises from a sentence that says (or may be used to say) presumably, neither the Liar nor its negation is true, in which
of itself only that it is not true. English seems to have such sen- case, the Liar (e.g., the italicized sentence) is not true. But the
tences. By way of illustration, consider the italicized sentence Liar says that its not true, and so, if it really is not true, it seems
immediately following this sentence, where CTP abbrevi- to speak truly.
ates consistency, truth, and paradox (this very encyclopedia
entry). Paraconsistent Language?
Another response, being increasingly discussed, is a so-called
he italicized sentence in CTP is not true.
paraconsistent thesis. A paraconsistent language is one in which
Consider the following argument. arbitrary B does not follow from arbitrary A and ~A. (B does fol-
low from A and ~A in classical logic, and in many paracomplete
(1) he italicized sentence in CTP is true or the italicized sen-
languages.) As a result, one can accept that some true sentences
tence in CTP is not true.
have a true negation; it doesnt follow, in a paraconsistent lan-
(2) If the italicized sentence in CTP is true, then the italicized guage, that all sentences are thereby true a genuine absurdity.
sentence in CTP is not true. Accordingly, the paraconsistentist may truly say that (for exam-
(3) If the italicized sentence in CTP is not true, then the itali- ple) the italicized sentence is not true, since, according to the
cized sentence in CTP is true. proposal, it is true and so is its negation!
(4) Hence, the italicized sentence in CTP is true and not
true. Closing Remarks and Curry
here are many other responses to paradox, and the sketches
Let us say that a language is inconsistent if theres some sentence
above only crudely gesture in various directions. here are
of the language such that both it and its negation are true. he
also other, perhaps more diicult, paradoxes. Currys paradox
argument, then, seems to indicate that English is inconsistent.
involves a conditional, and is largely independent of theories of
hat English is inconsistent is surprising, at the very least. (Many
negation. Currys paradox involves sentences such as
have said that the result is literally beyond belief the original
meaning of paradox.) * If the starred sentence in CTP is true, then everything is true.
213
Constituent Structure Constraints in Language Acquisition
A standard conditional proof (assume the antecedent, and derive Constituents are often assumed to be contiguous strings,
the consequent) seems to indicate that the starred sentence in but (4) undermines this assumption since a man whom nobody
CTP is true, in which case, its antecedent is true. But, then, we knew is a semantic unit.
have a true conditional with a true antecedent, and so, by modus
(4) A man walked in whom nobody knew.
ponens, detach the consequent: Everything is true!
Paracomplete and paraconsistent responses to the Liar ofer, his issue is even clearer in freer word order languages, such
in the irst instance, nonclassical theories of negation proposals as German or Warlpiri:
about negations logical behavior. Currys paradox requires care-
(5) Dem Jungen schenken wollte Peter das Buch.
ful theorizing about conditionals, and so calls for more work than
the boy give wanted Peter the book
is provided by standard theories of negation.
Peter wanted to give the boy the book.
J. C. Beall (Uszkoreit 1987, 159)
WORKS CITED AND SUGGESTIONS FOR FURTHER READING (6) Maliki-rli ka kurtu kartirti-rli paji-rni wita-ngku
dog-ERG PRES child tooth-ERG bite-NPST small-ERG
Beall, J. C., ed. 2008. Revenge of the Liar. Oxford: Oxford University Press.
he small dog is biting the child with its tooth.
his is a collection of very recent papers on truth and paradox.
(Simpson 1991, 261)
Martin, Robert., ed. 1984. Recent Essays on Truth and the Liar Paradox.
Oxford: Oxford University Press. his collects a variety of contempo- Responses to inconsistencies among constituency criteria
rary approaches to truth, including so-called revision theory and con- vary. transformational grammar assigns multiple con-
textual theories not mentioned here.
stituent structures to every sentence (Chomsky 1957). Other
Tarski, A. 1983. Logic, Semantics, Metamathematics: Papers from 1923 to
responses include abandoning constituent structure altogether
1938. Ed. John Corcoran. Indianapolis: Hackett Publishing. Tarskis
(Hudson 1984) or giving priority to some criteria (Pollard and
classic approach toward deining truth for a language remains inlu-
ential, but it is highly implausible as an account of truth for natural Sag 1994).
languages (as Tarski himself thought). Many constituents contain one word, called the head, that
determines its distributional properties, its internal struc-
ture, and its semantic content. Names like noun phrase, verb
CONSTITUENT STRUCTURE phrase, and so on relect assumptions about headedness. Noam
Chomsky (1970) proposed a universal schema for the internal
words in sentences cluster into groups called constituents (or
structure of constituents (X-BAR THEORY), varying only in the
phrases). For example, in (1), the dog, barked at a cat, at a cat,
position of the head within this schema. Subsequent literature
and a cat are constituents:
has explored constituent structures across languages.
(1) he dog barked at a cat. While both descriptive and theoretical syntacticians rely
heavily on the notion of constituency, the controversies raised
Among criteria used for constituency are the following:
here remain unresolved.
he words form a semantic unit; for example, the dog refers
homas Wasow
to a particular animal.
he words form a phonological unit, with larger prosodic
breaks at constituent boundaries.
he same string of words (or categories) appears in various Bray, Norman W., Geofrey J. Huck, and Almerindo E. Ojeda. 1987.
contexts; for example, article-noun can appear as subject, Discontinuous Constituency. New York: Academic Press.
verbal object, prepositional object, and elsewhere. Chomsky, Noam. 1957. Syntactic Structures. he Hague: Mouton.
A string can be replaced by a single word without radically . 1970. Remarks on nominalization. In Readings in English
Transformational Grammar, ed. Roderick A. Jacobs and Peter S.
changing the meaning or structure of the sentence; for exam-
Rosenbaum, 184221. Waltham, MA: Ginn.
ple, in (1), the dog could be replaced by it.
Hudson, Richard. 1984. Word Grammar. Oxford: Blackwell.
A string of words occurs as a conjunct in a coordinate struc-
Pollard, Carl, and Ivan A. Sag. 1994. Head-Driven Phrase Structure
ture, such as the dog and two gerbils. Grammar. Chicago: University of Chicago Press.
Unfortunately, such criteria sometimes diverge: In (2), large pro- Simpson, Jane. 1991. Warlpiri Morpho-Syntax: A Lexicalist Approach.
Dordrecht: Kluwer.
sodic breaks come before that, but the relative clauses introduced
Uszkoreit, Hans. 1987. Word Order and Constituent Structure in German.
by that are grouped semantically with the preceding nouns.
(2) his is the rat that ate the cheese that lay in the house that Jack
built.
CONSTRAINTS IN LANGUAGE ACQUISITION
In (3), a new person functions as a constituent syntactically
One of the goals of developmental psycholinguists is to
and prosodically but not semantically since it is contracting
understand how children acquire a system of linguistic knowl-
AIDS, not the person, that is new.
edge that is equivalent to that of adults in the same linguistic
(3) Every minute, a new person contracts AIDS. community. What kind of learning mechanisms ensure that
childrens hypotheses about the sequences of sounds they hear
214
Constraints in Language Acquisition
map onto individuals, events, and concepts in the same way as variety of examples: John said he hid the rabbit, When he was
they do for adults? Observations of childrens language develop- in the woods, John hid the rabbit. In such cases the constraint
ment reveal that acquisition occurs rapidly and that linguistic that blocks coreference fails to apply, and so the pronoun he is
knowledge is often achieved without the support of much, if any, free to refer to John. his illustrates how a single negative con-
relevant experience. On the basis of these observations, it has straint can describe the same set of facts that would require a
been proposed that childrens linguistic hypotheses are circum- host of positive rules, making for a more compact (parsimoni-
scribed by constraints (sometimes termed biases, assumptions, ous) grammar.
or principles). Because constraints are negative statements, they are likely
candidates for being innately speciied. Suppose a child lacked
The Role of Constraints the constraint on coreference. hen, the child would allow the
he role of constraints in language development is to prevent sentence He said John hid the rabbit to mean that John said that
children from forming misguided hypotheses about the forms he, John, hid the rabbit. How could the child learn that the sen-
and meanings of linguistic expressions. One way a hypothesis tence cannot mean this? One way to unlearn something is to
might be misguided is by being too broad, in the sense that it be exposed to so-called negative evidence, such as corrective
allows forms that are not in the language or extends the mean- feedback. As a matter of fact, however, parents do not provide
ings of linguistic expressions to include individuals, events, and consistent corrective feedback (negative evidence) even when
concepts that are not part of the corresponding meanings gener- children do make errors (Brown and Hanlon 1970; Morgan and
ated by other speakers of the language. If children make hypoth- Travis 1989). Without negative evidence, it is diicult to see how
eses that are too broad, in this sense, then they will make errors constraints could be learned. he alternative is to suppose that
that may prove to be diicult for them to recover from, thereby constraints are innately speciied, as part of human biology.
making convergence on the system of adult linguistic knowledge here are interesting empirical consequences of the innateness
slow and onerous (see learnability). he observation that hypothesis: Innate constraints are expected to emerge early
children rapidly master many complex linguistic facts suggests (despite their apparent complexity), and to be universal (see
that mistaken hypotheses are somehow avoided. his is where Crain and hornton 1998; innateness and innatism). he
constraints come in, by placing limits on the kinds of linguistic issue of innate speciicity versus learning crops up in discussions
hypotheses that children can entertain, so that real-world expe- of all kinds of constraints, including constraints on how children
rience will provide relevant data to conirm a hypothesis, or learn the syntactic frames in which verbs can appear, and how
redirect children to a new hypothesis. Although the existence of they learn word meanings. We discuss these topics in turn.
constraints is widely acknowledged, two issues about the nature
of constraints are subject to controversy. One is whether con- Constraints on Argument Structure
straints are learned or are innately speciied. A second issue is In some linguistic frameworks, human languages are character-
whether constraints are speciic to a single cognitive domain ized by constructions (see construction grammars). For
(e.g., language) or cut across several cognitive domains. his example, one set of verbs can appear in the dative construc-
issue is called domain speciicity. tion (e.g., gave, as in John gave a book to the museum). Many of
the same verbs can appear in another construction, called the
The Innateness of Constraints double-object construction (John gave the museum a book). In
Constraints irst assumed prominence in the late 1960s in dis- fact, changing the sentence from the dative to the double object
cussions of structure dependency (Chomsky 1971). Before con- does not change its basic meaning, or communicative function.
straints were introduced, grammars were systems of rules. Rules However, suppose a child formed the following generaliza-
are positive statements, indicating forms and meanings that are tion that all verbs that can appear in the dative construction
possible. In contrast to rules, constraints are often couched in can appear in the double-object construction, without a change
negative statements, dictating the forms and meanings that are in basic meaning. his generalization is lawed because there
not possible. Armed with constraints, then, learners are pre- are verbs that can appear in the dative but not in the double-
vented from producing illicit forms and from assigning illicit object construction. One exceptional verb is donate. Notice that
meanings. John donated a book to the museum is ine, but John donated
To take one example, there is a constraint that governs the the museum a book is not acceptable. To prevent learners from
reference of pronouns in sentences (see binding ). he con- forming illicit double-object constructions for verbs like donate,
straint applies in the sentence He said John hid the rabbit, where a constraint can be introduced. he constraint allows a struc-
the constraint dictates that the pronoun he and the name John tural option for a verb (say, the double-object word order) to
cannot refer to the same individual; they must have disjoint be entered into childrens grammars only if there is evidence
reference (Chomsky 1981). he disjoint reference of the pro- for that word order in the input to children. A proposal of this
noun he and the name John follows from the fact that the pro- kind is the unique argument-structure preference proposed by
noun is positioned higher in the constituent structure Martin Braine and Patricia Brooks (1995). As Pinker points out,
than the name (more precisely, the pronoun must c-command [T]he need for negative evidence in language acquisition can be
the name). Suppose the pronoun is lower in the constituent eliminated if the child knows that when he or she is faced with a
structure than the name (and, hence, does not c-command set of alternative structures fulilling the same function, only one
it). In such cases, the pronoun and the name should be able to of the structures is correct unless there is direct evidence that
corefer. Attesting to this is the acceptability of coreference in a more than one is necessary (1984, 113).
215
Constraints in Language Acquisition Construction Grammars
A similar uniqueness constraint has been advanced to explain is inclusive-or (including iii). For most sentences that children
how children avoid errors in learning the past tense form of a verb. experience, however, one or the other of the expressions sur-
he simple past tense rule add -ed (with its phonological vari- rounding or (its disjuncts) is false (excluding iii): Eat your veg-
ants) provides the right answer for many verbs, but this rule, too, gies, or youll have to go to bed, Is his name Ted, or Fred? his
is plagued with exceptions; it outputs the incorrect forms comed leads to the expectation that children should initially attribute
and bringed, for example, rather than came and brought. Gary the exclusive-or meaning to the word or. In fact, there is evidence
Marcus and colleagues (1992) proposed that the past tense forms that despite the input, children initially interpret or as inclusive-
of regular verbs (lifted, walked, showed) are formed by the sim- or as in classical logic. One possibility is that classical logic (or a
ple past tense rule, but irregular past tense forms (e.g., came and universal grammar) imposes constraints on childrens ini-
brought) must be learned from the input and entered as excep- tial interpretations of logical words. On this scenario, children
tions into the childs mental dictionary (cf. Pinker 1999). his is initially assign or the truth conditions associated with inclusive-
where a constraint comes into the story. Once an irregular past or, and later learn to limit these truth conditions to those associ-
tense form is entered in the lexical entry for a verb, a constraint ated with exclusive-or, based on principles of conversation. his
prevents the child from accessing the past tense rule to produce could be an example of an innate constraint but one that may or
regular forms like comed and bringed. Essentially, the constraint may not be domain speciic.
ensures that a lexical entry contains only a single past tense form To summarize, constraints typically direct children to begin with
unless there is direct evidence for more than one form. narrow hypotheses (i.e., to start small) and only broaden these
hypotheses if the input demands it. Constraints function quite dif-
Constraints on the Meanings of Words ferently in diferent parts of the linguistic system. Some constraints
Constraints also igure prominently in the literature on word are viable candidates for being domain speciic and innately speci-
learning (see lexical acquisition). One family of constraints ied, but others may be domain general and learned.
assists children in associating labels with objects in the world.
Stephen Crain, Rosalind hornton
he complexity of associating labels to objects was driven home
by W. V. O. Quine (1960), who invites us to consider how we
would interpret an expression used by a speaker of another lan-
guage in the presence of a passing rabbit. Suppose the speaker Bloom, Paul. 2000. How Children Learn the Meaning of Words.
utters gavagai as the rabbit passes by. Quine asks how we can Cambridge, MA: MIT Press.
be sure that gavagai is being used to refer to the rabbit and not Braine, Martin, and Patricia Brooks. 1995. Verb argument structure and
the problem of avoiding an overgeneral grammar. In Beyond Names
some property of the rabbit, such as food, or furry, white,
for hings: Young Childrens Acquisition of Verbs, ed. M. Tomasello and
or even undetached rabbit parts. Until we become native
W. E. Merriman, 35376. Hillsdale, NJ: Lawrence Erlbaum.
speakers of the language with gavagai, there is what Quine calls Brown, Roger, and Camille Hanlon. 1970. Derivational complexity and
an indeterminacy of translation between that language order of acquisition in child speech. In Cognition and the Development
and our own. of Language, ed. J. Hayes, 153. New York: Wiley.
Constraints on word learning are part of the solution to the Chomsky, Noam. 1971. Problems of Knowledge and Freedom. New
indeterminacy of translation. One constraint, called mutual exclu- York: Pantheon.
sivity, ensures that children assign only one label per category . 1981. Lectures in Government and Binding. Dordrecht, the
(e.g., Markman and Wachtel 1988). Mutual exclusivity guides Netherlands: Foris.
childrens initial hypotheses, but in many cases it is overridden as Crain, Stephen, and Rosalind hornton. 1998. Investigations in Universal
it becomes clear that it has exceptions. For example, a child who Grammar. Cambridge, MA: MIT Press.
Marcus, Gary, Steven Pinker, Michael Ullman, Michelle Hollander, T. John
has mastered dog as the label for the pet at home will soon hear
Rosen, and Fei Xu. 1992. Overregularization in Language Acquisition.
that the dog is also labeled animal, so both labels will be incorpo-
Monographs of the Society for Research in Child Development, no. 57.
rated into the childs mental dictionary. Some researchers sup- Chicago: University of Chicago Press.
pose that mutual exclusivity is domain speciic, applying just to Markman, Ellen, and G. Wachtel. 1988. Childrens use of mutual exclu-
word learning. An alternative claim is advanced by Paul Bloom sivity to constrain the meanings of words. Cognitive Psychology
(2000), who proposes that inferences about the communica- 20: 12157.
tive intentions of others, and not mutual exclusivity, are used Morgan, James, and Lisa Travis. 1989. Limits on negative information in
to derive childrens associations of labels to unfamiliar objects. language input. Journal of Child Language 16: 53152.
Such inferences are likely to cross the boundaries of cognitive Pinker, Steven. 1984. Language Learnability and Language Development.
domains, including logical reasoning in addition to language. Cambridge: Harvard University Press.
Labeling objects is just one aspect of word learning. Children . 1999. Words and Rules. New York: Basic Books.
Quine, W. V. O. 1960. Word and Object. Cambridge, MA: MIT Press.
also have to achieve the mapping for abstract words and for func-
tion words (called closed class vocabulary items) and not just for
nouns and verbs (called open class vocabulary items). In these
CONSTRUCTION GRAMMARS
cases, the environmental input clearly has less impact. Consider,
for example, how a child learns the meaning of the logical word here has been a broad convergence in many quarters in recent
or (see semantics, acquisition of). In classical logic, a state- years toward a view of grammar in which constructions play a cen-
ment of the form a or b is true if (i) only A is true, (ii) only B is tral role; approaches that share this view are here referred to as
true, and (iii) both A and B are true. So, the or of classical logic construction grammars. Construction grammars view linguistic
216
Construction Grammars
Table 1. Examples of constructions, varying in size and complexity (3) An emphasis is placed on psychological validity. A lin-
guistic theory must interface naturally with what we know
Simple words (illed or partially e.g., the, theory, blue, re-V about acquisition, processing, and historical change.
illed)
(4) An emphasis is placed on subtle aspects of the way we
Complex word e.g., sh taco, rehouse construe of events and states of afairs.
Idiom (illed) e.g., cock and bull story (5) Constructions are understood to be learned on the basis
Idiom (partially illed) e.g., the apple of of the input and general cognitive mechanisms and are
<someones> eye expected to vary to some degree cross-linguistically.
Ditransitive (double object) Subj V Obj1 Obj2 (e.g., She (6) Cross-linguistic generalizations are explained by appeal
construction gave him a kiss; He baked to general cognitive constraints, together with the functions
her an apple pie.) of the constructions involved.
Passive Subj aux VPpast participle (7) Language-speciic generalizations across construc-
(PPby) tions are captured via inheritance networks much like those
(e.g., The man was hit by a that have long been posited to capture our nonlinguistic
meteor.) knowledge.
(8) he totality of our knowledge of language is captured by a
network of constructions: a construct-i-con.
patterns of varying complexity as instances of conventional pair-
ings of form and meaning. hese pairings include words (with Constructionists have traditionally emphasized unusual
or without open slots), idioms (with or without open slots), and phrasal patterns such as those in Table 2. As an example of an
fully general abstract phrasal patterns (with or without any ixed unusual pattern, consider the Covariational Conditional construc-
words). Examples of constructions at varying degrees of com- tion in Table 2. he construction is interpreted as involving an inde-
plexity are given in Table l. (See Goldberg 2006 for arguments pendent variable (identiied by the irst phrase) and a dependent
that the shared formal properties and closely related semantics variable (identiied by the second phrase). he normally occurs
of dative and benefactive ditransitives warrant treating them as with a head noun, but in this construction it requires a compara-
instances of the same construction.) tive phrase. he two major phrases resist classiication as either
Any linguistic pattern is recognized as a construction so long noun phrases or clauses. he requirement that two phrases of this
as some aspect of its form or some aspect of its function is not type be juxtaposed is another nonpredictable aspect of the pat-
strictly predictable from its component parts or from other con- tern. Because the pattern is not strictly predictable, a construction
structions recognized to exist. In addition, most constructionists is posited that speciies the particular form and function involved.
argue that patterns are stored even if they are fully predictable as Research has revealed subtle syntactic and semantic prop-
long as they occur with suicient frequency. erties of this and other construction types in Table 2 (e.g.,
he emphasis on the pairing of function with form is what Culicover 1999; Jackendof 2002; Lambrecht 1990; Michaelis and
sets construction grammars apart from both other generative Lambrecht 1996; Williams 1994). he existence of these clearly
approaches (which tend to downplay function) and other func- learned, partially productive, syntactically constrained patterns
tional approaches (which tend to downplay form). At the same leads to the implication that much more of grammar may be
time, constructionists bring these two approaches together in learned on the basis of the input than had been generally rec-
some ways. hey recognize the importance of two major ques- ognized by the generative approach. hat is, if these unusual
tions that have been brought to the fore by generative grammar- patterns could be learned, why should we assume that the more
ians (see generative grammar): frequent, regular patterns could not possibly be?
In fact, constructionists have ofered construction-based
(1) How can all of the complexities of language be learned such accounts of many of the more core aspects of grammar,
that we are able to produce an open-ended set of utterances? including argument structure (e.g., Goldberg 1995), control
(2) How are cross-linguistic generalizations (and language phenomena (Culicover and Jackendof 2005), aspectual (see
internal generalizations) accounted for? aspect) interpretation (Michaelis 2004), raising (Langacker
1992), existential constructions (Lakof 1987), and island con-
Moreover, constructionists recognize that the answers to these
straints (Deane 1991; Goldberg 2006).
questions rely heavily on traditional functionalist methodology
Constructionists aim to provide accounts of such phenom-
and indings that emphasize the usage-based nature of lan-
ena without appealing to underlying levels of representation,
guage and the importance of general cognitive processes.
traces, or phonetically null functional projections. hat is, a
Construction grammars, broadly conceived, each share at
WSWYG approach to form is adopted. Beyond methodological
least most of the basic tenets listed that follow:
parsimony, this is due to the fact that there generally exist subtle
(1) All levels of description are understood to involve form functional diferences between surface forms (Goldberg 2002).
function pairings, including morphemes or words, idioms, par- herefore, surface forms are generated directly. Note that surface
tially lexically illed patterns, and fully abstract phrasal patterns. form need not specify a particular word order, nor even par-
(2) A what you see is what you get (WSWYG) approach to ticular grammatical categories, although there are constructions
syntactic form is adopted. that do specify these features.
217
Construction Grammars
C. J. Fillmore and P. Kay irst coined the term construction

Table 2. Examples of partially idiosyncratic constructions grammar. heir early work on idioms and idiomatic phrasal pat-
terns such as let alone, even, and Whats X doing Y? laid the foun-
Mad Magazine construction Him, a doctor?!
dation for many of the variations of construction grammar that
N P N construction time after time; day after have since developed. Yet their version, sign-based construction
day
grammar (SCxG), has developed quite distinctly from the other
Time away construction Dancin the night away construction grammars. Key diferences include the fact that
Whats X doingY?! Whats that y doing in my SCxG is not uniformly usage-based nor does it generally seek
soup?! motivation for the relationship between form and function.
Nominal extraposition construction Its amazing the
difference! A Comparison with Mainstream Generative Grammar
Proposals
Enough already construction Enough with the
Certain mainstream generative grammar frameworks share the
examples!
basic idea that some type of meaning is directly associated with
Stranded preposition construction Who did he give that to? some type of form, independently of particular lexical items (cf.
Covariational conditional The more you have, the also Borer 1994, 2003; Hale and Keyser 1997; Marantz 1997). To
construction more you want the extent that syntax plays a role in contentful meaning, these
other approaches are constructionist, and they are occasionally
Constructions are combined freely to form actual expressions referred to that way in the literature. However, the approaches
as long as they can be construed as not being in conlict. hat are fundamentally diferent from the type of constructionist
is, an actual expression typically involves the combination of at approaches just outlined. For example, these mainstream gen-
least a half dozen diferent constructions. he observation that erative accounts do not adopt a nonderivational (monostratal)
language has an ininite creative potential is accounted for, then, approach to syntax, but appeal instead to underlying levels of
by the free combination of constructions. representation in which constituents (or entities that are never
Most constructionist approaches aim to provide motivation for realized) move around abstract trees. Moreover, these accounts
each construction posited. Motivation aims to explain why it is at emphasize rough paraphrases instead of speakers detailed con-
least possible and at best natural that this particular formmean- struals of situations. Empirical challenges faced by the accounts
ing correspondence should exist in a given language. Motivation are discussed in some detail in Goldberg (2006).
is distinct from prediction: Recognizing the motivation for a con- Adele E. Goldberg
struction does not entail that the construction must exist in that
language or in any language. It simply explains why the construc-
tion makes sense or is natural (cf. Haiman 1985; Lakof 1987; WORKS CITED AND SUGGESTIONS FOR FURTHER READING
Goldberg 1995). Functional and historical generalizations count Bencini, G. M. L., and A. E. Goldberg. 2000. he contribution of argument
as explanations, but they are not predictive in the strict sense, just structure constructions to sentence meaning. Journal of Memory and
as parallel generalizations in biology are not predictive. hat is, Language 43: 64051.
language, like biological evolution, is contingent, not determin- Bergen, B. 2004. he psychological reality of phonaesthemes. Language
istic. Just as is the case with species, particular constructions are 8.2: 290311.
the way they are not because they have to be that way but because Bergen, B., and N. Chang. 2005. Embodied construction grammar
their phylogenetic and ontogenetic evolution was motivated by in simulation-based language understanding. In Construction
general forces. Grammar(s): Cognitive and Cross-Language Dimensions, ed. Jan-Ola
Ostman and Mirjam Fried, 14790. Philadelphia: John Benjamins.
Borer, H. 1994. he projection of arguments. In University of
Varieties of Construction Grammars Massachusetts Occasional Papers in Linguistics. Vol. 17. Ed. E. Benedicto
here are several variations of construction grammars, including and J. Runner, 1947. Amherst: GLSA, University of Massachusetts
the following: Press.
. 2003. Exo-skeletal vs. endo-skeletal explanations. In he Nature
(1) SCxG: sign-based construction grammar (Fillmore 1999;
of Explanation in Linguistic heory, ed. J. Moore, and M. Polinsky,
Fillmore, Kay, and OConnor 1988; Kay 2002; Kay and Fillmore 3167. Chicago: CSLI and University of Chicago Press.
1999; Sag, Wasow, and Bender 2003) Croft, W. 2001. Radical Construction Grammar: Oxford: Oxford University
(2) CG: cognitive grammar (e.g., Langacker 1987a, Press.
1987b, 1988, 1991, 2003) Culicover, P. W. 1999. Syntactic Nuts: Hard Cases, Syntactic heory and
Language Acquisition. Oxford: Oxford University Press.
(3) RCxG: radical construction grammar (e.g., Croft 2001)
Culicover, P. W., and R. Jackendof. 2005. Syntax Made Simple(R).
(4) ECxG: embodied construction grammar (e.g., Bergen Oxford: Oxford University Press.
and Chang 2005) Deane, Paul. 1991. Limits to attention: A cognitive theory of island phe-
nomena. Cognitive Linguistics 2: 163.
(5) CCxG: cognitive construction grammar (e.g., Bencini and
Fillmore, C. J. 1999. Inversion and constructional inheritance. In
Goldberg 2000; Goldberg 1995, 2006; Lakof, 1987)
Lexical and Constructional Aspects of Linguistic Explanation, ed. Gert
(6) Fluid constructon grammar (e.g., Steels and DeBeule 2006) Webelhuth, Jean-Pierre Koenig, and Andreas Kathol, 11328. Stanford,
(7) Simpler syntax (Culicover and Jackendof 2005) CA: CSLI.
218
Contact, Language
Fillmore, C. J., P. Kay, and M. C. OConnor. 1988. Regularity and idioma- languages, as well as varieties of the same language, since the lin-
ticity in grammatical constructions: he case of let alone. Language guistic efects of either sort of contact are similar.
64: 50138. he social contexts in which language contact occurs are var-
Goldberg, A. E. 1995. Constructions: A Construction Grammar Approach ied and have been common throughout human history. Most of
to Argument Structure. Chicago: Chicago University Press.
us, even if we are ourselves monolingual, interact with people
. 2002. Surface generalizations: An alternative to alternations.
who are bilingual or bidialectal and, thus, are participants in
Cognitive Linguistics 13.4: 32756.
. 2006. Constructions at Work: he nature of generalization in lan-
language contact situations whether we are aware of it or not.
guage. Oxford: Oxford University Press. Language contact can result in a wide variety of possible
Haiman, John. 1985. Iconicity in Syntax. Cambridge: Cambridge outcomes for the language varieties involved, ranging from no
University Press. discernible efects to the borrowing of a few vocabulary items to
Hale, K., and J. Keyser. 1997. On the complex nature of simple predi- profound structural change. It may even result in the creation of
cators. In Complex Predicates, ed. A. Alsina, J. Bresnan, and P. Sells, entirely new language varieties.
2965. Stanford, CA: CSLI.
Jackendof, R. 2002. Foundations of Language. Oxford: Oxford University Outcomes of Language Contact
Press. Language contact occurs when speakers of diferent language
Kay, P. 2002. English subjectless tagged sentences. Language
varieties communicate. In general, for any two varieties of
78.3: 45381.
speech, we can say that the relationship between the variet-
Kay, P., and C. J. Fillmore. 1999. Grammatical constructions and lin-
guistic generalizations: he whats X doing Y? construction. Language
ies is either one of autonomy or heteronomy. To say that they
75.1: 134. are autonomous means that the two varieties are independent
Lakof, G. 1987. Women, Fire, and Dangerous hings: What Categories socially and politically: English and Mandarin are autonomous
Reveal about the Mind. Chicago: University of Chicago Press. with respect to each other. However, to say that a variety is het-
Lambrecht, K. 1990. What, me worry? Mad Magazine sentences revis- eronomous with respect to another means that it is in some way
ited. Proceedings of the 16th Annual Meeting of the Berkeley Linguistics dependent or connected to it socially, politically, or both. So,
Society. Berkeley, CA: Berkeley Linguistics Society, 21528. regional dialects of English are heteronomous with respect to
Langacker, Ronald. 1987a. Foundations of Cognitive Grammar. Vol 1. standard English and one another.
Stanford, CA: Stanford University Press.
Heteronomy is not the same as mutual intelligibility: Standard
. 1987b. Nouns and verbs. Language 63: 5394.
Czech and standard Slovak are more or less mutually intelligible
. 1988. A usage-based model. In Topics in Cognitive Linguistics,
varieties, but they are deinitely autonomous. Local Italian dia-
ed B. Rudzka-Ostyn, 12761. Philadelphia: John Benjamins.
. 1991. Foundations of Cognitive Grammar. Vol. 2. Stanford, lects may not be mutually intelligible but are heteronomous with
CA: Stanford University Press. respect to standard Italian.
. 1992. Reference point constructions. Cognitive Linguistics 4: 139. Heteronomous varieties frequently come into contact: in
. 2003. Construction grammars: Cognitive, radical and less so. national institutional settings, as a result of migration or trade,
Paper presented at the International Cognitive Linguistics Conference, and in colonial tabula rasa situations (i.e., where there
Logrono, Spain. were no varieties of the language spoken in the region before).
Marantz, A. 1997. No escape from syntax: Dont try morphological anal- One result of such contact may be the creation of a koin. he
ysis in the privacy of your own lexicon. In University of Pennsylvania original koin (the Koin) was a variety of Ancient Greek that
Working Papers in Linguistics. Vol. 4.2. Ed. A. Dimitriadis and L. Siegel,
had come to supplant other, local Greek dialects during the
20125. Philadelphia: University of Pennsylvania Press.
Hellenistic and Roman periods. his dialect was based mostly
Michaelis, L. 2004. Implicit and explicit type-shifting in construction
on the Athenian dialect but included many elements from other
grammar. Cognitive Linguistics 15: 167.
Michaelis, L. A., and K. Lambrecht. 1996. he exclamative sentence type dialects and involved a certain amount of simpliication: the
in English. In Conceptual Structure, Discourse and Language, ed A. E. disappearance of irregularities in favor of structurally regular
Goldberg, 37598. Stanford, CA: CSLI. forms.
Sag, I. A., T. Wasow, and E. M. Bender. 2003. Syntactic heory: A Formal he term koin has come to be used for any variety that sup-
Introduction. Stanford, CA: Center for the Study of Language and plants heteronomous varieties and that serves as a means of
Information. intercommunication between speakers of these varieties. his
Saussure, F. de. [1916] 1959. Course in General Linguistics. Trans W. comes about as a result of dialect leveling: the loss of distinctive
Baskin. New York: Philosophical Library. features in favor of features with a high degree of mutual intel-
Steels, L., and J. DeBeule. 2006. A (very) brief introduction to luid
ligibility and/or high prestige. Sometimes this involves a fair
construction grammar. Paper presented at the hird International
amount of dialect mixture, though this neednt be the case. Where
Workshop on Scalable Natural Language Understanding, New York
dialect mixture is involved, the process of creating the koin can
City.
Williams, E. 1994. Remarks on lexical knowledge. Lingua 92: 734. be referred to as koinization. Koinization has probably been a
fairly common feature of the history of languages.
Standard dialects these days typically ill the role of a koin.
CONTACT, LANGUAGE
Standard dialects sometimes arise spontaneously through a pro-
Language contact occurs when individuals who regularly use cess of koinization; sometimes they are created by a deliberate
diferent language varieties communicate with each other. mixture of varieties through the actions of a language academy or
Language variety should be understood in a very broad sense, a government commission (language policy), and sometimes
including varieties that are traditionally considered to be diferent they are a regional or social variety selected for the purpose.
219
Contact, Language
Whether varieties are autonomous or heteronomous, linguis- grammar of the interlanguage will eventually become identical
tic elements of various sorts may be exchanged between them. to (or nearly identical to) that of the target language. In most
his exchange is referred to as borrowing. Virtually any linguistic cases, however, the interlanguage difers from the target lan-
feature can be borrowed: vocabulary, grammatical morphemes, guage in signiicant ways, relecting imperfect learning. In many
grammatical constructions, semantic relations, sounds, and so cases, the interlanguage may be a very simple, rudimentary sys-
on. Vocabulary borrowing is the most common, but its easy to tem, consisting of a few vocabulary items and simple phrases if
ind instances of borrowing of virtually any grammatical element. the need for interlanguage-based communication is restricted
Borrowing, particularly of vocabulary, can occur even when con- to a narrow range of activities, such as simple commercial or
tact between speakers of the source and target languages is fairly workplace transactions. When such interlanguages are used by
casual; when languages coexist in bilingual situations, however, a number of people and stabilize to a degree, they are usually
really intensive borrowing can take place. referred to as pidgins.
bilingualism implies communicative skill in two auton- Pidgins typically take their vocabulary primarily from a single
omous varieties. (he term bidialectalism refers to commu- language, referred to as the lexiier language. English is the lexi-
nicative skill in two heteronomous varieties.) When bilingual ier language for the various pidgin Englishes that developed in
situations persist over a long period, convergence may take many parts of the world during the colonial period. Pidgins are
place: he two varieties may converge toward each other, but characterized by a very large degree of simpliication vis--vis the
more usually one variety converges toward another, relecting lexiier language: Even when most of the vocabulary derives from
local political or social dominance. If the varieties are suiciently a given language, the grammatical complexities of the language
diferent to begin with, and the convergence is extensive, the are seldom found in the pidgin, which instead has a very simple
result may be metatypy, a complete change in language type. grammatical structure.
Cases of metatypy have not been commonly attested, but they Of the set of stable interlanguages that can arise in contact
certainly exist: Amharic, a Semitic language spoken in Ethiopia, situations, only those with a considerable degree of grammati-
has a grammar that is rather diferent from its Semitic kin due cal simpliication vis--vis the lexiier language are referred to as
to metatypic convergence with the Cushitic languages spoken in pidgins. When the language varieties spoken by the people cre-
the region prior to the arrival of Semitic languages. ating the interlanguage are similar particularly when they are
At the beginning, bilingual situations are usually character- close enough to be heteronomous one would never describe
ized by varying degrees of imperfect learning: situations where the interlanguage as a pidgin, even when there is some simpli-
in learning variety B, speakers of variety A learn B imperfectly, ication involved: he task of learning the lexiier variety would
making various sorts of grammatical errors in B, speaking B with be suiciently easy so that the radical simpliication associated
an accent, and so on. hese linguistic features may become per- with pidgins would not occur. (A koin may be the product of
manent components of the bilingual communitys command of simpliication, but it is not a pidgin.) Similarly, with autonomous
language B and may ultimately afect the speech of others speak- varieties that are similar, one would not describe stable interlan-
ing language B. If the bilingual community undergoes language guages as pidgins: An interlanguage developed between speakers
shift if speakers of A cease to speak A and adopt B instead the of Romance languages or Slavic languages would not be consid-
results of imperfect learning are referred to as substratic inlu- ered a pidgin. For a variety to be described as a pidgin, the native
ences on the B variety they speak. he particular sort of English varieties spoken by the people involved in its creation must be
spoken in Ireland, for example, is often said to be the result, at suiciently diferent to make learning the other variety relatively
least in part, of this sort of substratic inluence, in this case from diicult, though the degree of imperfect learning associated with
the Irish language to the English now spoken in the country. pidgins relects restricted opportunities for learning as well.
Superstratic inluence is also possible. A superstratic language Further, it seems that the conditions for the creation of stable
is one with high prestige, either in all formal contexts or in some pidgins as opposed to the creation of bilingual situations
speciied domains. Classical languages (Latin, classical Greek, seem to be the product of economic systems associated with
Sanskrit, Koranic Arabic, classical Tamil, etc.) can serve as super- states, suggesting that they may have been rare or nonexistent in
strata, but so can living languages when these languages have ancient times. he colonial period was an especially fertile time
suicient prestige. French has served as a superstratic language for the creation of pidgins.
for Europe, a role now illed largely by English. Chinese served Stable pidgins typically originate in social situations where
as a superstratum for Japanese, Arabic for much of the Moslem they are used only in a very restricted set of social situations, for
world, Russian for other languages within the old Soviet Union, example in commercial transactions. But if they persist over a
and so on. Superstratic languages do not require many speakers long period, they may come to be used in a wide range of social
within communities for their inluence to be strong: In the last contexts, in which case the pidgin may acquire a relatively large
few centuries, only a minority of people in Europe learned Latin vocabulary and a relatively large, stable set of grammatical con-
to any signiicant degree, yet Latin inluence on the languages structions. he pidgin language Tok Pisin, the national language
of Europe remains profound and extends beyond the extensive of Papua New Guinea, is such a language. Such pidgins may
borrowing of vocabulary to the borrowing of syntactic construc- come to be acquired natively by children as their irst language.
tions, rhetorical strategies, and so on. In many situations, this is a gradual process since pidgins arise
he discipline of second language acquisition refers in multilingual situations, and therefore there are other lan-
to linguistic systems that arise in the course of learning another guages that could serve as the native languages for some or even
language as an adult as an interlanguage. In the ideal case, the all of the children in the community. For some communities,
220
Contact, Language Context and Co-Text
however, the process may be fairly abrupt: most adult members CONTEXT AND CO-TEXT
of the community speak the pidgin to one another and children
he term context is used to refer very generally to the extralin-
grow up learning only the pidgin. Creolization refers to a social
guistic circumstances in which language is produced as a text
process by which a stable pidgin acquires native speakers; the
and to which the text is related the setting in which the lan-
result of creolization is a creole.
guage is used, for example, and the participants involved. But
Speakers of pidgins and creoles may remain in contact with
such circumstances are many and indeterminate, and only when
native speakers of the lexiier language, or they may not. If they
they relate to the text in the realization of meaning do they count
remain (or come to be) in contact with such speakers, and if the
as context. Many circumstantial features may have no bearing
lexiier language is a prestige language for instance, a language
whatsoever on the meaning that is intended by a text or how it
of administration and/or of the social elite the pidgin or creole
is interpreted. he question is: How does one establish which
may become heteronomous with the lexiier language. In such
attendant circumstances are contextually signiicant and which
cases, the pidgin or creole may borrow vocabulary and/or gram-
are not?
matical constructions from the lexiier language, becoming more
he importance of taking context into account as a matter
like it in the process. his, to some degree, is the reverse of the
of principle in the deinition of meaning has been long estab-
pidginization process. If, in particular, the lexiier language is the
lished. Early in the last century, the anthropologist Bronislaw
oicial language, serving as the standard dialect in the region
Malinowski argued that an understanding of the way in which
where the pidgin or creole is spoken, the pidgin or creole may be
language functions as a mode of action depends on establishing
perceived by its speakers and by the local authorities as a dialect
a relationship with its context of situation (Malinowski 1923).
of the lexiier language, and a process of dialect leveling may take
Subsequently, the linguist J. R Firth reformulated the notion as a
place similar to what was described under the label of koiniza-
suitable schematic construct to apply to language events (1957,
tion. When this happens, we may ind that there is a continuum
182) his construct makes mention of the relevant features of
of speech varieties in the community, ranging from relatively
participants and the relevant objects, but leaves unanswered
pure versions of the pidgin or creole to forms of the language
the key question of how relevance is to be determined.
that resemble (or are identical with) the standard dialect. Such
Context is a selection of circumstantial features that are rec-
situations, when they afect creoles, are referred to as post-creole
ognized by the language user as relevant in that they key into
continua. he pidgin or creole may eventually lose its distinc-
text to achieve communication. One set of criteria for deter-
tive status as speakers come to speak versions of the language
mining relevance can be found in the conditions for realizing
that are no longer distinguished by the results of a pidginization
pragmatic meaning as proposed in the theory of speech-acts
process. his leveling process, when it afects creoles, is referred
(Searle 1969). A piece of text, the uttering of a particular linguistic
to as decreolization. A variety that descends from a creole but
expression, for example, can be said to realize a particular illo-
has undergone extensive decreolization is referred to as a post-
creole.
cutionary force to the extent that circumstantial features are
taken to satisfy the conditions that deine the illocution. hus, the
he processes that we have been discussing pidginization
illocutions of threat and promise have the conditions in common
and dialect leveling can take place in tandem. hat is, its pos-
of reference to a future event and one controlled by the irst per-
sible, for example, that a language could acquire new native
son, but they difer as to whether the event has a negative efect
speakers via pidginization and subsequent creolization, while at
(threat) or positive efect (promise) on the second person. Hence,
the same time undergoing dialect leveling (koinization) in favor
the utterance I will call again tomorrow could be interpreted as
of the creole-based dialects. his has happened, more often than
either a threat or a promise, depending on the contextual factors
is usually acknowledged, in the historical development of lan-
of who said it to whom and in what circumstances.
guages. Languages that have undergone this sort of development
he recognition of relevance comes about because language
can be referred to as creoloids. Since the social dynamics of lan-
users are familiar with such conditions as part of their extralin-
guage shift situations can vary considerably, we have to examine
guistic sociocultural knowledge. But familiarity with illocution-
each instance of language shift to determine whether or not its
ary conditions is only one kind of sociocultural knowledge that is
result is a creoloid. Afrikaans, the Germanic language spoken in
brought to bear in the recognition of contextual signiicance. he
South Africa, is often described as a creoloid.
world we live in is made familiar by projecting two kinds of order
Michael Noonan onto it: linguistic encoding, on the one hand, and sociocultural
convention, on the other. Communication involves an interac-
WORKS CITED AND SUGGESTIONS FOR FURTHER READING tion between them: We make texts with the irst with a view to
homason, Sarah Grey. 2001. Language Contact: An Introduction. keying them into the second. Sociocultural conventions take the
Washington, DC: Georgetown University Press. form of schemata (see schema): customary representations of
homason, Sarah Grey, and Terrence Kaufman. 1988. Language Contact, reality, in various degrees culture-speciic, modes of behavior
Creolization, and Genetic Linguistics. Berkeley: University of California and thought that are socially established as normal. Contexts are
Press. features of a particular situation that are identiied as instantia-
Trudgill, Peter. 2000. Sociolinguistics. London: Penguin Books. tions of these abstract conigurations of experience that are real-
Weinreich, Uriel. [1953] 1968. Languages in Contact. he ized and recognized in text. hese schematic constructs are not,
Hague: Mouton.
however, static and ixed since once they are engaged they can
Winford, Donald. 2003. An Introduction to Contact Linguistics.
be extended and changed. Although communication depends
Oxford: Blackwell.
221
Context and Co-Text Control Structures
on some schematic convergence to get of the ground at all, it can Halliday, M. A. K, and R. Hasan. 1976. Cohesion in English.
then develop its own creative momentum. London: Longman.
Although context is generally understood as an extratextual Labov, W. 1972. Sociolinguistic Patterns. Pittsburgh: University of
phenomenon, apart from text but a crucial concomitant to it, the Pennsylvania Press.
Malinowski B. 1923. he problem of meaning in primitive languages. In
term is also often used, misleadingly, to refer to the intratextual
he Meaning of Meaning, ed. C. K. Ogden and I. A.Richards, 296336.
relations that linguistic elements contract with each other within
London: Routledge and Kegan Paul.
text. An alternative, and preferable, term for this is co-text. Schrifrin, D. 1994. Approaches to Discourse. Oxford: Blackwell.
Co-textual relations occur between linguistic elements at dif- Searle, J. R. 1969. Speech Acts. Cambridge: Cambridge University Press.
ferent levels. William Labov shows the tendency for segments of Sinclair, J. M. 1991. Corpus, Concordance, Collocation. Oxford: Oxford
spoken utterance at the morpho-phonemic level, for example, to University Press.
vary according to the phonetic and morphological environ- Widdowson, H. G. 2004. Text, Context, Pretext: Critical Issues in Discourse
ment in which they co-textually occur, and he is able to specify Analysis. Oxford: Blackwell.
variable rules for their occurrence. hese are distinct from other Widdowson, H. G.. 2007. Discourse Analysis. Oxford Introductions to
variable rules that Labov postulates, rules that have to do with Language Study. Oxford: Oxford University Press.
contextually motivated variation where speakers adjust their Wray, A. 2002. Formulaic Language and the Lexicon. Cambridge:
pronunciation in relatively formal situations in approximation
to prestige social norms (Labov 1972). Co-textual variation is
a property of text and in itself has no social signiicance as dis- CONTROL STRUCTURES
course. Contextual variation, on the other hand, decidedly does.
In many languages, when the subject of an embedded clause is
Co-textual relations at the lexico-grammatical level have
identical in reference (coreferential) with some noun phrase in
attracted particular interest over recent years in the ield of
the main clause, the former may (or must) be left syntactically
corpus linguistics. Computers now provide the means for
unexpressed. hus, sentence (1a) can be paraphrased as (1b),
collecting and analyzing vast quantities of text and for identify-
where the understood subject of the embedded clause corre-
ing in detail the regularities of co-textual patterning that occur
sponds to the pronoun he in (1a). his unexpressed subject is
(Sinclair 1991). One such pattern is that of collocation, the regu-
standardly notated by PRO, as represented in (1c). he referen-
lar occurrence of one word in the environment of another. But
tial dependence of PRO on George is expressed by the sharing of
co-textual patterning extends beyond the appearance of pairs of
an index (here, subscript i). his dependence is called control
words in juxtaposition and is also manifested in word sequences
(originally equi-NP deletion, see Rosenbaum 1967).
of relative degrees of ixity. he identiication of such co-textual
relations has led to the recognition that text is essentially formu- (1) a. George hoped that he would meet the Pope.
laic in structure (Wray 2002). b. George hoped to meet the Pope.
Whereas contextual relations bring about pragmatic c. Georgei hoped [PROi to meet the Pope].
efects, co-textual relations of this lexico-grammatical kind have
Universally, PRO can only occur in a subject position.
semantic consequences to the extent that the mutual condi-
Furthermore, in most languages, PRO can only occur in non-
tioning of meaning across co-occurring words becomes estab-
inite clauses. However, the latter is not a universal condition,
lished as a conventional encoding. Another kind of semantic
as controlled clauses in the Balkan languages, for example, are
linking is brought about by the co-textual function of cohesion
systematically inite. Deriving the distribution of PRO is a fun-
(Halliday and Hasan 1976). Here, there is a copying of one or
damental issue in the theory of control ever since Chomsky
more semantic features from an antecedent expression on to
(1981).
an expression that follows. hus, a pronoun like she would link
Although occasionally challenged, the existence of PRO
cohesively with a noun phrase like the lady in red occurring ear-
receives strong empirical support in languages with case con-
lier in a text in that it copies the features of singular and female.
cord (Sigursson 1991), where PRO can be shown to bear the
It should be noted, however, that the co-textual link of cohesion,
same morphological case that an overt subject does. Further
being semantic, does not guarantee that the appropriate prag-
evidence for PRO is provided by pairs like (2). Secondary predi-
matic reference will be achieved. here may be more than one
cates (like angry) cannot be predicated on arguments absent
antecedent to which the copying expression may semantically
from the syntax, like the implicit agent of serve. he fact that
relate. Co-textual cohesive links, therefore, do not themselves
they can be predicated on the understood subject of an inini-
result in referential coherence; the latter depends on contex-
tive implies that the latter is syntactically present, even if pho-
tual factors (Blakemore 2001).
netically null.
H. G.Widdowson
(2) a. Michael served dinner angry / *Dinner was served angry.
b. Michaeli hated [PROi to serve dinner angry].
Blakemore, D. 2001. Discourse and relevance theory. In he Handbook
Examples (1c) and (2b) illustrate obligatory control. In contrast,
of Discourse Analysis, ed. D. Schifrin, D. Tannen, and H. E. Hamilton, when the noninite clause occurs as a subject, the reference of
10018. Oxford: Blackwell. PRO is free and can pick a remote linguistic antecedent or no
Firth, J. R. 1957. Papers in Linguistics 193451. Oxford: Oxford University antecedent at all (arbitrary PRO). his situation is called non-
Press. obligatory control.
222
Conversational Implicature
(3) a. Janei admitted that it was likely that [PROi perjuring herself] Maxim of Relation. Be relevant.
was a mistake. Maxim of Manner. Be perspicuous; so avoid obscurity and
b. [PROarb to blame everything on fate] is all too common. ambiguity, and strive for brevity and order.
hus, theories of control must explain, at a minimum: i) where Grice consistently allowed that this list is not exhaustive, and
PRO must/can/cannot occur, ii) what syntactic conigurations at the end of his career, he raised some additional issues. Quality,
require obligatory versus nonobligatory control, and iii) the lexi- he suggests in his Retrospective Epilogue (1989, 3702), dif-
cal/semantic/pragmatic factors that afect the choice of control- fers from the other maxims in being essential to the making of
ler in particular environments. his unique combination makes a genuine contribution. Nor are the maxims independent of one
control an area where separate modules of grammar lexicon, another.
syntax, semantics, and pragmatics converge. Grice contends that an implicatum can be conveyed by obey-
Idan Landau ing the maxims, as well as by the louting of a maxim (1989,
301). When what is said is patently irrelevant, false, uninforma-
WORKS CITED AND SUGGESTIONS FOR FURTHER READING tive, or obscure, the hearer is incited to search for some speakers
intention that does contribute to the purpose of the conversation.
In one of his descriptions of the inferential pattern whereby the
the Netherlands: Kluwer.
hearer works out a conversational implicature, Grice has the
Landau, Idan. 2000. Elements of Control: Structure and Meaning in
Ininitival Constructions. Dordrecht, the Netherlands: Kluwer.
hearer reason as follows: S has said that p; I presume he is observ-
Manzini, M. Rita. 1983. On control and control theory. Linguistic ing the cooperative principle, and p does not on its own suit the
Inquiry 14: 42146. purposes of the conversation, so he must have been implicating
Rosenbaum, Peter. 1967. he Grammar of English Predicate Complement some other proposition, q; the speaker knows and (knows that I
Constructions. Cambridge, MA: MIT Press. know that he knows) that I can see that the supposition that he
Sigursson, Halldr A.. 1991. Icelandic case-marked PRO and the licens- thinks that q is required; he intends me to think, or is willing to
ing of lexical arguments. Natural Language and Linguistic heory allow me to think, that q; and so he has implicated that q.
9: 32763. To apply this pattern to our example, A assumes that B was
being cooperatively rational (informative, sincere, relevant, per-
CONVERSATIONAL IMPLICATURE spicuous) in making the obscure remark about C not having been
he British philosopher Herbert Paul Grice observed that the total to prison and so must draw upon background beliefs to come up
signiicance of an utterance embraces not only what is said but with the point of the remark. his background could pertain to
what is implied. His term of art for the latter was implicature, what A and B mutually believe about C, such as the idea that C
and he identiied conversational implicature as an important has venal inclinations. he remark about prison may be inter-
type of implicit meaning or signiication. preted, then, as a hyperbolic comment meant to evoke this trait.
Grice used the following example to introduce this type of Grice states that a conversational implicature can be can-
implicit meaning in his 1967 William James lectures at Harvard: A celed without contradiction. For example, B could coherently
and B discuss a mutual friend, C, who has recently started work- add: but of course there is no real danger of C going to prison.
ing at a bank. When A asks B how C is getting on, B replies, Oh Implicatures can also be reinforced, which would be the case
quite well, I think; he likes his colleagues, and he hasnt been to should B add: so lets hope C does not get caught. To implicate
prison yet. Given knowledge of English and of contextual fac- something insincerely can be dangerously misleading but does
tors, A can readily grasp that B has said that C is getting on well, not amount to lying: he hearer cannot reasonably complain
likes his colleagues, and hasnt been to prison yet. A may also that B has performed the illocutionary action of stating or assert-
understand that B has implied something else with the remark ing that their friend has venal inclinations.
about prison, and, according to Grice, a rational reconstruction Grice distinguishes between conversational and conventional
of the bases of this understanding reveals a complex inferential implicature. Sentences, as opposed to utterances, implicate
process a process based on principles that the persons involved what speakers or writers who follow the linguistic conventions
probably would not be able to articulate unless they had studied would normally use the sentence to implicate. He also contends
the Gricean literature. that there are generalized conversational implicatures that
At the top of Grices list is the cooperative principle, are not conventional. One of his examples is that if someone
roughly, the idea that it is rational for participants in conversa- says X went into a house, it is normally but not convention-
tions to advance the accepted purpose or direction of the talk ally implied that the house is not Xs own (1989, 378). A detailed
exchange to which they contribute. As it is routinely recapitu- neo-Gricean account of generalized conversational implicature
lated, Grices theory speciies that in addition to this basic pre- for utterance-type meaning has been developed by Stephen C.
sumption about rational cooperation, hearers should act and Levinson (2000), who argues that an implicature is generalized
think in terms of conversational maxims or imperatives, which just in case it is implicated by default or, in other words, unless
include the following: there are unusual contextual assumptions that prevent the impli-
cature from being appropriate.
Maxim of Quality. Make your contribution true; so do not Grice describes implicature as a pervasive feature of dis-
convey what you believe false or unjustiied. course and extends his account to cover metaphor, irony,
Maxim of Quantity. Be as informative as required. and indirect speech-acts. His exploration of implicature was
223
Conversational Implicature
linked to larger philosophical themes, including his defense of a proposition because that would have been irrational? In other
causal theory of perception and contentions about ambiguity words, why could there be no irrational implicatures?
and presupposition in ordinary language use. Grice argues, Other challenges to Gricean theory target the interest and
for example, that the word or is not ambiguous in English since adequacy of the normative account of the hearers recognition of
the exclusive interpretation (according to which or means that implicature. Kim Sterelny (1982, 1913) observes that it is knowl-
either but not both disjuncts is true) can be understood as a con- edge of the speaker that is crucial to the success of this kind of
versational implicature and not as a second meaning of or. interpretive project, not knowledge of conversational principles,
According to the thesis known as Grices Razor, it is better to maxims, rules, or general tendencies. Interpreters do sometimes
posit conversational implicatures than ambiguity (and in some discern the intentions of uncooperative, strategic, and even idio-
cases, presupposition) because implicatures can be derived syncratic interlocuters who violate the norms of rational, coop-
more economically from the independently motivated principles erative speech. Implicatures that are generated and understood
of cooperative rationality. may be a prevalent feature of non-Gricean discursive exchanges,
Although it is widely acknowledged that implicature is an that is, exchanges that diverge very signiicantly from the norm
important phenomenon, questions are raised about the explan- of cooperative communicative activity. he comprehension of
atory and descriptive value of Grices theory. Wayne A. Davis implicature is assisted by the existence of various conventional-
(1998) argues that Grices maxims and cooperative principle ized forms, many of which vary from culture to culture, as Anna
predict a range of implicatures that do not actually occur, while Wierzbicka (1991) has documented.
other implicatures that do occur cannot be derived from them. Davis argues that the interpretation of implicit meaning does
He also argues that the theory has no genuine explanatory pay- not depend on any one specialized or characteristic pattern of rea-
ofs. he proximal causes of a speakers conversational moves soning, concluding that any principle general enough to hold in
are that persons attitudes, not general tendencies to cooper- all cases of implicature will be too general to yield speciic predic-
ate or an audiences presumptions about the latter. According tions (1998, 99). his criticism also applies to the versions of the
to Davis, Grice wrongly assumes that the production and rec- principle of relevance advanced by Sperber and Wilson (1986) as
ognition of implicature are processes explicable in terms of the the successor to Grices bundle of maxims. It seems highly dubi-
same principles and maxims. his premise is misleading if what ous to suppose that what people imply, and what others efec-
a speaker implicates or means is not caused by what others pre- tively take them to be implying, in all discourse is determined by a
sume or know about that speaker. To implicate a meaning that quest for communicative eiciency deined as the maximization
extends beyond what one says is to say something with certain of information conveyed per unit of processing cost.
intentions, and the speakers intentions do not directly depend An alternative to the Gricean recourse to broad psychosocial
on what others know or presume. Having and expressing inten- principles is to focus on the role of conventions in both the gen-
tions is one thing, whereas communicating them to others is eration and understanding of the speakers implicature. Often
something else. Grice appears to assume that audience uptake of when we intend to imply one thing by saying another we rely
a certain kind is necessary or even suicient to the realization of upon some conventional, established idiom. For example, it is
communicative intentions. idiomatic that S could have done y normally implies that S did
Grices explicit analysis of conversational implicature can not do y, whereas the nearly synonymous S was able to do y
be read as indicating that the very existence of implicature (as implies that S did do y (Davis 1998, 378). hus, Bernard was
opposed to its successful uptake or understanding by some audi- able to make the inal putt can be used to implicate convention-
ence) requires the presumption, on the part of a hearer, that the ally that Bernard made the putt, and the speakers intention and
speaker has observed or acted in accordance with the coop- corresponding implicature can be grasped through knowledge
erative principle. hus, Grice writes that S implicates q only if he of the convention without recourse to the complex inferences
is presumed to be observing the conversational maxims (1989, Grice postulated. As an alternative to the thesis that interpreters
30). On an alternative reading, the actual hearers presumptions reason from cognitive states to implicatures, Nicholas Asher and
and other beliefs are not necessary, since some implicatures Alex Lascarides (2003) argue that reasoning about implicature is
are made by a speaker but remain unrecognized by the target based on rhetorical structures understood as speech-act types.
audience. What Grice has proposed is an account of success-
Paisley Livingston
fully communicated implicatures, but not implicature tout court.
Jennifer Saul (2002, 241) suggests that what matters for Grice
is not what particular hearers actually think, but what they are
required to think. Grice indeed stressed that his focus was on Asher, Nicholas, and Alex Lascarides. 2003. Logics of Conversation.
the rationality or irrationality of conversational conduct (1989, Cambridge: Cambridge University Press.
369). he thesis that implicature must be calculable or capable Brown, Penelope, and Stephen C. Levinson. 1987. Politeness: Some
Universals in Language Usage. Cambridge: Cambridge University
of being worked out can, then, be taken as belonging to a nor-
Press.
mative theory of the conditions under which speakers can suc-
Davis, Wayne A. 1998. Implicature: Intention, Convention, and Principle
cessfully realize the rational intention to implicate rather than in the Failure of Gricean heory. Cambridge: Cambridge University
to state some thought. Yet it is unclear why the norms of com- Press.
municative rationality should apply to both noncommunicative Grice, Herbert Paul. 1975. Logic and conversation. In Syntax and
and communicative linguistic behavior. Is it persuasive to argue Semantics. Vol. 3. Ed. Cole and Morgan, 4158. New York: Academic
that it is simply impossible for a speaker to have implicated some Press.
224
Conversational Repair Conversation Analysis
. 1989. Studies in the Way of Words. Cambridge: Harvard University Conversation analysis inds that the grammar of repair is
Press. vital for syntax-for-conversation. Repair is closely related to
Kasher, Asa, ed. 1998. Pragmatics: Critical Concepts. London: Routledge. syntax because it afects the shape and/or components of
Levinson, Stephen C. 2000. Presumptive Meanings. Cambridge, MA: MIT a sentence. Syntax organizes elements through which talk
Press.
is constructed. Syntax-for-conversation cannot exist without
Saul, Jennifer. 2002. Speaker meaning, what is said, and what is impli-
repair because speakers constantly search for the next item
cated. Nos 36: 22848.
Sperber, Dan, and Deirdre Wilson. 1986. Relevance: Communication and
due for the interactive needs of the conversation. he study of
Cognition. Cambridge: Harvard University Press. repair, therefore, demonstrates how interaction and grammar
Sterelny, Kim. 1982. Against conversational implicature. Journal of shape each other.
Semantics 1: 18794.
Liang Tao
Wierzbicka, Anna. 1991. Cross-Cultural Pragmatics: he Semantics of
Human Interaction. New York: Mouton de Gruyter.
Fox, Barbara, Markoto Hayashi, and Robert Jasperson. 1996. Resources
CONVERSATIONAL REPAIR and repair: A cross-linguistic study of the syntactic organization
Conversational repair (hereafter, repair) refers to a common of repair. In Interaction and Grammar, ed. Elino Ochs, Emanuel
practice in the interactive social organization of conversations Scheglof, and Sandra hompson, 185237. Cambridge: Cambridge
University Press.
in which speakers suspend the smooth progressivity of the talk
Scheglof, Emanuel, Gail Jeferson, and Harvey Sacks. 1977. he prefer-
to deal with some ostensible problem in speaking, hearing, or
ence for self-correction in the organization of repair in conversation.
understanding the talk. Repair does not always involve hearable Language 53: 36182.
errors or mistakes that require correction. herefore, the term
repair, rather than correction, is used to capture the more general
domain of such occurrence in conversation analysis.
CONVERSATION ANALYSIS
he organization of conversation is a turn-taking system in Conversation analysis (CA) is the study of talk (and other con-
which speakers take turns to converse. A repair may be done duct) in human interaction that began with the pioneering work
by the speaker of the trouble source in the same turn (same- of Harvey Sacks (1995) and his collaborators Emanuel Scheglof
turn self-repair). Or it may be done by anyone but the speaker. and Gail Jeferson (e.g., Sacks, Scheglof, and Jeferson 1974;
Furthermore, a repair may be initiated by the speaker of the Scheglof, Jeferson and Sacks 1977). CA seeks to establish tech-
trouble source or by others. Repair is often carried out with rep- nical speciications of the practices people use to co-construct
etition/recycling, replacement, or restructuring of the utterance, orderly and mutually understandable courses of action. hese
although not all repair attempts may be successful. Studies ind speciications constitute a cumulative, empirically derived body
that self-repair prevails even when a repair is initiated by others. of knowledge that is foundational to CA as a discipline. Since its
Next is a brief discussion of same-turn self-repair, for which beginnings within sociology in the late 1960s and early 1970s,
an emerging utterance may be stopped, aborted, recast, con- CA has become hugely inluential, both as an emerging disci-
tinued, or redone. Such repair often involves self-initiation with pline in its own right and across the ields of sociology, psychol-
some nonlexical initiators, such as cutofs, sound stretches, uhs, ogy, anthropology, linguistics, and education. It is increasingly
and so on, followed by repair. Following is an example contain- applied in studies of institutional and organizational interaction
ing two instances of repair with cutofs, replacements, insertion, (including news interviews, court proceedings, emergency and
and repetition/recycling (the asterisk indicates where repair help-line calls, and doctorpatient interaction; see Drew and
initiates). Heritage 1992; Heritage and Maynard 2006) and in sociologi-
cal studies of the operation of social norms and the reproduc-
(1) And tshe-* this girls ixed up onna da- * a blind da:te.
tion of culture (especially related to gender [see gender and
In the irst instance of repair, the speaker cuts of the pronoun language] and sexualities; Kitzinger 2000, 2005). Although
tshe- (i.e., the repairable; the - indicates glottalized cutof) and it originated in the analysis of talk from American-English speak-
replaces it with a full noun phrase, this girl. he second instance ers, CAs basic indings have now been replicated across many
is where date is cut of to introduce a modiier by recycling the other languages.
entire noun phrase: a blind date. he intellectual roots of CA lie in a synthesis of the sociological
Repair is highly patterned, with some basic mechanisms traditions established by Erving Gofman and Harold Garinkel
occurring cross-linguistically. But speciics of the mechanisms traditions that, like other broadly social constructionist theo-
difer. For instance, recycling often occurs at a turn beginning retical frameworks, ofer models of people as agents and of a
when the utterance overlaps with the ending of the previous social order grounded in contingent, ongoing, interpretive work
speakers turn. However, the syntactic unit that is recycled difers (see Heritage 1984). CA aims to build a science of social action,
from one language to another: Some allow repetition of single rather than to contribute to the study of language per se. It relies
words, whereas some require larger syntactic units to be recy- on analysis of recordings of naturally occurring human interac-
cled (e.g., in the example, the entire noun phrase is recycled). tion (i.e., not invented or hypothetical data and not data gener-
Furthermore, speakers of a tone language such as Mandarin ated by researchers via interviews or in laboratories). Recordings
make tone-related recycles. herefore, repair mechanisms are are transcribed according to a distinctive transcription nota-
constrained by the grammar of individual languages. tion system (Jeferson 2004), but it is the recordings themselves
225
Conversation Analysis
(and not transcripts of them) that are the primary data. Sound turn, transition to a next speaker can become relevant: his is a
iles are increasingly being made available on the World Wide transition relevance place. Turn-taking organization is designed
Web (see http://www.sscnet.ucla.edu/soc/faculty/scheglof/ to minimize turn size, such that a turn with one (and only one)
sound-clips.html for sound clips from Scheglofs publications), TCU is the default, and extended turns with lengthy and/or
enabling readers of published work to access the original data. multiple TCUs are accomplishments. his has important impli-
Much early CA was based only on audio recordings (since the cations for the analysis of overlapping talk and of longer turns
technology for video recording was not yet available, but see at talk (including, but not limited to, storytelling), both of which
Goodwin 1981), which precluded analysis of such interactional have been extensively researched. he model also encompasses
features as gesture, body deployment, and gaze. Video record- speaker-selection techniques in multiparty interaction.
ings of face-to-face interactions are now the norm. Although new
data are continually being collected, several core data sets have Action Formation
been shared within the CA community since the 1970s (e.g., the Researchers have focused on how speakers deploy talk (and other
telephone conversation known as Two Girls [TG], which can conduct) in order to fashion a turn designed to be recognizable to
be accessed at http://www.cambridge.org/9780521532792, their recipients as doing a particular action, that is, how people
Appendix 2). hese shared data are widely used in teaching, do complaining, or inviting, or declining, and so on (Atkinson
frequently reanalyzed for new phenomena, and appear in pub- and Heritage 1984). Since CA (unlike speech-act theory) starts
lications by a range of diferent authors. Analysis of these kinds from the analysis of singular episodes of human interaction and
of ordinary conversations is the point of departure for studying undertakes to understand action as the co-participants under-
more specialized communicative contexts (the legal process, the stand it, one outcome of this kind of analysis is a very detailed
medical encounter) in which social institutions are talked into understanding of how (for example) complaining or inviting
being (Heritage 1984). are done that often departs from vernacular understandings.
Conversation analysis has produced few theoretical mani- Another outcome is the discovery of actions that have no vernac-
festos but has, rather, concentrated on ine-grained empirical ular name (e.g., conirming an allusion; Scheglof 1996a).
studies of interaction. hese studies rest upon three fundamen-
tal theoretical assumptions (Heritage 1984): i) Talk is a form of Sequence Organization
action; that is, people use it to do things like complaining, com- he most basic type of sequence involves two turns at talk by dif-
plimenting, disagreeing, inviting, telling, and so on; ii) action ferent speakers, the irst constituting an initiating action (irst
is structurally organized; that is, turns at talk are systematically pair part) and the second an action responsive to it (second pair
related to one another, such as (for example) when an accep- part): for example, an invitation and an acceptance or declina-
tance follows an invitation or a self-deprecation follows a compli- tion of it; a news announcement and a news receipt (see adja-
ment (see adjacency pair); and iii) talk creates and maintains cency pair). Most initiating actions can be followed by a range
intersubjectivity; that is, a irst speaker understands, by what a of sequentially relevant (i.e., appropriately itted) next actions,
second speaker does, how that second speaker heard his or her some of which further the action of the prior turn (e.g., accepting
irst turn as when a second speaker produces a turn hearable as an invitation) and are termed preferred responses, and others of
an answer, thereby showing herself /himself to have heard the which do not (e.g., rejecting an invitation) and are termed dispre-
prior turn as a question. ferred. he basic two-turn adjacency pair sequence can be and
he focus in CA research is on identifying generic orders of frequently is expanded. Pre-expansions are turns that come
organization in talk-in-interaction that are demonstrably salient before and are recognizably preliminary to some other action;
to the participants analyses of one anothers turns at talk in for example, a turn such as What are you doing tonight? can be
the progressively unfolding interaction. Data are rarely coded recognizable in context as preliminary to an invitation (hence, a
or quantiied since manifest similarities in talk may turn out to pre-invitation); a turn such as Guess what is virtually dedicated
have very diferent interactional meanings. Key discoveries of CA to preannouncement. Insert expansions come between the irst
include turn-taking, action formation, sequence organization, and second pair parts, for example, between an invitation and
repair, word selection, and overall structural organization, each the acceptance or declination of it (Do you wanna come round
of which is now sketched out. tonight? / What time? / About six. / Okay where the invita-
tion and its acceptance are separated by an insert sequence).
Turn Taking Postexpansions come after the second pair part and may accept
he classic paper by Sacks, Scheglof, and Jeferson (1974) pres- or assess it. For example, You want me to bring you anything?
ents a model to describe the practices whereby people (mostly) (ofer: irst pair part) / No, no nothing (declination: second
speak one at a time. Summarized very simply, the model pro- pair part) / Okay (acceptance of the declination, expanding the
poses that the building blocks out of which turns are composed sequence to a third turn). he authoritative work on adjacency
(turn constructional units or TCUs) can be whole sentences, pairs and expansions of them, the organization of preference
phrases, sometimes just single words, or even nonlexical and dispreference, and other types of sequence organization is
items which, in context, are recognizable to a co-participant Scheglofs (2007) primer.
as possibly constituting a complete turn. Each speaker is ini-
tially entitled to just one TCU, after which another speaker has Repair
the right (and sometimes the obligation) to speak next. As a Interactional co-participants must manage troubles in speak-
speaker approaches the possible completion of a irst TCU in a ing, hearing, and/or understanding talk if the interaction is not
226
Conversation Analysis
to founder when trouble arises (see conversational repair). WORKS CITED AND SUGGESTIONS FOR FURTHER READING
Repair is a method for ine-tuning a turn in the course of its pro- Atkinson, J. Maxwell, and John Heritage, eds. 1984. Structures of Social
duction and for maintaining intersubjectivity. Researchers have Action. Cambridge: Cambridge University Press.
shown some of the practices that speakers use across a range of Clayman, Steven, and John Heritage. 2002. he News Interview.
diferent positions in talk, both in repairing their own talk (e.g., Cambridge: Cambridge University Press.
by deleting, inserting, or replacing a word; Scheglof, Jeferson, Drew, Paul. 1997. Open class repair initiators in response to sequential
and Sacks 1977) and in initiating repair on the talk of others (e.g., sources of trouble in conversation. Journal of Pragmatics 28: 69101.
with open-class repair initiations like huh?; Drew 1997). Most Drew, Paul, and John Heritage, eds. 1992. Talk at Work: Interaction in
repairs are completed by the speaker of the trouble source in the Institutional Settings. Cambridge: Cambridge University Press. A clas-
same turn (more accurately, the same TCU) as the trouble source sic collection of studies exploring the application of conversation
analysis to the study of language and interaction in applied settings,
but can be delayed to third turn or third position, or even later
including doctorpatient consultation, legal hearings, news interviews,
(Scheglof 1992).
and emergency calls. For more recent work in talk in organizational
settings, see Heritage and Maynard (2006) and Clayman and Heritage
Word Selection (2002).
Turns at talk are composed of lexical items selected from among Goodwin, Charles. 1981. Conversational Organization. New
alternatives. For example, when English-language speakers refer York: Academic Press.
to themselves, they can often select between I or we (the latter Heritage, John. 1984. Garinkel and Ethnomethodology.
choice sometimes being used, for example, to index that they are Cambridge: Cambridge University Press.
speaking on behalf of an organization or a couple). Alternatively, Heritage, John, and Douglas Maynard, eds. 2006. Communication in
they can self-reference in distinctive (marked) ways (e.g., self- Medical Care. Cambridge: Cambridge University Press.
naming or self-description) that perform analyzable actions. Jeferson, Gail. 1974. Error correction as an interactional resource.
Language in Society 2: 18199.
Likewise, explicit self-reference in (so-called zero-anaphora)
. 2004. Glossary of transcript symbols with an introduction. In
languages in which this is not required has been shown to be
Conversation Analysis, ed. Gene Lerner, 1331. Amsterdam: John
interactionally meaningful (see the Lerner and Kitzinger 2007
Benjamins.
collection on selection issues in self-reference). Category-based Kitzinger, Celia. 2000. Doing feminist conversation analysis. Feminism
reference to nonpresent persons also involves choices between and Psychology 10: 16393.
alternatives (Scheglof 1996b); for example, law enforcement . 2005. Heteronormativity in action. Social Problems
oicers can be referred to as police or cops, and speakers selec- 52.4: 47798.
tion of one or the other may be responsive to whether the speaker Lerner, Gene. 2004. Conversation Analysis: Studies from the First
is appearing in court (Jeferson 1974) or talking with adolescent Generation. Amsterdam: John Benjamins. A collection of early but
peers (Sacks 1995). CA explores how word selection is done as previously unpublished research by many of the central igures in the
part of turn design and how it informs and shapes the underdevelopment and advancement of CA.
Lerner, Gene, and Celia Kitzinger, eds. 2007. Referring to self and others
standing achieved by the turns recipient.
in conversation. Discourse Studies 9.4 (Special Issue).
Sacks, Harvey. 1995. Lectures on Conversation.Vols. 1 and 2.
Overall Structural Organization Oxford: Blackwell. Useful for understanding the early beginnings of
Talk-in-interaction is organized into phases, for example, most CA, these two volumes present lectures, transcribed and edited by Gail
obviously, openings and closings (Scheglof and Sacks 1973). Jeferson, from one of the founders of conversation analysis, as deliv-
Within ordinary conversation, however, matters are compara- ered to classes at the University of California between 1965 and 1972.
tively luid. Within organizational talk, by contrast, there are Each volume has an introduction by Emanuel Scheglof.
component phases or activities that characteristically emerge in a Sacks, Harvey, Emanuel A. Scheglof, and Gail Jeferson. 1974. A sim-
particular order. Acute doctorpatient interactions, for example, plest systematics for the organization of turn-taking for conversation.
have a highly structured overall organization (opening, present- Language 50: 696735.
ing complaint, examination, diagnosis, treatment, and closing; Scheglof, Emanuel A. 1992. Repair after next turn. American Journal of
Heritage and Maynard 2006), and doctors and patients conduct Sociology 95: 12951345.
Scheglof, Emanuel A.. 1996a. Conirming allusions: Toward an empiri-
can be analyzed for the way in which they orient to and negoti-
cal account of action. American Journal of Sociology 104.1: 161216.
ate the boundaries of each phase of the interaction. Many recent
. 1996b. Some practices for referring to persons in talk-in-
studies draw on analyses of overall structural organization as interaction. In Studies in Anaphora, ed. Barbara Fox, 43785.
part of research designed to be of practical use by organizations Amsterdam: John Benjamins.
in improving the quality of their services. . 2007. Sequence Organization in Interaction: A Primer in
Conversation Analysis. Vol 1. Cambridge: Cambridge University Press.
Conclusion A landmark text providing the deinitive introduction to sequence
Although much research remains to be done, there is, for each of organization and capsule reviews of other key concepts such as turns,
these orders of organization, an established set of core indings, actions, and repair, each of which will constitute the subject matter of
foundational to the discipline of CA. An outstanding bibliograph- forthcoming primers by this leading authority on CA.
ical source of information about CA is available on the Ethno/CA Scheglof, Emanuel A., Gail Jeferson, and Harvey Sacks. 1977. he pref-
Web site maintained by Paul ten Have at http://www2.fmg.uva. erence for self-correction in the organization of repair in conversa-
tion. Language 53: 36182.
nl/emca/resource.htm.
Scheglof, Emanuel A., and Harvey Sacks. 1973. Opening up closings.
Celia Kitzinger Semiotica 7.4: 289327.
227
Cooperative Principle Core and Periphery
COOPERATIVE PRINCIPLE Grice, Herbert Paul.. 1989. Studies in the Way of Words. Cambridge: Harvard
University Press.
Introduced by the British philosopher Herbert Paul Grice Kasher, Asa. 1976. Conversational maxims and rationality. In Language
(191388), the cooperative principle and related maxims are part in Focus: Foundations, Methods, and Systems, ed. A. Kasher, 197211.
of his theory of conversational implicature. Dordrecht, the Netherlands: Reidel.
Grice formulates the principle as an imperative: Make your Kasher, Asa. 1982. Gricean inference revisited. Philosophica
contribution such as required, at the stage at which it occurs, by 29: 2544.
the accepted purpose or direction of the talk exchange in which
you are engaged (1975, 45). He observes that it is a well-rec-
ognized empirical fact that this ceteris paribus principle applies CORE AND PERIPHERY
to all talk exchanges that do not consist of wholly disconnected Mainstream generative grammar makes two basic divisions
remarks, and he adds that he would like to be able to argue among linguistic phenomena. he irst is the traditional division
that the principle is grounded in rationality. To that end, he sug- between grammar and the lexicon, taken to be the locus of all
gests that persons participating in conversational exchanges irregularity. he second (Chomsky 1981) distinguishes between
do so with certain shared purposes, such as exchanging infor- two parts of the grammar itself, the core and the periphery.
mation and inluencing and being inluenced by others. hese he core rules represent the deep regularities of language. he
shared purposes, Grice suggests, are in general only realized if periphery represents marked exceptions, such as irregular verbs,
the exchanges are conducted in accordance with the coopera- for which there are no deep regularities.
tive principle. For those who know this, it is rational to behave in he coreperiphery distinction (henceforth C/P) is related
accordance with the cooperative principle and to expect others to markedness. For Noam Chomsky (1965), markedness is a
to do so as well. hus, this principle and presumption are ratio- graded phenomenon that relects relative centrality, naturalness,
nal, given assumptions about shared conversational ends, efec- simplicity, ease of learning, and related notions. he introduction
tive means to those ends, and rationality. of C/P can be seen as a distillation of the notion markedness hier-
Many researchers (e.g., Brown and Levinson 1987; Clark archy into a binary distinction. he consequence is a dramatic
1996) describe the cooperative principles applications and take conceptual simpliication, which ties naturally to the character-
up thorny questions about its relation to the associated maxims ization of universal grammar in terms of parameters and
of quantity, quality, relation, manner, and politeness. here is a related perspective on the language acquisition device
disagreement as to whether norms of conversational etiquette and learnability (see also syntax, universals of). On this
derive from, are complementary to, or are in tension with the view, the core is part of the human biological endowment for lan-
cooperative principle. Asa Kasher (1976, 1982) argues against guage, and the value of a parameter is set by the learner on the
the assumption of shared conversational purposes and contends basis of minimal linguistic input. One important consequence of
that the cooperative principle is superluous since the needed C/P, particularly in syntactic theory, is that it has focused con-
maxims can be derived from a more fundamental principle of siderable attention on understanding how languages do and do
rational behavior: Given a goal, adopt the most efective and not realize phenomena such as argument structure and wh-in-
least costly means to its realization. Wayne A. Davis (1998) con- terrogatives. Another is that it has led to the uncovering of a wide
tends that the cooperative principle lacks explanatory value and range of empirical phenomena in the attempt to integrate appar-
is hopelessly ambiguous among normative, motivational, behav- ent exceptionality, idiosyncrasy, and counterexamples into a
ioral, and cognitive readings. As it is only the speakers motives general framework of universals and parametric restrictions.
and beliefs that are causally involved in the intentional produc- Despite the value of such a simpliication, Chomsky him-
tion of implicit meanings, what the speaker implicated or implic- self notes that we do not expect to ind chaos in the theory of
itly expressed does not depend on the thoughts or presumptions markedness, but rather an organized and structured system,
of the audience. In other words, Grice erred when he made the building on the theory of core grammar (1981, 216), and that
hearers presumption that the speaker observes the cooperative marked structures have to be learned on the basis of slender
principle a condition on the speakers expressing one thing by evidence too, so there should be further structure to the system
saying something else. outside of core grammar (1981, 8). It is in fact not clear that C/P
Paisley Livingston is a principled distinction and that it relects anything beyond
generality of function and frequency of use. Apparent syntactic
WORKS CITED AND SUGGESTIONS FOR FURTHER READING idiosyncrasies beyond the level of individual words are learned,
they display various degrees of speciicity, and native speakers
Brown, Penelope, and Stephen C. Levinson. 1987. Politeness: Some have sharp and reliable intuitions about them. Furthermore,
Universals in Language Usage. Cambridge: Cambridge University
Occams razor demands that it be shown that a learning mecha-
Press.
nism that can acquire the peripheral cases cannot also acquire
Clark, Herbert H. 1996. Using Language. Cambridge: Cambridge
University Press.
the core. Hence, C/P may be nothing more than a rough and
Davis, Wayne A. 1998. Implicature: Intention, Convention, and Principle tentative distinction, one drawn for working purposes (and
in the Failure of Gricean heory. Cambridge: Cambridge University nothing more than that) (Chomsky 1993, 1718).
Press. Here are some illustrations of peripheral phenomena
Grice, Herbert Paul. 1975. Logic and conversation. In Syntax and Semantics. (Culicover and Jackendof 2005). First, there are words that go in
Vol. 3. Ed. Cole and Morgan, 4158. New York: Academic Press. the wrong place.
228
Core and Periphery
Enough modiies adjectives and adverbs, alternating with d. where to/from/*near.

so, too, and as. However, unlike these, it follows its head: so/ e. * which (book) with/to/from/next to/about/beside. here are
too/as/*enough big; big enough. As a nominal modiier, it can other cases as well (Culicover 1999):
go either before or after its head: much/more/suicient/enough
(5) a. no matter (how heavy the load/what the cost/the diiculty)
pudding; pudding *much/*more/*suicient/enough.
b. -ever [as in whatever the cost]
he quantiiers galore and aplenty also go after the head
c. the comparative correlative (the more he eats the hungrier
rather than before it, obligatorily: money galore, *galore money.
he gets)
Responsible, unlike other adjectives, can occur either before or
d. would rather
after its head. Notwithstanding parallels other prepositions, such
e. had better
as despite, in spite of, and regardless of in its semantics, but it
f. ininitival relatives [as in someone with whom to speak;
can go on either side of its complement noun phrase (NP). he
*someone who to speak with]
related word aside goes on the right of its complement; aside
g. parasitic gaps
from goes on the left.
h. Not-topics (not in my car (you wont))
Each of these cases constitutes an idiosyncratic departure
i. Italian loro
from strict x-bar theory.
j. dative NP in English
here is sluice-stranding too. (1a) means the same as (1b).
k. the possibility of clitic climbing
(1) a. John went to NY with someone, but I couldnt ind out who l. English tags
with. For any apparently peripheral phenomenon, further research
b. John went to NY with someone, but I couldnt ind out who may show that its properties follow from general principles with-
John went to NY with. out construction-speciic stipulations, or that there may be some
(1a) is a case of sluice-stranding, where an isolated wh- irreducible idiosyncrasy.
phrase stands in place of an understood indirect question. Conclusions
It contains not only the wh-phrase but also a preposition Syntactic constructions appear to be ranged on a continuum
from whose complement the wh-phrase has apparently been from words through idioms through truly idiosyncratic con-
moved. It is technically possible to derive this construction structions through more general but still specialized construc-
through some combination of wh-movement and deletion. tions to the most general corelike structures and principles of
he diiculty is that sluice-stranding is both more productive universal grammar. It is likely that certain peripheral construc-
and more restricted than a derivational account would suggest. tions may be related to the core in systematic ways, say, by
Sluicing in general is possible where the purported extraction relaxing certain conditions of core grammar (Chomsky 1986,
site normally forbids extraction (Ross 1969). (2a) illustrates for 147). But C/P per se, however valuable heuristically, may not
ordinary sluicing of a prepositional phrase; (2b) illustrates for merit genuinely theoretical status. (Cf. head-driven phrase
sluice-stranding. structure grammar and construction grammar.)
(2) I saw a fabulous ad for a Civil War book, but I cant remember he implication for learning is that the learner stores current
a. by whom. analyses of novel utterances in the lexicon, with idiosyncratic
b. who by. and general properties (see lexical acquisition). he learn-
c. * by whom I saw a fabulous ad for a Civil War book. ing procedure attempts to construct more general lexical entries
d. *who I saw a fabulous ad for a Civil War book by. on the basis of positive experience, where common parts of exist-
ing lexical entries are retained and difering parts are replaced by
On the other hand, sluice-stranding severely constrains what a variable. he resulting lexical entry functions as a schema or
combinations of wh-word and preposition are acceptable, while rule that encompasses existing entries and permits construction
sluicing is productive. of new utterances. In turn, this schema, along with others, may
be further abstracted into a still more general schema by replac-
(3) Normal pied-piped preposition in sluicing:
ing further dimensions of variation with variables (Tomasello
but I couldnt igure out
2003), producing in the limit grammatical rules of full generality
a. with/to/from/for/next to/about/beside whom.
where warranted (see also syntax, acquisition of).
b. with/for/from/of/on/in/about/at/before/into/near beside
what. Peter W. Culicover
c. for/by/with how much.
d. to/from/near where. WORKS CITED AND SUGGESTIONS FOR FURTHER READING
e. with/to/from/next to/about/beside which (book). Chomsky, Noam. 1965. Aspects of the heory of Syntax. Cambridge, MA:
MIT Press.
(4) Sluice-stranding:
. 1981. Lectures on Government and Binding. Dordrecht, the
but I couldnt igure out
Netherlands: Foris.
a. who with/to/from/for/*next to/*about/*beside. . 1986. Knowledge of Language. New York: Praeger.
b. what with/for/from/of/on/in/about/ . 1993. A minimalist program for linguistic theory. In he View
at/*before/*into/*near/*beside. from Building Twenty, ed. Kenneth Hale and Samuel J. Keyser, 152.
c. how much for/*by/*with. Cambridge, MA: MIT Press.
229
Corpus Callosum
Culicover, Peter W. 1999. Syntactic Nuts. Oxford: Oxford University he functional relevance of the CC in general, and in language
Press. processing in particular, was impressively demonstrated by Roger
Culicover, Peter W., and Ray Jackendof. 2005. Simpler Syntax. W. Sperry and Michael S. Gazzaniga in their research on patients
Oxford: Oxford University Press. with a complete surgical transsection of the CC (Gazzaniga
Ross, John R. 1969. Guess who. In Proceedings of the Fifth Annual
2000). In an everyday situation, these patients are able to pro-
Meeting of CLS, ed. Robert I. Binnick et al., 25286. Chicago: Chicago
cess language in a seemingly appropriate way. However, when
Linguistics Society,
Tomasello, Michael. 2003. Constructing a Language. Cambridge: Harvard
tested with special experimental paradigms, the lack of inter-
University Press. hemispheric communication becomes obvious, indicating that
an intact CC is not obligatory but seems necessary for achieving
optimal and eicient language processing.
CORPUS CALLOSUM he exact role of the callosal axons in interhemispheric inter-
Although language processing relies predominantly on left action is still a matter of debate, however. At least two diferent
hemisphere networks, certain functional units are also local- classes of possible callosal functioning can be distinguished. he
ized in the right hemisphere. As a result, there is a strong CC might be seen either as 1) a channel to exchange information
need for an interaction between the two hemispheres during between the two hemispheres (information transfer function) or
most language-related processes. he neuronal basis for this as 2) a mechanism through which one hemisphere exerts inhibi-
interaction is provided by a brain structure located between the tory or excitatory inluence on the ongoing processing in the
two hemispheres, the so-called corpus callosum (CC), which is opposite hemisphere (modulatory function).
the major interhemispheric iber tract. he more than 200 million Information transfer becomes important whenever one
axons forming the CC originate from nearly all cortical regions, hemisphere needs to access information that is available only in
including the language areas, and they primarily link homologue the other hemisphere. his might be the sensory input initially
regions of the hemispheres. he ibers cross the interhemispheric transferred to only one hemisphere (e.g., visual input from the
gap ordered by their cortical origin. Due to functional special- lateral periphery of the visual ield) or the outcome of preced-
ization of the cerebral cortex, this anatomical organization also ing unilateral processing steps. An instructive example related
establishes a functional topography within the CC. hus, dif- to language is the interplay of (right-hemispheric) prosodic
ferent subregions of the tract are related to speciic functional and (left-hemispheric) syntactic information processing dur-
networks. ing speech comprehension (see speech perception). In an
Viewed in a midsagittal section of the brain (see Figure 1), electroencephalographic (EEG) study, Angela D. Friederici,
two subregions of the CC seem to be particularly relevant for lan- D. Yves von Cramon, and Sonja A. Kotz 2007, 135) examined
guage processing: Fibers passing through portions of the anterior how patients with lesions in the CC respond to a mismatch
CC connect the language production network situated in the left between the syntactic and prosodic structure of a sentence.
inferior frontal cortex (see frontal lobe and brocas area) While the healthy control subjects and patients with anterior CC
with its contralateral homologue, while axons in the posterior lesions showed a clear diference between their EEG responses
CC interconnect the cortical areas in the temporal lobes (see to prosodically correct and incorrect sentences, no such efect
also wernickes area) which are responsible for language was found in patients with lesions in the posterior CC. hus,
perception. the destruction of the direct connections between left and right
Figure 1. A midsagittal view of the brain

acquired with magnetic resonance imaging. The
characteristic cross-sectional shape of the cor-
pus callosum (CC) is indicated. CC subregions
connecting frontal (1) and temporal (2) language
networks are marked by hatched areas.
230
Corpus Linguistics
temporal lobes seems to prevent the interhemispheric exchange linguistics, including phonetics, phonology, morphol-
required to integrate prosodic and syntactic information. ogy, syntax, semantics, pragmatics, discourse analy-
One often-quoted modulatory role of the CC is the functional sis (linguistic), sociolinguistics, language acquisition,
inhibition of the contralateral hemisphere while the ipsilateral PSYCHOLINGUISTICS, HISTORICAL LINGUISTICS, dialectol-
hemisphere is engaged in a task for which it is specialized. he ogy, and lexicography.
advantage of such an inhibitory mechanism might be the reduc-
tion of interfering inluence coming from the opposite hemi- Corpus Data as an Object of Study
sphere. A inding recently published by Alexander hiel and It is appropriate to begin a discussion of corpus linguistics with the
coworkers (2006) could be interpreted in this vein. he authors question of whether the language found in corpora is a legitimate
measured the activation of the left and right inferior frontal gyrus object of study. Corpora, after all, contain performance data.
(IFG) in a verb generation task using positron emission tomog- Noam Chomsky (1957 and elsewhere) and others have argued
raphy (see neuroimaging). In some of the administered trials, that linguists should model competence rather than perfor-
repetitive transcranial-magnetic stimulation, a method to induce mance; this has been widely interpreted to mean that the source
a temporary disruption of ongoing neuronal activity, was simul- of linguistic data should be introspective judgments, rather than
taneously applied over the left IFG. Besides a reduction of the naturally occurring spoken or written text. Additional arguments
activation in the stimulated left IFG, this virtual brain lesion also commonly put forward against the use of corpus data are 1) that
induced a relative increase in the response measured in the right, performance may be afected by factors that are not linguistic in
nonstimulated IFG. hus, the suppression of the left IFG area nature, such as memory limitations and the speakers state of
seems to result in a disinhibition of its contralateral homologue. mind, degree of tiredness, and so on, and 2) that performance
he studies cited here illustrate that not only the exchange of data include utterances that are judged ungrammatical by native
information but also the coordination of bihemispheric processing speakers of the language. In response to the irst argument, intro-
is supported by transcallosal connections. Furthermore, a recent spective judgments can also be afected by nonlinguistic factors.
functional imaging study has shown that interindividual difer- Grammaticality judgments often depend on context: Utterances
ences, which can be found in size and micro-architecture of the may seem unacceptable in isolation but perfectly natural in the
CC, have consequences for language processing (Westerhausen proper context, for example, embedded in discourse within a
et al. 2006, 80). Here, the degree of activation diferences between corpus. he inability to imagine an appropriate context is clearly
left and right inferior frontal language areas (in a word produc- irrelevant to grammaticality, but may also be afected by nonlin-
tion task) appeared to be directly related to diferences in the iber guistic factors, such as tiredness. hus, introspective judgments
architecture of the callosal connection. Whether structural CC may lead to the wrong results (Bresnan 2007). In response to the
diferences between individuals also trigger diferences in per- second, the number of ungrammatical utterances in a corpus is
formance or are even associated with language disorders (as was usually small (Labov 1969).
hypothesized for dyslexia) has still to be conirmed. In their favor, corpora provide at least two types of informa-
Ren Westerhausen tion that is not easily available via speaker judgments: frequency
data, which have applications ranging from lexical studies to
WORKS CITED AND SUGGESTIONS FOR FURTHER READING research into language impairment; and historical and longitu-
dinal data, which can be used to model language acquisition and
Friederici, Angela D., D. Yves von Cramon, and Sonja A. Kotz. 2007. Role
language change.
of the corpus callosum in speech comprehension: Interfacing syntax
and prosody. Neuron 53: 13545.
Gazzaniga, Michael S. 2000. Cerebral specialization and interhemi- Deining Characteristics of Corpora
spheric communication Does the corpus callosum enable the human Corpora themselves vary widely in size, form, and content, and
condition? Brain 123.7: 12931326. in fact, almost any collection of data (a single text, a collection
hiel, Alexander, Birgit Schumacher, Klaus Wienhard, Stefanie Gairing, of the works of a single author, speech recorded from a single
Lutz W. Kracht, Rainer Wagner, Walter F. Haupt, and Wolf-Dieter individual at a speciic time) could be considered a corpus.
Heiss. 2006. Direct demonstration of transcallosal disinhibition in
But modern corpora are usually assumed to have the following
language networks. Journal of Cerebral Blood Flow and Metabolism
characteristics:
26.9: 11227.
1. hey are representative samples of the language under
Westerhausen, Ren, Frank Kreuder, Sarah Dos Santos Sequeira, Christof
Walter, Wolfgang Woerner, Ralf A. Wittling, Elisabeth Schweiger, and investigation and have a inite size. Complete corpora for mod-
Werner Wittling. 2006. he association of macro- and microstruc- ern spoken languages are of course impossible to construct,
ture of the corpus callosum and language lateralisation. Brain and because the number of utterances is constantly increasing as
Language 97: 8090. the language is used by its speakers. he goal of corpus build-
ers is to collect a sample that provides a good picture of the
possible utterances of the language, including both rare and
CORPUS LINGUISTICS
common constructions with representative frequencies. Most
his term refers to linguistic research that uses corpus data as corpora contain a broad range of texts from diferent authors/
the primary object of study. he term, therefore, describes a speakers and genres. here are, of course, corpora that by their
methodology rather than a ield of linguistics; corpus research very nature cannot be representative in this way. hese include
has been carried out in most areas of formal and applied corpora of dead languages, where the texts are inite and
231
Corpus Linguistics Creativity in Language Use
restricted to those that have survived over time, and corpora WORKS CITED AND SUGGESTIONS FOR FURTHER READING
of child language during the period of acquisition, which are Bresnan, Joan. 2007. A few lessons from typology. Linguistic Typology
intentionally restricted to a speciic type of speech and a spe- 11.1. Available online at: http://www.stanford.edu/bresnan.
ciic class of speakers. Chomsky, Noam. 1957. Syntactic Structures. he Hague, Paris: Mouton.
2. Modern corpora are machine-readable; that is, they exist Labov, William. 1969. he logic of non-standard English. Georgetown
as computer iles and can be transmitted and manipulated Monographs on Language and Linguistics 22.
electronically. his characteristic has two consequences: First, McEnery, Tony, and Andrew Wilson. 1997. Corpus Linguistics.
corpora can be searched quickly and easily; second, they can Edinburgh: Edinburgh University Press. Contains a full discussion of
be annotated with linguistic and extralinguistic information to many of the topics introduced in this entry.
make them more useful. Sampson, Geofrey, and Diana McCarthy. 2004. Corpus
Linguistics: Readings in a Widening Discipline. London and New
3. Modern corpora are publicly available and are considered
York: Continuum. A variety of corpus studies.
standard tools for research in particular languages. his char-
Taylor, Ann. 2007. he York-Toronto-Helsinki parsed corpus of Old
acteristic has major implications for language study: Empirical English prose. In Using Unconventional Digital Language Corpora. Vol.
results can be replicated and veriied; studies can more easily 2. Ed. J. C. Beal, K. Corrigan, and H. Moisl. Basingstoke, UK: Palgrave-
build on one another, since they are working from the same Macmillan. A detailed description of a historical English corpus, its
empirical base; and difering results must be attributed to dif- morphosyntactic annotation scheme, and its search engine.
ferent methodology or diferent interpretation, rather than to
diferent databases. hus, corpora may have the overall efect CREATIVITY IN LANGUAGE USE
of raising the quality of linguistic research.
Linguists speak of creativity in two senses. One is what Noam
Corpora and Their Annotation Chomsky called the generativity of language. he other is the
One beneit of large corpora can also be a disadvantage: Having imaginative use of language in novel ways.
a million words or more of text available is of very little use if Chomsky described language as a system wherein a inite
the corpus cannot be easily searched. Even for lexical studies, set of formal rules (plus vocabulary) can generate an indeinite
the researcher must use concordances and other software to number of hierarchical structures or sentences. Many of
determine and then collect all of the variant forms and spell- these will be irst-time new. his, said Chomsky, implies that a
ings of individual words in the corpus. For syntactic research, (behaviorist) theory based on surface probabilities cannot pre-
both part of speech and structural annotation is necessary. dict language use, nor explain the nested dependencies within
Consider the standard problem of retrieving relative clauses in sentences.
a corpus of modern standard English. Relative clauses may be he imaginative use of language wasnt stressed by Chomsky.
introduced by that, which or who/whom, or even by nothing In calling himself a cartesian linguist, he noted that prede-
at all: cessors such as Wilhelm von Humboldt had also stressed the cre-
ativity of language. However, Humboldt was referring not to the
(1) a. the book that I read formal generativity of syntax but to the linguistic expression of
b. the book which I read novel thoughts (Boden 2006, 9.iv.fg). his includes straightfor-
c. the book I read ward sentences/phrases conveying new facts or ideas and also
Searching lexically for that and which/who/whom will not only imaginative uses such as metaphor, analogy, and poetic
miss clauses like (1c) but will also ind the examples in (2), which metaphor, or imagery.
are not relative clauses at all: Cognitive science has deined various creative information-
processing mechanisms underlying those imaginative uses.
(2) a. I like that book. he basic principles of mental association are implemented
b. Which one do you want? by connectionism. his models the luidity of concepts
hese problems are termed low recall (missing wanted data (Hofstadter and Mitchell 1993), and (in parallel distributed
like [1c]) and low precision (getting unwanted data like [2ab]) processing [PDP] networks) their deinition by Wittgensteinian
by Ann Taylor 2007. hey can be solved by annotating corpora family-resemblances, rather than necessary and sufficient
for part of speech and (abstract) syntactic structure, and by rules. hey illuminate, for instance, how Coleridge could pro-
using a search engine designed for corpus annotations. duce the imagery in he Ancient Mariner (Boden 2004, 12546).
he construction and use of publicly available corpora have conceptual blending theory (Fauconnier and Turner
revolutionized the way that empirical linguistic research is 2002) outlines how various metaphors and analogies could arise.
conducted. Rather than spending most of their time collect- Classical artiical intelligence (AI) has identiied some hierarchi-
ing data, linguists can now concentrate on asking questions, cal conceptual structures in longterm memory, including gen-
retrieving the relevant data quickly and easily from corpora, eral and culture-speciic assumptions about human motivation
and constructing analyses. Large searchable corpora are now (Boden 2004, 17092). And it has modeled the generation of jokes
publicly available for many diferent languages, both written (see verbal humor) of the form What do you get if you cross
and spoken, historical and contemporary, in many diferent an x with a y? (Binsted and Ritchie 1997). Compare: Q. What
styles and registers. do you get if you cross a sheep with a kangaroo? A. A woolly
jumper. (his joke doesnt work in American English, wherein
Susan Pintzuk jumpers are called sweaters.)
232
Creoles
As that example illustrates, the tacit knowledge of language drastic opposition and inequality between the dominant and
(syntax, semantics, phonetics, morphology, catego- dominated groups speakers of the European superstrate and
rization, dialect, orthography) and also of the world the African substrate languages, respectively. At the opposite
that is needed to use language imaginatively is richly detailed end, the superstrate and substrate speakers had relatively inti-
and widely various. Both hearer and speaker must possess this mate interactions, especially during the settlement period when
knowledge if the creative usage is to be understood. substrate speakers were outnumbered by, and in relatively close
contact and interdependence with, superstrate speakers and
Margaret A. Boden
then, throughout the colonial period, among and around the
groups that played an intermediate bufer role race- and class-
wise. hese continua would entail, throughout colonial history,
Binsted, Kim, and G. D. Ritchie. 1997. Computational rules for punning corresponding continua of second-language (L2) learner vari-
riddles. Humor: International Journal of Humor Research 10: 2576. eties of the superstrate language. hese non-native varieties,
Boden, M. A. 2004. he Creative Mind: Myths and Mechanisms. 2d ed. alongside native varieties, of the superstrate language would
London: Routledge.
in turn become the target for increasingly numerous cohorts of
Boden, M. A.. 2006. Mind as Machine: A History of Cognitive Science.
native Creole speakers (DeGraf 2002, 37494; 2005b, 2009).
Fauconnier, G. R., and Mark Turner. 2002. he Way We hink: Conceptual
My working assumption is uniformitarian: Normal processes
Blending and the Minds Hidden Complexities. New York: Basic Books. of irst- and second-language acquisition (L2A) and use
Hofstadter, D. R., and Melanie Mitchell. 1993. he copycat pro- have underlaid the formation of Creoles as they have the forma-
ject: A model of mental luidity and analogy-making. In Advances tion of non-Creoles. he sociohistorical evidence, as documented
in Connectionist and Neural Computation heory. Vol. 2: Analogical by (e.g.) Salikoko Mufwene (2008), suggests that Caribbean
Connections. Ed. Keith Holyoak and John Barnden, 31112. Norwood, Creoles were not seeded by any sort of structureless pidgins
NJ: Ablex. (i.e., these Creoles were not created with input from early Pidgins
Humboldt, Wilhelm von. [1836] 1988. On Language: he Diversity allegedly spoken by the parents of the irst generation of Creole
of Human Language-Structure and Its Inluence on the Mental
speakers). Such early Pidgins as the immediate predecessors of
Development of Mankind. Trans. Peter Heath. Cambridge: Cambridge
Caribbean Creoles have never been documented, and neither
University Press.
does the contemporary structural evidence support the pos-
tulation of such Pidgins as the primary ancestors of Caribbean
CREOLES Creoles (see the following).
SocioHistorical, Terminological, and Epistemological
Background Creole Exceptionalism
he term Creole exceptionalism (DeGraf 2003) covers a sub-
he concept Creole has not been operationalized with rigorous
set of long-standing hypotheses whereby Creole languages
and reliable criteria in linguistic theory. At best, it is a sociohis-
constitute a sui generis class on phylogenetic and/or structural
torically and politically motivated concept, often misidentiied
grounds. Here is a sample:
as linguistic (DeGraf 2005b, 2009; Mufwene 2008).
Etymologically, the word Creole derives from the Portuguese (i) Creoles are degenerate ofshoots of their European
crioulo and/or Spanish criollo raised in the home (from criar ancestors;
to raise, to breed). In Caribbean history, the labeling of bio- (ii) Creoles are special hybrids with exceptional
logical species, including humans, as Creole seems to have genealogy;
preceded the labeling as Creole of certain speech varieties.
(iii) Creoles are the only contemporary languages with a his-
Both uses referred to nonindigenous varieties that developed
tory of abnormal transmission that deprives them of any
locally, in contrast to their counterparts from Europe and Africa.
structurally full-ledged ancestors.
he original uses of the word were thus devoid of any speciic
structural correlates (Mufwene 2001, 311; Chaudenson and (iv) he Pidgin-to-Creole transition recapitulates the transi-
Mufwene 2001, Chap. 1). tion from pre-human protolanguage to human language.
In keeping with this original usage and to avoid circular- (For a fuller development of these arguments, see DeGraf 2005a,
ity and the sort of controversial linguistic assumptions that 2009.)
are noted in Mufwene 2008, I here ostensively use Creole as a
label for certain speech varieties that became emblematic of the Creoles as Degenerate Offshoots? Its only in the latter part of
newly created communities the Creole communities on and the twentieth century that linguists started refuting the received
around colonial Caribbean plantations. hese are the classic wisdom that Creoles are structurally impoverished variants of
Creole languages. their European norms. In Julien Vinsons scientiic dictionary
Caribbean Creole languages developed mostly among (1889, 3456), Creole languages result from the adaptation of
Europeans and Africans via language acquisition by adults and a language, especially some Indo-European language, to the
children in a complex mix of language-contact settings. he (so to speak) phonetic and grammatical genius of a race that
complex sociohistorical factors therein included a continuum is linguistically inferior. he resulting language is composite,
of social divides and power asymmetries (Chaudenson and truly mixed in its vocabulary, but its grammar remains essen-
Mufwene 2001). One end of this continuum was marked by tially Indo-European, albeit extremely simpliied. For Leonard
233
Creoles
Bloomield (1933, 472), he creolized language has the status of Segmentation and parsing of target speech necessarily tap into
an inferior dialect of the masters speech. substantial aspects of target grammar.
Even in the latter half of the twentieth century, certain lin-
guists claimed that structural linguistic factors, related to (e.g.) CREOLIZATION AS ABNORMAL/BROKEN TRANSMISSION AND
morphological simplicity and a vocabulary [that] is extremely CREOLES AS LIVING FOSSILS? In keeping with the postulated
poor, are among the greatest obstacles to the blossoming of congruence in nineteenth-century philology between the evo-
Creoles (Valdman 1978, 345; cf. Whinnom 1971, 110; Samarin lution of races and that of languages, Alfred de Saint-Quentin
1980, 221; Seuren and Wekker 1986; and Quint 1997, 58). Pieter ([1872] 1989, 40) considered it a property of emerging languages
Seuren (1998, 292) has elevated the alleged extraordinary sim- to be naive and claimed Guyanais Creole as a spontaneous
plicity of Creole languages to historical universal. product of the human mind, freed from any kind of intellectual
here is no reliable empirical or theoretical basis for the claim culture. Similarly, Isle de France Creole was considered an
that Creole languages are uniformly less complex than their infantile language for an infantile race (Reinecke 1980, 11).
European ancestors. For example, certain aspects of my native In twentieth-century linguistics, the abnormal/broken trans-
Haitian Creole (HC) signal an increase in complexity to the extent mission doctrine excludes Creoles from the scope of the com-
that these properties of HC have no counterpart in French, HCs parative method and turns them into new linguistic phyla
European ancestor (DeGraf 2001b, 284). Furthermore, HC, like without ancestry (homason and Kaufman 1988).
any other language, expands its vocabulary as needed, via pro- his doctrine seems related to another myth of origins, as
ductive aixation, neologisms, borrowings, and so on (DeGraf writers in cultural studies (see deconstruction and crit-
2001a; Fattier 1998). ical discourse analysis) might put it that of Creoles as
contemporary (quasi-)replicas of human language at its evo-
CREOLES AS SPECIAL HYBRIDS? Lucien Adams (1883) hybri- lutionary incipience (Bickerton 1990, 171, Chap. 5; 1998, 354;
dologie linguistique hypothesis posited diferent linguistic Bickerton and Calvin 2000, 149). In Derek Bickertons scenario,
templates for diferent races. he latter belong to distinct evolu- the hypothetical Pidgin-to-Creole cycle recapitulates the evolu-
tionary rungs, with their respective linguistic templates ranked tion of Homo erectuss protolanguage into the most primitive
in a corresponding hierarchy of complexity. Upon language instantiations of Homo sapienss language: What happened [in
contact, these templates will cross-fertilize (i.e., hybridize), the formation of Hawaiian Creole] was a jump from protolan-
and the most primitive grammar (in this scenario, the gram- guage to language in a single generation (Bickerton 1990, 171).
mar of the lower race of speakers, i.e., the non-European In this scenario, one sui generis process that allegedly dis-
speakers) imposes an upper bound of complexity on the hybrid rupts normal language transmission and leads to catastrophic
grammar. In such scenario, the European contribution to the language genesis is some form of radical pidginization. he lat-
hybridization of European and non-European languages is lim- ter is claimed to obliterate virtually all stable structural patterns,
ited to supericial traits, such as the phonetic shapes of words including morphology (Bickerton 1999, 69, n. 16), and to lead to a
only these shapes, and not the complex grammars of European structureless early pidgin. Such a Pidgin is putatively unlike any
languages, can be acquired by the allegedly inferior minds of full-ledged human language and more like the hypothetical pro-
the non-Europeans. tolanguage of Homo erectus, our prehistoric hominid ancestors
Claire Lefebvres (1998) relexiication hypothesis is far (Bickerton 1990, 169, 181; 1998, 354; Bickerton and Calvin 2000,
removed from Adams race-theoretical postulates. For Lefebvre, 149). his early Pidgin, by deinition, is non-native, unstable, and
it is because the Africans in Haiti had very limited access to used as an emergency lingua franca across languages. his early
French that they were virtually unable to learn any aspect of Pidgin is argued to abruptly seed the Creole when the former
French grammar. hus, they could only overlay French-derived becomes the acquisition target for the irst generation of locally
phonetic strings on their native substrate grammars, with the lat- born children (see Bickerton 1999, 49) in a way similar to how
ter being kept nearly intact in the original Creole languages. Homo erectus protolanguage seeded the early forms of human
Consider again HC. Current results from L2A research prelanguage as spoken by the irst cohorts of Homo sapiens.
dicts that HC structure would have indeed evolved under some How could the documented pidgins of modern humans and
inluence from the substrate languages. L2A research also docu- the hypothetical protolanguage of Homo erectus evince any
ments that adult learners at every stage acquire more than pho- enlightening similarity? How could the hypothetical Pidgin-to-
netic strings from their target. Unsurprisingly, HC instantiates, Creole transition in modern history resemble the evolution in
alongside substrate-inluenced patterns, a wide range of super- prehistory from Homo erectuss structureless protolanguage to
strate-derived properties that apparently have no analogues in Homo sapienss full-ledged human language? If the transition
the substrate languages (DeGraf 2002). from Homo-erectus protolanguage to Homo-sapiens human
Adams and Lefebvres proposals share one non-uniformitar- language is a relex of brain reorganization via natural selection
ian assumption, namely, that Creole creators, unlike L2 learn- in the course of human evolution, then Bickertons hypothetical
ers elsewhere, were unable to learn anything abstract about Pidgin-to-Creole cycle has nothing to say about such brain reor-
their target language. Yet the lexicon and morphology of HC ganization and its linguistic structural consequences. Indeed
demonstrate that Creole creators were able to segment and Pidgins, under any deinition, relect mental properties of Homo
parse target speech (here, French), including aixes. Such seg- sapiens. Acquisition data suggest that learners at every age and
mentation and parsing contradict the claim that the creators of stage, including Pidgin speakers, have access to the same faculty
HC could not access any abstract property of French grammar. of language as any other human being (Mufwene 2008, ch. 5).
234
Creoles
he broken transmission and linguistic fossils doctrines Evolution of Language, ed. J. Hurford, M. Studdert-Kennedy, and C.
are further undermined by a vast range of comparative data and Knight, 34158. Cambridge: Cambridge University Press.
empirical and theoretical observations. As mentioned earlier, . 1999. How to acquire language without positive evidence. In
there is ample evidence for systematic lexical and morpho-syntac- Language Creation and Language Change, ed. Michel DeGraf, 4974.
tic correspondences between radical Creoles and their European
Bickerton, Derek, and William Calvin. 2000. Lingua Ex
lexiiers from the onset of Creole formation onward (Fattier 1998;
Machina: Reconciling Darwin and Chomsky with the Human Brain.
DeGraf 2001a, 2005b, 2009; Mufwene 2008). here is also ample Cambridge, MA: MIT Press.
evidence for transfer from the African substrate languages into Bloomield, Leonard. 1933. Language. New York: H. Holt and Co.
Creole grammars. his is as expected given the aforementioned Chaudenson, Robert, and Salikoko Mufwene. 2001. Creolization of
facts of Caribbean history and the results from L2A research. he Language and Culture. London: Routledge.
sort of structureless pidgin that is an essential ingredient in the DeGraf, Michel. 2001a. Morphology in Creole genesis: Linguistics and
traditional Pidgin-to-Creole scenario renders mysterious any sys- ideology. In Ken Hale: A Life in Language, ed. M. Kenstowicz, 53121.
tematic set of structural correspondences between Creoles and Cambridge, MA: MIT Press.
their ancestor languages. Besides, the magnitude of structural . 2001b. On the origin of Creoles: A Cartesian critique of Neo-
gaps in the history of non-Creole languages seems comparable Darwinian linguistics. Linguistic Typology 5.2/3, 213310.
. 2002. Relexiication: A reevaluation. Anthropological Linguistics
to, and sometimes even greater than, that of their counterparts
44.4: 321414.
in Creole diachrony (DeGraf 2005b, 2009), pace homason and
. 2003. Against Creole exceptionalism. Language 79: 391410.
Kaufman (1988, 812, 206) and homason (2002, 105). . 2004. Against Creole exceptionalism (redux). Language
If the rigorous criteria of the Comparative Method [CM] 80: 8349.
include the establishment of recurring phonological correspon- . 2005a. Linguists most dangerous myth: he fallacy of Creole
dences in morphemes of identical or similar meanings, including exceptionalism. Language in Society 34: 53391.
much basic vocabulary the establishment of systematic mor- . 2005b. Morphology and word order in creolization and beyond.
phosyntactic correspondences (homason 2002, 103), then the In Handbook of Comparative Syntax, ed. G. Cinque and R. Kayne, 249
available evidence puts Caribbean Creoles squarely in the scope 312. New York: Oxford University Press.
of the CM (DeGraf 2005b, 2009; Mufwene 2008, pace homason). . 2009. Language acquisition in creolization and, thus, language
change. Language and Linguistic Compass 3:888971.
Such evidence militates against the postulation, in Creole for-
Fattier, Dominique. 1998. Contribution ltude de la gense dun
mation, of an exceptional and abnormal break in transmission
crole: LAtlas linguistique dHati, cartes et commentaires. Ph.D. diss.,
with subsequent creation of all new linguistic structure from the
Universit de Provence. Distributed by Presses Universitaires du
hypothetical scraps of a Pidgin. Septentrion, Villeneuve dAscq, France.
Greenield, William. 1830. A Defence of the Surinam Negro-English
The End of Creole Exceptionalism? Version of the New Testament. London: Samuel Bagster.
Creolization difers from language change on sociohistori- Lefebvre, Claire. 1998. Creole Genesis and the Acquisition of
cal and political, not linguistic, grounds. For example, conquered Grammar: he Case of Haitian Creole. Cambridge: Cambridge
peoples involved in forming Caribbean Creoles may have spo- University Press.
ken more languages than their counterparts in the formation Mufwene, Salikoko. 2001. he Ecology of Language Evolution.
of, say, the Romance languages. Furthermore, oppression in Cambridge: Cambridge University Press.
. 2008. Language Evolution. London: Continuum.
the Caribbean was correlated with race. Caribbean Creoles and
Quint, Nicolas. 1997. Les les du Cap-Vert aujourdhui: Perdues dans
Romance languages thus evolved in distinct ecologies, with
limmensit. Paris: LHarmattan.
Caribbean vernaculars ending up disfranchised for sociohis-
Reinecke, John. 1980. William Greenield, a neglected pioneer creolist.
torical reasons. Creolization is a social, not a structural, pro- In Studies in Caribbean Language, ed. L. Carrington, 112. Saint-
cess (Mufwene 2001, 138). he individual speakers engaged in Augustine, Trinidad: Society for Caribbean Linguistics.
language contact, whether in the genesis of Creole or Romance Saint-Quentin, Alfred de. [1872] 1989. Introduction lhistoire de
languages, would have made use of the same [mental] process Cayenne , with tude sur la grammaire crole by Auguste de Saint-
adopted [for the] formation of [their respective new] language Quentin. Antibes: J. Marchand. 1980 edition: Cayenne: Comit de la
(Greenield 1830, 51 f). If so, Creole grammars do not, and could culture, de lducation et de lenvironnement, Rgion Guyane.
not, form a typological class that is aprioristically and fundamen- Samarin, William. 1980. Standardization and instrumentalization of
tally distinguishable from non-Creole grammars (DeGraf 2005b, Creole languages. In heoretical Orientations in Creole Studies, ed.
A. Valdman and A. Highield, 21336. New York: Academic Press.
2009; Mufwene 2008).
Seuren, Pieter. 1998. Western Linguistics: An Historical Introduction.
Michel DeGraf Oxford: Blackwell.
Seuren, P., and Herman W. 1986. Semantic transparency as a factor in
Creole genesis. In Substrata Versus Universals in Creole Genesis, ed.
P. Muysken and N. Smith, 5770. Amsterdam: Benjamins.
Adam, Lucien. 1883. Les idiomes ngro-aryen et malo-aryen: Essai homason, Sarah. 2002. Creoles and genetic relationship. Journal of
dhybridologie linguistique. Paris: Maisonneuve et cie. Pidgin and Creole Languages 17: 1019.
Bickerton, Derek. 1990. Language and Species. Chicago: University of homason, Sarah, and Terrence Kaufman. 1988. Language
Chicago Press. Contact, Creolization, and Genetic Linguistics. Berkeley and Los
Bickerton, Derek.. 1998. Catastrophic evolution: he case for a single Angeles: University of California Press.
step from protolanguage to full human language. In Approaches to the
235
Critical Discourse Analysis
Valdman, Albert. 1978. Le crole: Structure, statut et origine. Paris: ditions he label critical linguistics has given way to critical dis-
Klincksieck. course analysis, as the ield developed to include wider areas
Vinson, Julien. 1889. Croles. In Dictionnaire des sciences anthro- of social concern and more social theory. In particular, in addi-
pologiques, ed. A. Bertillon, 3457. Paris: Doin. tion to the writings of Marx and Marxian writers (see marxism
Whinnom, Keith. 1971. Linguistic hybridization and the special case of
and language ), CDA practitioners have often made appeal
pidgins and creoles. In Pidginization and Creolization of Languages,
to Antonio Gramsci, Michel Foucault, and the British sociolo-
ed. D. Hymes, 91115. Cambridge: Cambridge University Press.
gist Anthony Giddens. he label CDA became associated with
the work of, in particular, Norman Fairclough (e.g. 1989, 1992),
Teun van Dijk (e.g. 1993, 1998, 2005) and Ruth Wodak (1996;
CRITICAL DISCOURSE ANALYSIS
Wodak et al. 1999). Faircloughs work, mainly cast within a neo-
his term has been used since the 1990s by a group of academics Marxian mold, has developed detailed concepts and models of
initially in the United Kingdom, but also increasingly in the rest discourse, intertextuality, and genre, while leaving the
of Europe, Australia, South America and more recently in Asia. linguistic dimension of discourse comparatively undeveloped.
he various practitioners of critical discourse analysis (CDA) His work can be characterized as social theory that gives full
who would associate themselves with this label have in common recognition to the constitutive role of language in society, in the
some concept of what it means to be critical, various notions of form of interlinked discursive practices. Van Dijks work is
discourse inluenced strongly by sociology and social theory, and rooted in formal discourse analysis of the 1970s with a cognitive
a range of descriptive methods borrowed from various linguistic tendency, and has sought to provide deeper discourse-based
theories. Broadly speaking, what all CDA practitioners are con- understanding of major pragmatic notions such as context,
cerned with is the way language is integrated with society, but as well as of the social-theoretic notion of ideology. Wodaks
unlike most sociolinguists, they espouse an overtly ethical or work has developed the discourse historical method (Wodak
political stance in engaging with this question. While its goal is to and Meyer 2001), a methodology for empirical investigation
increase understanding of the relationship between society and that advocates the study of intersecting texts and samples of
language, CDA does not in general contribute to the description talk collected from various milieus and representing various
or theorization of human language systems. genres, for which analysis of the historical context is regarded
he term critical was irst used to characterize an approach as crucial. Numerous other scholars of the same period, whose
to language study that was dubbed critical linguistics by Roger accomplishments it is not possible to describe here, produced
Fowler and his colleagues (1979) and by Gunther Kress and work that was overtly or implicitly critical, particularly in
Robert Hodge ([1979] 1993). hese scholars took some inspiration France, Germany, the Netherlands and Belgium, the Hispanic
from George Orwells informal critique of the use of language in world, and Australia.
political life and his dystopian fantasy of newspeak in the novel he CDA literature has introduced a number of key con-
Nineteen Eighty-Four. But they also acknowledged intellectual cepts and claims. he principle that discourse is constitutive of
debts to Valentin Voloshinov (see dialogism and hetero- social processes and structures has been extended to include
glossia) and to Frankfurt School critical theory, especially the other semiotic systems, notably pictorial ones (Kress and van
work of Jrgen Habermas (ideal speech situation). he ini- Leeuwen 2001). Discourses, in the plural, are understood as rel-
tial impetus of critical linguistics was, to some extent, grounded atively stable uses of language serving the structuring of social
in the Enlightnement philosophical notion of critique. However, life, organizations, and political systems. Such discourses con-
in many respects, the work produced by critical linguistics and sist of interlinked discursive practices or genres. Discourses
its successor CDA has been colored by, even tainted by, the may be of various kinds and operate in diferent ways. hus,
everyday sense in which one speaks negatively of criticizing a political ideologies, scientiic worldviews, ethical systems, and
person or group of persons. Among other ideas, critical linguists the like are said to represent the world in particular ways. Genres
held that the use of language could lead to mystiication, which regulate interaction and thus control social behavior, examples
analysis could elucidate. For example, a missing by-phrase in being interviews, news broadcasts, medical consultations, edu-
English passive constructions might be seen as an ideological cational examinations, and so forth. When these discursive
means for concealing or mystifying reference to an agent. he practices are viewed as interlinked, they involve intertextual-
same is claimed for nominalizations such as destruction or arrest, ity and interdiscursivity, leading to the colonization of one
which have neither tense nor aspect and can also appear without discourse by another and to hybrid genres a process regarded
a by-phrase specifying an agent. Conspicuous clustering of syn- as integral to and an index of social change. Discursive prac-
onymous or near-synonymous lexical items around a particu- tices viewed as a complex network are referred to as an order of
lar topic, or overlexicalization, is felt to indicate some critical discourse (Fairclough 1992). A crucial claim of CDA is that the
problematic social process or institution. Analysis of the refer- details of particular instances of text and talk are related in com-
ents associated with diferent kinds of participant roles in clauses plex ways to these structures, structures regarded as embodying
(e.g., actor, goal, beneiciary) is regarded as a way of detecting power.
patterns in the way social relations, especially power relations, CDA has not produced a theory of language that explains how
are represented. he most signiicant principle of critical linguis- meanings produced by utterances have the tight connection to
tics, carried over into CDA, is the important observation that use social structures, which is often claimed. here are nonetheless
of language is a social practice, that is, a form of action consti- numerous examples of the description, analysis, and interpreta-
tutive of and constituted by social processes and structures. tion of utterances in the CDA literature, and these are dependent
236
Critical Discourse Analysis
on existing linguistic frameworks. In critical linguistics, early special focus in many CDA works on issues such as racism, mili-
transformational and generative grammar was used as tarism, media bias, marketization, and gender.
a framework for explaining the supposedly ideological efects of his survey necessarily neglects much work that has a critical
the language forms found in texts. One questionable claim was stance without subscribing to the CDA label. Such work would
that a linguistic system that is, the syntax, lexicon, and if not include the writing on language and gender that has arguably had
the phonology then the writing system of a language could be an impact on modern social behavior (for an overview see Eckert
inherently ideological (Kress and Hodge [1979] 1993), an idea and McConnell-Ginet 2003). It would also include work on soci-
revived in equally questionable fashion by Robert Hodge and ety and on controversial political matters in North America (G.
Kam Louie (1998) in their discussion of Chinese. A more lasting Lakof 1996); R. Lakof 1990; Lemke 1995). Further, it should also
inluence has been M. Hallidays systemic-functional grammar, include the emerging scholarship on discourse in its sociopoliti-
which CDA has drawn on for its classiication of clause types, its cal context in China (cf. Gu 2001) and on a smaller scale in the
model of modality and theory of register. his framework Middle East and Africa. It may be the case that something like the
has the advantage of being formulated within a social-semiotic CDA approach emerges in periods of signiicant socioeconomic
perspective. However, it has the disadvantage that it is inad- or political change: In some respects, CDA may be considered to
equate for the analysis of certain textual phenomena that CDA have the character of a social movement.
writers have wanted to talk about, particularly such theoretically
Paul Chilton
diicult areas as implied meanings and metaphor. he reason for
this is that systemic-functional grammar has an encoding model WORKS CITED AND SUGGESTIONS FOR FURTHER READING
of linguistic meaning and fails, as do certain kinds of CDA work,
Bourdieu, Pierre. 1982. Ce que parler veut dire: Lconomie des changes
to take account of the fact that language understanding depends
linguistiques. Paris: Fayard.
on nonlinguistic knowledge. By contrast, alongside these theo-
Chilton, Paul. 1996. Security Metaphors: Cold War Discourse from
retical frameworks, cognitive approaches have increasingly
Containment to Common House. New York: Peter Lang.
provided theoretical resources, as is shown by the work of van . 2004. Analysing Political Discourse: heory and Practice.
Dijk (e.g., 1998, 2005), who has drawn on models from cogni- London: Routledge.
tive psychology, and Paul Chilton (1996, 2005), who has drawn . 2005. Missing links in mainstream CDA: Modules, blends
on conceptual metaphor theory and blending theory (see and the critical instinct. In A New Agenda in (Critical)l Discourse
conceptual blending). Analysis: heory and Interdisciplinarity, ed. Ruth Wodak and Paul
CDA has itself been the focus of principled criticism for at Chilton 1950. Amsterdam: Benjamins.
least three reasons. One is its claim to be socially committed and Eckert, Penelope, and Sally McConnell-Ginet. 2003. Language and
objective (Widdowson 2005). Another criticism has come from Gender. Cambridge: Cambridge University Press.
Fairclough, Norman. 1989. Language and Power. London: Longman.
within CDA itself on the basis of the view that CDA has been
. 1992. Discourse and Social Change. Cambridge, UK: Polity.
too negative and should examine, even advocate, positive
Fowler, Roger, Gunther Kress, Robert Hodge, and Tony Trew, eds. 1979.
analysis of discourse that it approves of (Martin 2004). Kieran
Language and Control. London: Routledge.
OHalloran (2003) has criticized CDAs inadequate notion of Goatly, A. 2007. Washing the Brain: Metaphor and Hidden Ideology.
cognition, in particular its symbolicism that is, the assump- Amsterdam: Benjamins.
tion that mental processes correspond to and can be inluenced Gu, Yueguo. 2001. he changing orders of discourse in a changing
by the manipulation of symbols. Chilton (2005) criticizes CDAs China. In Studies in Chinese Linguistics. Vol 2. Ed. Haihua Pan, 3158.
failure to engage with developments in cognitive linguistics Hong Kong: Linguistic Society of Hong Kong.
and evolutionary psychology, arguing also that a critical faculty Hodge, Robert, and Kam Louie. 1998. he Politics of Chinese Language
may be universal and that CDA may exaggerate the power of and Culture. London and New York: Routledge.
discourse. he fact remains, however, that the notion of critical Kress, Gunther, and Robert Hodge. [1979] 1993. Language as Ideology.
London: Routledge.
analysis of discourse remains increasingly inluential through-
Kress, Gunther, and heo van Leeuwen. 2001. Multimodal Discourse: he
out the world.
Modes and Media of Contemporary Communication. London: Arnold.
In general, it can be argued that, taken as a whole, CDA writ- Lakof, George. 1996. Moral Politics: What Conservatives Know hat
ing provides something more like a social theory than a linguistic Liberals Dont. Chicago: University of Chicago Press.
theory, although this is not to overlook many examples of perspi- Lakof, Robin. 1990. Talking Power: he Politics of Language in Our Lives.
cacious analysis of individual utterances in their sociopolitical New York: Basic Books.
contexts, for example, analyses of leadership speeches, various Lemke, Jay. 1995. Textual Politics: Discourse and Social Dynamics.
media genres, teacher pupil exchanges, and the like. his case- London: Taylor & Francis.
by-case approach may in itself be a disadvantage, since CDA is Martin, Jim R. 2004. Positive discourse analysis: Power, solidarity and
premised on claims about entire discourse networks, a prob- change, Revista Canaria de Estudios Ingleses. 49: 179200.
lem addressed by Wodaks proposals for an empirical method OHalloran, Kieran. 2003. Critical Discourse Analysis and Language
Cognition. Edinburgh: Edinburgh University Press.
based on the analysis of linked instances of diferent genres
Saussure, Louis de, and Peter Schulz, eds. 2005. Manipulation and
within organizational structures (Wodak et al. 1999; Wodak and
Ideologies in the Twentieth Century: Discourse, Language, Mind.
Meyer 2001). he most general characteristic of CDA has been Amsterdam: Benjamins.
not its linguistics but its ethically or politically committed stance. Van Dijk, Teun. 1993. Elite Discourse and Racism. Newbury Park,
Indeed, a number of CDA practitioners make an emancipatory CA: Sage.
mission the core element of their work, and this is evident in the . 1998. Ideology. London: Sage.
237
Critical Periods
. 2005. Contextual knowledge management in discourse produc- birdsong and human language) to bonding in sheep and
tion: A CDA perspective. In A New Agenda in (Critical) Discourse vision in cats and primates. he efects of maturation on minute
Analysis: heory and Interdisciplinarity, ed. Ruth Wodak and Paul details of behavior are continuously being mapped out, not least
Chilton, 71100. Amsterdam: Benjamins
in neurobiological research. Adding to the classic results on sen-
Widdowson H. G. 2005. Text, Context, Pretext: Critical Issues in Discourse
sitive periods for human vision obtained by the 1981 Nobel med-
Analysis. Oxford: Blackwell.
icine laureates David Hubel and Torsten Wiesel, one example
Wodak, Ruth. 1996. Disorders of Discourse. London: Longman.
Wodak, Ruth, and Michael Meyer. 2001. Methods of Discourse Analysis. is the more recent understanding obtained from experimental
London: Sage. studies on rhesus monkeys that certain irreversible visual dis-
Wodak, Ruth, Rudolf de Cillia, Martin Reisigl, and Karen Liebhart. 1999. orders, such as impaired vision of speciic movements or gaze
he Discursive Construction of National Identity. Edinburgh: Edinburgh holding, result from failed visual experience during a three-week
University Press. neonatal sensitive period (Boothe 1997). As the visual systems
of rhesus monkeys and humans are in principle identical, these
results are claimed to be interpretable for humans where the
CRITICAL PERIODS
three-week neonatal sensitive period for monkeys corresponds
he idea of a critical period for language acquisition is one of the to a three-month period for humans.
most debated issues in language acquisition theory. One reason Human language as a system of communication is an
for controversy in the empirically based discussion of the critical extremely complex type of behavior. As in other kinds of complex
period hypothesis (CPH) has been the diferent understandings behavior, such as vision, it is highly likely that maturation afects
of what a critical period actually means and what efects it might some but not all aspects of language acquisition. Diferent details
have on language. Another more basic philosophical cause for of language may be constrained by diferent phases of matura-
lack of unanimity is the notions central symbolic role in the tion, something that is covered by the notion of multiple critical
naturenurture divide and the concomitant ideological prefer- periods as suggested, for example, by H. W. Seliger (1978). Much
ences among researchers to stress biological or environmental language learning occurs with reasonable ease over the whole life
aspects of language development. It is probably fair to say that span, for example, the learning of new vocabulary. What seems
the rhetorical tone that is sometimes noticeable in debates on to be the key distinguishing parameter between child learners
the CPH originates to a considerable extent from the various and adult learners is the fact that young learners in the majority
epistemological commitments of the protagonists, nativist or of cases seem to be able to reach an overall proiciency level in
constructivist, cognitive or social constructivist, general cogni- the second language, phonetics and phonology included,
tive or modular, and so on. that allows them to be taken for native speakers of that language,
he CPH is relevant for all kinds of language learning. When while this is extremely rare in adult learners. herefore, an obvi-
much more of the discussion is about second language learn- ous candidate for what may be maturationally constrained in
ing (see second language acquisition) than about irst language learning is the ability to reach ultimate nativelikeness.
language learning, it is a relection of the fact that irst language A central role for nativelikeness was outlined already in Eric
learning, except in cases of isolation from language input, starts Lennebergs original formulation of the CPH in his volume
from age zero. his means that data from delayed irst language Biological Foundations of Language:
acquisition are rare. On the contrary, for second language acqui-
[A]utomatic acquisition from mere exposure to a given lan-
sition (SLA), massive data from language learning at diferent
guage seems to disappear [after puberty], and foreign languages
phases of the life span are available. Most of this entry, therefore,
have to be taught and learned through a conscious and labored
addresses second language acquisition rather than irst language
efort. Foreign accents cannot be overcome easily after puberty.
acquisition.
However, a person can learn to communicate at the age of forty.
he notion of critical period has been used by ethologists to
his does not trouble our basic hypothesis on age limitations
explain the fact that the development of several aspects of spe-
because we may assume that the cerebral organization for lan-
cies-speciic behavior are dependent on early stimulus exposure
guage learning as such has taken place during childhood, and
or experiences. A critical period can generally be deined, there-
since natural languages tend to resemble one another in many
fore, as a time span in early life during which the organism is
fundamental aspects the matrix for language skills is present.
responsive to those stimuli in the external environment that are
(1967, 176)
crucial or relevant for a behavior (or capacity) to develop eventu-
ally in keeping with a species-speciic standard. If the organism Lennebergs formulation, in actual fact, addressed many of the
does not encounter or experience the particular stimuli during issues that have been researched and debated over the years: 1)
the time span for sensitivity, that behavior will either not develop the diference between the automatic, or implicit, acquisition
at all or eventually reach an end state that difers from the spe- assumed to be possible within the critical period and conscious,
cies-speciic ultimate standard. In addition to maturationally or explicit, learning postulated to be the only remaining option
constrained learning in humans and other species, there are two for late learners; 2) puberty as the end point for a critical period;
other kinds of learning, namely, learning that occurs with equal 3) the ability to reach nativelike ultimate proiciency for L2 learn-
success at any time over the lifespan and learning that becomes ers who start at ages below that point; and 4) the efect that any
efective only at later phases of cognitive development. early language learning can have on subsequent languages.
Examples of maturationally constrained behaviors range In addition, albeit not mentioned in this particular quotation,
from imprinting in geese and song learning in songbirds (see Lenneberg proposed 5) lateralization as the neural mechanism
238
Critical Periods
that could explain an end point for the critical period at puberty. either with phenomena that are not covered by the hypothesis
However, such a role for lateralization was soon demonstrated or issues that are not decisive components of it. An example
not to be correct: he widely accepted theory in the 1960s that of the former are early objections to the CPH based on results
lateral specialization is progressive, increasing with age from showing that younger learners are, in fact, not better than older
infancy to adolescence has long been abandoned in the face of learners in initial rate of learning a second language (Snow and
accumulated evidence that indicates that cerebral lateral spe- Hoefnagel-Hhle 1977). he CPH is not about what happens in
cialization is established from early infancy or even during initial stages of second language learning but indeed about long-
fetal development (Paradis 2004, 107). term impacts, that is, what is ultimately attainable in language
Lennebergs proposals were based mainly on general informal learning. An example of objections to nondecisive components
observations, but subsequent research has provided theoretical is the type of criticism that says that if lateralization is not the
frameworks and substantial empirical data that, taken together, cerebral mechanism behind the diferential behavior of children
can be given an interpretation that is compatible with the predic- and adults (Krashen 1973), there can be no critical period. he
tions of the hypothesis. he irst point, on the distinction between CPH is not dependent on lateralization as such; other cerebral
(implicit) acquisition and (explicit) learning, has been addressed mechanisms may be at work.
in a series of theoretical discussions about language learning dif- More substantial criticisms to the CPH have frequently
ferences between children and adults (see DeKeyser 2003), with focused on the correlation between AO and ultimate attainment.
some of its most well known exponents in S. Krashens (1988) For various reasons, most prominently because correlation does
acquisition-learning hypothesis, R. Bley-Vromans (1989) fun- not equal cause, results showing a strong negative correlation
damental diference hypothesis, and S. Felixs (1985) competition between AO and ultimate proiciency are not accepted by every-
hypothesis. Also, the various perspectives on access to univer- one as evidence of maturational constraints. It has been sug-
sal grammar (UG) (full/direct, partial/indirect, or no access) gested that there may be other factors of a social or psychological
in UG-framed SLA theories (see White 2003) can be translated nature, such as length of residence, input frequency, motivation,
into a CPH framework (cf. Pinkers 1994 use-it-then-loose-it general cognitive changes, or L1 use, that would explain these
hypothesis). correlations (cf. Flege, Frieda, and Nozawa 1997). However, in
As for the second point, a multitude of studies have singled studies that have used statistical measures that are able to assess
out puberty or early adolescence as an end point for high or the relative weight of diferent dependent variables, it has consis-
nativelike proiciency levels in a second language. Several other tently been shown that AO is the strongest and often only factor
studies have pointed to a discontinuity at earlier ages, in particu- in predicting ultimate attainment. Results are often in the range
lar around age six, especially for phonology and grammar (Long of 50 percent of the variance explained by AO, while other factors
2005; see, especially, Johnson and Newport 1989). add only 26 percent in explaining the variance (see DeKeyser
In relation to point three on obtained nativelikeness in and Larson-Hall 2005). As these other factors do not correlate
younger and older starters, empirical observations are by and highly with achieved ultimate attainment, and as none of them
large compatible with the CPH; that is, child learners frequently has a convincing link to age, this leaves us with maturation as the
have results in the range of native controls, whereas this is rare, strongest candidate for explaining the correlation.
or even claimed never to have been demonstrated, in adult In addition, it has been claimed that AO and ultimate attain-
learners (Hyltenstam and Abrahamsson 2003; Long 2005). In ment correlations exhibit patterns of a linear decrease, rather
many studies, however, nativelikeness is not focused upon. It is, than one of discontinuity, which would be expected for the
rather, the level of ultimate attainment, nativelike or not, among CPH: At the end of the period, there should be an obvious of-
younger and older learners that is correlated with age of onset set, after which we would expect a lattening of the interaction.
(AO). One of the most robust results in CPH-related research is a Studies such as E. Bialystok and B. Miller (1999) obtained results
strong negative correlation between AO and ultimate attainment indicating a linear decline through all AOs, and D. Birdsong
in a second language. and M. Molis (2001) saw age efects among postpuberty and
In addition, in relation to Lennebergs fourth point, the adult learners generally. hese authors interpret their results as
hypothesis correctly seems to predict the fact that delayed expo- evidence against the CPH and suggest general age-dependent
sure to an L1 more severely afects the level of ultimate attain- cognitive factors as causes. However, another possible back-
ment than delayed exposure to an L2. Case studies of abused ground for a linear decrease might be a combined efect where
children who have not been exposed to any language before maturation plays the dominating role up through puberty and
puberty (Curtiss 1977) show severe limitations in the develop- adolescence and where social/psychological factors become
ment of grammar and pronunciation, whereas learning a second more important for explaining the variation after maturation is
language from the same age allows high levels of proiciency in complete (Birdsong 1999; Hyltenstam and Abrahamsson 2003).
that language. It has frequently been pointed out that depriva- It is probably fair to say that the issue of discontinuity is far from
tion data are diicult to interpret in terms of a critical period for solved.
language, but similar results have been obtained from studies Finally, counterarguments have addressed the issue of
of delayed irst language exposure to American Sign Language nativelikeness evidence for the CPH. Some authors claim that
(ASL) (e.g., Mayberry 1993). nativelike ultimate attainment among late learners is not as rare
A number of counterarguments to the CPH have also been as has been previously thought, something that would indeed
put forward over the decades. here is a current consensus falsify the hypothesis. Birdsong (2005) reviewed frequencies of
that some of these are, in fact, not valid arguments, as they deal L2 participants who performed in the range of native speakers
239
Critical Periods Culture and Language
and claimed that studies show up to 15 percent of postpuber- acquisition of English as a second language. Cognitive Psychology
ty-investigated samples that reach this level. At the other end, 21: 6099.
many studies have shown that nativelike ultimate attainment is Krashen, S. 1973. Lateralization, language learning, and the critical
far from always obtained among learners with prepuberty AOs period: Some new evidence. Language Learning 23: 6374.
. 1988. Second Language Acquisition and Second Language
(Abrahamsson and Hyltenstam 2009).
Learning. Boston: Prentice-Hall.
Research related to the idea of a critical period for language
Lenneberg, E. 1967. Biological Foundations of Language. New York: Wiley
is central to language acquisition theory, in particular for the and Sons.
understanding of deining diferences between irst and second Long, M. H. 2005. Problems with supposed counter-evidence to the
language acquisition. Intensive eforts at understanding the fac- critical period hypothesis. International Review of Applied Linguistics
tors behind age diferences in language acquisition outcomes 43: 287317.
have been made over the last 15 years, but the ield is far from Mayberry, R. I. 1993. First-language acquisition after childhood dif-
approaching a consensus about the role of maturational con- fers from second-language acquisition: he case of American Sign
straints or critical periods. However, current claims for recon- Language. Journal of Speech and Hearing Research 36: 125870.
sidering deinitions of concepts, analytical instruments, and Paradis, M. 2004. A Neurolinguistic heory of Bilingualism.
research methodologies (see, e.g., Birdsong 2005; Hyltenstam Amsterdam: Benjamins.
Pinker, S. 1994. he language instinct: How the mind creates language.
and Abrahamsson 2003; Long 2005) are promising for decisive
New York: Morrow.
steps forward in the near future.
Seliger, H. W. 1978. Implications of a multiple critical periods hypoth-
Kenneth Hyltenstam esis for second language learning. In Second Language Acquisition
Research, ed. W. Ritchie, 1119. New York: Academic Press.
Snow, C., and M. Hoefnagel-Hhle. 1977. Age diferences in the pronun-
ciation of foreign sounds. Language and Speech 20: 35765.
Abrahamsson, N., and K. Hyltenstam. 2009. Age of onset and nativelike- White, L. 2003. On the nature of interlanguage representation: Universal
ness in a second language: Listener perception versus linguistic scru- grammar in the second language. In Doughty and Long 2003, 1942.
tiny. Language 59: 249306.
Bialystok, E., and B. Miller. 1999. he problem of age in second-lan-
guage acquisition: Inluences from language, structure, and task.
CULTURE AND LANGUAGE
Bilingualism: Language and Cognition 2: 12745. Culture and language are connected in a myriad ways. prov-
Birdsong, D. 2005. Interpreting age efects in second language acquisi- erbs, politeness, linguistic relativism, cooperative
tion. In Handbook of Bilingualism: Psycholinguistic Perspectives, ed. J. principle, metaphor, metonymy, context and co-text,
Kroll and A. M. B. De Groot, 10927. Cambridge: Cambridge University
semantic change, discourse (see discourse analysis
Press.
[foucaultian] and discourse analysis [linguistic]),
Birdsong, D., ed. 1999. Second Language Acquisition and the Critical
Period Hypothesis. Mahwah, NJ: Lawrence Erlbaum.
ideology and language, print culture, oral culture,
Birdsong, D., and M. Molis. 2001. On the evidence for maturational literacy, sociolinguistics, and speech-acts are just some
constraints in second-language acquisition. Journal of Memory and of the entries in this encyclopedia that deal with some obvious
Language 44: 23549. connections between culture and language. Several disciplines
Bley-Vroman, R. 1989. What is the logical problem of foreign language within the language sciences attempt to analyze, describe, and
learning? In Linguistic Perspectives on Second Language Acquisition, explain the complex interrelations between the two broad areas.
ed. S. Gass and J. Schachter, 4168. Cambridge: Cambridge University (For a brief and clear survey, see Kramsch 1998.)
Press.
Boothe, R. G. 1997. A neonatal visual deprivation syndrome. Perception Culture and Language as Meaning Making
26: 766.
Can we approach this vast variety of topics from a more uniied
Curtiss, S. 1977. Genie: A Psycholinguistic Study of a Modern-day Wild
perspective than is traditionally done and currently available?
Child. New York: Academic Press.
DeKeyser, R. M. 2003. Implicit and explicit learning. In Doughty and
he relationship between culture and language can be dealt with
Long 2003, 31348. if we assume that both culture and language are about making
DeKeyser, R. M., and J. Larson- Hall. 2005. What does the critical meaning. his view of culture comes closest to that proposed by
period really mean? In Handbook of Bilingualism: Psycholinguistic Cliford Geertz, who wrote: Man is an animal suspended in webs
Approaches, ed. J. F. Kroll and A. M. B. De Groot, 88108. Oxford: Oxford of signiicance he himself has spun. I take culture to be those
University Press. webs, and the analysis of it to be therefore not an experimental
Doughty, C., and M. Long, ed. 2003. Handbook of Second Language science in search of law but an interpretative one in search of
Acquisition. Oxford: Blackwell. meaning (1973, 5). In this spirit, I suggest that we approach both
Felix, S. 1985. More evidence on competing cognitive systems. Second culture and language as webs of signiicance that people both
Language Research 1: 4772.
create and understand. he challenge is to see how they are cre-
Flege, J. E., E. M. Frieda, and T. Nozawa. 1997. Amount of native-lan-
ated and understood often in multiple and alternative ways.
guage (L1) use afects the pronunciation of an L2. Journal of Phonetics
25: 16986.
We have a culture when a group of people living in a social,
Hyltenstam, K., and N. Abrahamsson. 2003. Maturational constraints in historical, and physical environment make sense of their experi-
SLA. In Doughty and Long 2003, 53988. ences in a more or less uniied manner. his means, for example,
Johnson, J. S., and E. L. Newport. 1989. Critical period efects in sec- that they understand what other people say, they identify objects
ond language learning: he inluence of maturational state on the and events in similar ways, they ind or do not ind behavior
240
Culture and Language
appropriate in certain situations, they create objects, texts, and and Johnson 1980). When meaning making is based on such ele-
discourses that other members of the group ind meaningful, mentary human experiences, the result may be (near-)universal
and so forth. In all of these and innumerable other cases, we have meaning (content) though under a particular interpretation
meaning making in some form: not only in the sense of produc- (construal), that is, conceived of in a certain manner, to use
ing and understanding language but also in the sense of correctly Hoyt Alversons phrase (1991, 97).
identifying things, inding behavior acceptable or unacceptable, Language, on this view, consists of a set of linguistic signs, that
being able to follow a conversation, being able to generate mean- is, pairings of form and meaning (which can range from simple
ingful objects and behavior for others in the group, and so forth. morphemes to complex syntactic constructions). Learning
Meaning making is a cooperative enterprise (linguistic or other- a language means the learning of such linguistic signs. hus,
wise) that always takes place in a large set of contexts (ranging language can be regarded as a repository of meanings stored in
from immediate to background) and that occurs with varying the form of linguistic signs shared by members of a culture. his
degrees of success. People who can successfully participate in lends language a historical role in stabilizing and preserving a
this kind of meaning making can be said to belong to the same culture. his function becomes especially important in the case
culture. Spectacular cases of unsuccessful participation in joint of endangered languages (see extinction of languages),
meaning making are called culture shock. and it often explains why minorities insist on their language
his kind of meaning-based approach to culture can be rights (see language policy).
found in George Lakofs (1996) work on American politics, Mark Members of a culture interact with one another for particu-
Turners (2001) investigations into the cognitive dimensions lar purposes. To achieve their goals, they produce particular dis-
of social science, and Zoltn Kvecsess (2005) study of meta- courses. Such discourses are assemblies of meanings that relate
phorical aspects of everyday culture. Gary Palmer makes such a to particular subject matters. When such discourses provide a
meaning-based approach the cornerstone of what he calls cul- conceptual framework within which signiicant subject mat-
tural linguistics and applies it to three central areas of anthropo- ters are discussed in a culture, and when they function as latent
logical linguistics: Boasian linguistics, ethnosemantics, and the norms of conduct, the discourses can be regarded as ideolo-
ethnography of speaking (1996, 45). gies (see, e.g., Charteris-Black 2004; Musolf 2004; Goatly 2007).
What is required for meaning making? he main meaning- Discourse in this sense is another source of making meaning in
making organ is the brain/mind. he brain is the organ that cultures. A large part of socialization involves the learning of how
performs the many cognitive operations that are needed for to make meaning in a culture.
making sense of experience and that include categorization,
igure-ground alignment, framing knowledge, metaphorical Three Examples of Meaning Making
understanding, and several others. Cognitive linguists and cog- As the irst example, consider how people make sense of the spa-
nitive scientists in general are in the business of describing these tial orientation of objects around them. What we ind in language
operations. Cognitive linguists believe that the same cognitive after language is that speakers conceptualize the spatial orienta-
operations that human beings use for making sense of experi- tion of objects relative to their own bodies (Levinson 1996). his
ence in general are used for making sense of language. On this means that they operate with such orientations as right and left
view, language is structured by the same principles of operation or in front of and behind. Both pairs of concepts make use of the
as other modalities of the mind. However, these cognitive opera- human body in order to locate things in space. hus, we can say
tions are not put to use in a universally similar manner; that is, that the window is on my left and that the church is in front of
there can be diferences in which cognitive operations are used us. If we did not conceptualize the human body as having right
to make sense of some experience in preference to another, and and left sides and if we did not have a forward (and backward)
there can be diferences in the degree to which particular opera- orientation aligned with the direction of vision, such sentences
tions are utilized in cultures. his leads to what is called alter- would not make too much sense. But in our efort to understand
native construal in cognitive linguistics (see Langacker the world we do rely on such conceptualization. his is called an
1987). Moreover, the minds that evolve on brains in particular ego-centered, or relativistic, spatial orientation system.
cultures are shaped by the various contexts (historical, physical, Since so many of the worlds languages have this system and
discourse, etc.) that in part constitute cultures (Kvecses 2005). because the system is so well motivated in our conception of
his leads to alternative conceptual systems. the human body, we would think that the ego-centered system
Many of our most elementary experiences are universal. is an absolute universal and that no culture can do without it.
Being in a container, walking along a path, resisting some physi- However, as Stephen Levinson (1996) points out, this is just a
cal force, being in the dark, and so forth, are universal experi- myth. he native Australian language of Guugu Yimithirr has a
ences that lead to image schemas of various kinds (Johnson radically diferent system:
1987; Lakof 1987). he resulting image schemas (container,
source-path-goal, force, etc.) provide meaning for much of Take, for example, the case of the Guugu Yimithirr speakers
our experience either directly or indirectly in the form of con- of N. Queensland, who utilize a system of spatial conception
ceptual metaphors. Conceptual metaphors may also receive and description which is fundamentally diferent from that
their motivation from certain correlations in experience, when, of English-speakers. Instead of concepts of relativistic space,
for instance, people see correlations between two events (such wherein one object is located by reference to demarcated regions
as adding to the content of a container and the level of the sub- projected out from another reference object (ego, or some land-
stance rising), leading to the metaphor MORE IS UP (see Lakof mark) according to its orientation, Guugu Yimithirr speakers use
241
Culture and Language
a system of absolute orientation (similar to cardinal directions) the one that it is some kind of physical object) can be explicitly
which ixes absolute angles regardless of the orientation of the negated and efectively canceled. his is how new art movements
reference object. Instead of notions like in front of, behind, were born out of a successful new deinition. More importantly,
to the left of, opposite, etc., which concepts are uncoded in there are always some people who do not accept the deinition
the language, Guugu Yimithirr speakers must specify locations that most people take to be deinitional. his small but signii-
as (in rough English gloss) to the North of, to the South of, cant minority can constantly challenge, undermine, or plainly
to the East of, etc. he system is used at every level of scale, negate every one of the features that the majority take to be def-
from millimeters to miles, for there is (efectively) no other sys- initional and essential. If they were essential, they could not be
tem available in the language; there is simply no analogue of the so easily challenged and canceled. We can suggest that the con-
Indo-European prepositional concepts. (Levinson 1996, 180) cept of art has a central member the traditional conception
and many noncentral ones. he noncentral ones may become
hus, according to Levinson, the Guugu Yimithirr speakers
the prototypes of art for some people, and then these new pro-
must carry a mental map in their head of everything surround-
totypes can be further challenged. Concepts like art assume a
ing them, with the map aligned for the four quadrants. With the
prototype-based organization, and it is their very structure that
help of such a mental map, they can identify the location of any
invites contestation. We can only understand the nature of the
object with a high degree of precision, far exceeding the ability
widespread phenomenon of cultural and social debates if we
of speakers of languages that have a relativist system of spatial
study and understand the nature of our categories that give rise
reckoning.
to and invite debates by virtue of their very structure.
he second example deals with the cognitive process of cat-
Our third example has to do with how we represent knowl-
egorization. We can suggest that there is a close connection
edge in the mind. Categories are mentally represented as frames,
between the nature of our categories and many important cul-
schemas, or mental models (see, e.g., Schank and Abelson
tural and social issues. he classical view of categories is based
1977; Fillmore 1982; Langacker 1987; Lakof 1987). We can use
on the idea of essential features. In that view, the members of
the following working deinition of frames: A frame is a struc-
the category must share certain essential features. In the new
tured mental representation of a coherent organization of
rival view, categories are deined not in terms of necessary
human experience.
and sufficient conditions (i.e., essential features) but with
Frames are important in the study of almost any facet of life
respect to prototypes and various family resemblance
and culture and not just language. he world as we experience
relations to these prototypes.
it is always the product of some prior categorization and framing
How do we make sense of social debates? he emergence,
by ourselves and others. A crucial aspect of framing is that difer-
existence, and often the resolution of cultural and social issues
ent individuals can interpret the same reality in diferent ways.
may hinge on how we think about the nature of our categories.
his is the idea of alternative construal mentioned earlier.
To see how this is possible, let us consider the concept of art. he
How do we categorize the various objects and events we
discussion of the structure of the concept of art can shed light
encounter in the world? Clearly, many of our categories are
on why art has been a debated category probably ever since its
based on similarity (especially of the family resemblance kind)
inception and particularly in the past two centuries. Kvecses
among members of a category. hat is, many categories are held
(2006) examines some of the history of the category of art in
together by family resemblances among the items that belong
the past 200 years on the basis of the Encyclopedia Britannica
to a particular category. In this sense, most of our conventional
(2003). What he inds in this history is that the category under-
categories for objects and events are similarity-based ones. For
goes constant redeinition in the nineteenth and twentieth
example, the things that one can buy in a store are commonly
centuries. Diferent and rival conceptions of art challenge the
categorized on the basis of their similarity to one another; thus,
traditional view that is, the most prevalent conservative
we ind diferent kinds of nails (short and long ones, thick and
view. Impressionism, cubism, surrealism, pop art, and the like
thin ones, etc.) in the same section of a hardware store. hey
are reactions to the traditional view and to each other. But what
form a similarity-based category. However, we can also ind nails
is the traditional view of art?
in other sections of the store. Some nails can occur in sections
he traditional conception of art can be arrived at by exam-
where, for example, things for hanging pictures are displayed.
ining those features of art that are challenged, negated, or suc-
Clearly, a nail is not similar to any of the possible things (such as
cessfully canceled by the various movements of art. For example,
picture frames, rings, short strings, adhesive tapes, maybe even a
most people believe that a work of art represents objective reality.
special hammer) displayed in this section. How is it possible that
his feature of art is canceled by the art movements of impres-
certain nails appear in this section? Or, to put it in our terms, how
sionism, expressionism, and surrealism. Another feature of art
is it possible that nails are put in the same category with these
that most people take to be deinitional is that a work of art is
other things? he answer is that in addition to similarity-based
representational, that is, it consists of natural igures and forms.
categories, we also have frame-based ones. hat is to say, catego-
his feature is efectively canceled by symbolism, cubism, and
ries can be formed on the basis of the things that go commonly
abstract art. Finally, most believe that a work of art is a physical
and repeatedly together in our experience. If we put up pictures
object. his feature is canceled by conceptual art.
on the wall by irst driving a nail into the wall and then hanging
As can be seen, even those features of art that many would take
the picture frame on the nail by means of attaching a metal ring
to be deinitional for all forms of art (such as the one that art rep-
or a string on the frame, then all of the things that we use for this
resents objective reality, the one that it is representational, and
242
Culture and Language Cycle, the
purpose may be placed in a single category. But this category will . 2006. Language, Mind, and Culture: A Practical Introduction.
be frame-based not similarity-based. Oxford and New York: Oxford University Press.
Now there can be diferences across and even within cul- Kramsch, Claire. 1998. Language and Culture. Oxford: Oxford University
tures in the use of this meaning-making device. An interesting Press.
Lakof, George. 1987. Women, Fire, and Dangerous hings.
example is provided by a study by J. Glick (1975) conducted
among the Kpelle of Liberia. Kpelle farmers consistently sorted
Lakof, George.. 1996. Moral Politics: How Liberals and Conservatives
objects into functional groups (such as knife and orange and hink. Chicago: University of Chicago Press.
potato and hoe), rather than into conceptual categories (such Lakof, George, and Mark Johnson 1980. Metaphors We Live By.
as orange and potato and knife and hoe). he former is what we Chicago: University of Chicago Press.
would call a frame-based categorization, whereas the latter is a Lakof, George, and Mark Johnson. 1999. Philosophy in the Flesh: he
similarity-based one. On the whole, Westerners prefer to cat- Embodied Mind and Its Challenge to Western hought. New York: Basic
egorize objects on the basis of similarity. When Glick asked the Books.
Kpelle how a fool would categorize the objects, they came up Langacker, Ronald. 1987. Foundations of Cognitive Grammar: heoretical
with such neat similarity-based piles. Clearly, cultures can dif- Prerequisites. Vol. 1. Stanford, CA: Stanford University Press.
fer in the use of meaning-making devices, and these diferences Levinson, Stephen C. 1996. Relativity in spatial conception and descrip-
tion. In Rethinking Linguistic Relativity, ed. J. Gumperz and S. C.
may produce diferences in the use of categories and language
Levinson, 177202. Cambridge: Cambridge University Press.
in general.
Musolf, Andreas. 2004. Metaphor and Political Discourse: Analogical
Reasoning in Debates about Europe. London: Palgrave Macmillan.
Conclusion Palmer, Gary. 1996. Toward a heory of Cultural Linguistics. Austin: Texas
Culture and language are connected in many ways, and the University Press.
interconnections can be studied from a variety of diferent Schank, Robert, and Robert Abelson. 1977. Scripts, Plans, Goals, and
perspectives. Following Geertz, I tried to develop a view of the Understanding. Hillsdale, NJ: Lawrence Erlbaum.
relationship that is based on how we make sense of our experi- Shore, Bradd. 1996. Culture in Mind: Cognition, Culture, and the Problem
ences linguistic or otherwise. Recent cognitive science and cog- of Meaning. Oxford and New York: Oxford University Press.
nitive linguistics provide us with new ideas and methodological Strauss, Claudia, and Naomi Quinn. 1987. A Cognitive heory of Cultural
Meaning. Cambridge: Cambridge University Press.
tools with which we can approach the issue of meaning making
Turner, Mark. 2001. Cognitive Dimensions of Social Science. Oxford and
in cultures, both in its universal aspects and in its ininite cross-
New York: Oxford University Press.
cultural variety.
Whorf, Benjamin Lee. 1956. Language, hought, and Reality: Selected
Zoltn Kvecses Writings of Benjamin Lee Whorf. Ed. John B. Carroll. Cambridge,
MA: he MIT Press.
Wolf, Hans-Georg. 2001. he African cultural model of community in
English language instruction in Cameroon: he need for more syste-
Alverson, Hoyt. 1991. Metaphor and experience: Looking over the maticity. In Applied Cognitive Linguistics. Vol 2: Language Pedagogy.
notion of image schema. In Beyond Metaphor: he heory of Tropes Ed. M. Putz, S. Niemeier, and R. Dirven, 22558. Berlin: Mouton de
in Anthropology, ed. J. Fernandez, 94117. Stanford, CA: Stanford Gruyter.
University Press.
Charteris-Black, Jonathan. 2004. Corpus Approaches to Critical Metaphor
Analysis. Houndsmill, UK: Palgrave Macmillan. CYCLE, THE
Encyclopedia Britannica Ready Reference. 2003. Chicago: Encyclopedia
Britannica. Electronic version. he syntactic cycle was originally formulated in Chomsky (1965)
Fillmore, Charles. 1982. Frame semantics. In Linguistics in the Morning as a general principle of grammar that constrains the way trans-
Calm, 111137. Hanshin: he Linguistic Society of Korea. formational rules can apply in the derivation of sentences
Foley, William A. 1997. Anthropological Linguistics: An Introduction. containing embedded clauses. he term cycle refers to the prop-
Oxford and Malden, MA: Blackwell. erty of syntactic derivations whereby transformations apply
Geertz, Cliford. 1973. he Interpretation of Cultures. New York: Basic within syntactic subdomains before they apply to larger syntac-
Books. tic domains that contain them. hus, in (1), where A, B, and C
Gibbs, Raymond W. 2006. Embodiment and Cognitive Science. New
denote syntactic domains to which transformational rules can
York: Cambridge University Press.
apply, the set of transformations must apply to C before B, and
Glick, J. 1975. Cognitive development in cross-cultural perspective. In
Review of Child Development Research. Vol. 4. Ed. F. Horowitz, 595
then to B before A (see Figure 1). If a rule X applies to domains
654. Chicago: University of Chicago Press. AC in a single derivation, it applies successive cyclically to C,
Goatly, Andrew. 2007. Washing the Brain: Metaphor and Hidden Ideology. then B, and inally A that is, starting with the smallest cyclic
Amsterdam: John Benjamins. domain and proceeding stepwise to the largest (see Boeckx 2007
Johnson, Mark. 1987. he Body in the Mind. Chicago: University of for discussion).
Chicago Press. Noam Chomsky sharpens his original formulation as the
Kimmel, Michael. 2001. Metaphor, Imagery, and Culture: Spatialized strict cycle condition (SCC):
Ontologies, Mental Tools, and Multimedia in the Making. Ph.D. diss.,
University of Vienna. No rule can apply to a domain dominated by a cyclic node A in
Kvecses, Zoltn. 2005. Metaphor in Culture: Universality and Variation. such a way as to afect solely a proper subdomain of A dominated
Cambridge: Cambridge University Press. by a node B which is also a cyclic node. (1973, 243)
243
Cycle, the Deconstruction
(1) A extension condition (1993), the phase impenetrability condition

(Chomsky 2000), and cyclic linearization (Fox and Pesetsky 2005).
It appears that syntactic derivations do not allow countercy-
clic operations, either because the formulation of grammatical
B
operations will not allow them or because such operations violate
general constraints (aside from the SCC) on either derivations
or the representations that are produced. Whichever approach
C turns out to be correct, it is clear that the theory of grammar need
not include a speciic cyclic principle along the lines of the SCC,
given that virtually all of its empirical efects follow from inde-
pendent factors. Either way, the cycle is deeply embedded in
Figure 1.
syntactic theory.
he SCC includes the further restriction that transformations Robert Freidin
may not revisit a subdomain after they have applied to the larger
domain that contains it. For example, once a transformation WORKS CITED AND SUGGESTIONS FOR FURTHER READING
applies to domain B in Figure 1, no transformation can apply
Boeckx, C. 2007. Understanding Minimalist Syntax: Lessons from Locality
solely within the subdomain C. Since the earliest formulations, in Long-Distance Dependencies. Oxford: Blackwell.
clauses (complementizer phrase (CP)/inlection phrase (IP) Chomsky, N. 1965. Aspects of the heory of Syntax. Cambridge, MA: MIT
in current analysis) have been designated as cyclic domains. Press.
Nominal phrases (i.e., noun phrase [NP] and determiner phrase Chomsky, N. 1973. Conditions on transformations. In A Festschrift for
[DP]) and more recently light verb phrase (vP) have also been Morris Halle, ed. S. Anderson and P. Kiparsky, 23286. New York: Holt,
proposed as additional cyclic domains. Rinehart and Winston.
Empirical motivation for the SCC involves deviant sentences . 1993. A minimalist program for linguistic theory. In he View
whose derivation violates the SCC, for example (2) under the from Building 20: Essays in Linguistics in Honor of Sylvain Bromberger,
ed. K. Hale and S. Keyser, 152. Cambridge, MA: MIT Press.
partial derivation given in (3).
. 2000. Minimalist inquiries: he framework. In Step by Step: Essays
(2)*Who did you wonder what bought? on Minimalist Syntax in Honor of Howard Lasnik, ed. R. Martin,
D. Michaels, and J. Uriagereka, 89155. Cambridge, MA: MIT Press.
(3) a. [CP1 C [IP1 you wonder [CP2 C [IP2 who bought what ] ] ] ] Fox, D., and D. Pesetsky. 2005. Cyclic linearization of syntactic struc-
b. [CP1 C [IP1 you wonder [CP2 who C [IP2 bought what ] ] ] ] ture. heoretical Linguistics 31: 145.
c. [CP1 who C [IP1 you wonder [CP2 C [IP2 bought what ] ] ] ] Freidin, R. 1978. Cyclicity and the theory of grammar. Linguistic Inquiry
d. [CP1 who C [IP1 you wonder [CP2 what C [IP2 bought ] ] ] ] 9: 51949.
. 1999. Cyclicity and minimalism. In Working Minimalism, ed.
Speciically, the SCC blocks the countercyclic derivational step
S. Epstein and N. Hornstein, 95126. Cambridge, MA: MIT Press.
that moves what in (3c) to the speciier position of CP2, as illus- Lasnik, H. 2006. Conceptions of the cycle. In Wh-Movement: Moving
trated in (3d). On, ed. L. Cheng and N. Corver, 197216. Cambridge, MA: MIT
Whether a cyclic principle of rule application like the SCC Press.
constitutes an axiom of syntactic theory depends on two
things: one concerning the formulation of transformations and
the other involving the empirical overlap with independently
motivated conditions. If the countercyclic movement of what
D
in (3) is prohibited by the basic formulation of transformations
(e.g., merge), then stipulating an independent cyclic principle
DECONSTRUCTION
is redundant. he cyclic application of rules simply follows from
the formulation of rules (see Freidin 1999). Deconstruction is a practice of exceptionally close and vigilant
Suppose, however, that the formulation of transformations critical reading that aims to reveal the various contradictions
does not prohibit the countercyclic movement in (3). hen, a or moments of aporia (of paradox or strictly irresolvable doubt)
cyclic principle is required only if (2) under derivation (3) cannot endemic to the texts of Western philosophy, literature, and other
be excluded by other independently motivated constraints. For kinds of writing. It is best approached through the work of Jacques
example, under trace theory, the derivation (3) yields a represen- Derrida (19302004), the most vigorous exponent of deconstruc-
tation (4). tion and a thinker centrally concerned with issues in semantics,
hermeneutics (see philology and hermeneutics),
(4)[CP1 whoi C [IP1 you wonder [CP2 whatj C [IP2 ti bought tj ] ] ] ]
speech-act theory, and philosophy of language and logic (see
he connection between who and its trace ti violates the especially Derrida 1973, 1978, 1982, 1989).
subjacency principle. Taking this condition to be a constraint Perhaps the most striking example is Derridas lengthy
on trace binding (i.e., on representations), one empirical efect of treatment of Jean-Jacques Rousseau in Of Grammatology
the SCC is subsumed under subjacency. (See Freidin 1978 for an (Derrida 1976). Here, he shows how Rousseaus ideas about a
account that generalizes this kind of analysis.) Other proposals to vast range of topics nature, culture, language, society, ethics,
derive the empirical efects of a cyclic principle include Chomskys politics, history, sexual relations, personal identity, literature,
244
Deconstruction
and music are afected by a curious logic of supplementarity condition of impossibility for the sorts of claim put forward by
that constantly twists his argument back against itself and thereby Rousseau and like-minded thinkers.
subverts its manifest intent. Rousseau wants to say and does Such is the logic of supplementarity whereby the text
quite explicitly state that in each case, there is (or once was) bears involuntary witness to this strain on its powers of articu-
an original, authentic, natural, uncorrupted state that then gives late expression through various, often extreme, complexities of
way to a decadent, artiicial, and degraded state where human logical and syntactic structure. Chief among them are complexi-
beings are condemned to live at a distance from their true nature ties of a modal and temporal type, the former brought about by
and enter into all kinds of intrinsically bad (since by very dei- Rousseaus constant switching back and forth between talk of
nition unnatural) relationship with themselves, each other, and what must, might, or should properly have been the case with
the world around them. Yet in each case, his argument comes up regard to the aforementioned orders of priority, the latter by his
against a stubbornly insistent counterlogic that throws its claims likewise ambivalent, grammatically and logically elusive turns of
into doubt by reversing the conceptual order of priorities upon phrase when it comes to establishing a time-indexed (i.e., histor-
which that argument relies. ical, rather than mythic) sequence for the process of decline that
hus, Rousseau sees language and music as having their his text narrates. Hence Derridas claim that the writer writes in
common point of origin in a mode of passionate speech-song a language and in a logic whose proper system, laws, and life his
that expresses human feelings directly and without, as yet, any discourse by deinition cannot dominate absolutely, since [h]e
need for those various bad supplements of syntactic and uses them only by letting himself, after a fashion and up to a
lexical structure in language or harmony and counterpoint in point, be governed by the system (1976, 158). More speciically,
music that had since come to work their insidious, corrupting what Rousseau wishes to say about the proper, authentic order
efects. Indeed, as Derrida notes, the key word supplement of priority between passion and reason, speech and writing, mel-
occurs with remarkable frequency and weight of semantic ody and harmony, primitive and civilized stages of society, or
implication throughout Rousseaus writing on these topics. Yet (subsuming all these) nature and culture is everywhere implic-
when he treats them in a more sustained and relective way, itly subject to challenge or thrown into doubt by that same
Rousseau has perforce to concede that the melodious aspect of supplementary logic.
music is always dependent on a background sense of its har- So critics of Derrida like John Searle and Jrgen Habermas
monic implications, just as the expressive aspect of language along with some of his admirers such as Richard Rorty are wide
depends on the existence of grammatical and lexical struc- of the mark when they take deconstruction to consist in noth-
tures in the absence of which it could communicate nothing ing more than a routine technique for inverting or subverting the
whatsoever. various distinctions between reason and rhetoric, philosophy
he same applies to his thinking about matters of history, pol- and literature, or conceptual and linguistic issues (see Habermas
itics, and civil society. Here, Rousseau purports to trace a process 1987; Rorty 1982; Searle 1977). On their account, his work exem-
of epochal decline from the close-knit, organic, natural commu- pliies the textualist ne plus ultra of that widespread linguis-
nities that once existed before the advent of all the corrupting tic turn that has been such a prominent feature of philosophy in
forces of power, class, education, authority, political inluence, both the analytic and the continental traditions during the past
acquired expertise, and so forth which are falsely considered seven decades or so (Rorty 1967). Of course, one can see how this
progress or civilization by those same decadent standards. idea took hold, given Derridas sharp focus on matters of tex-
And again, what passes for culture among the denizens of tual detail and his extreme attentiveness to elements of igural
modern society is, in truth, just another melancholy sign of the or metaphoric language that must complicate any straightfor-
falling away from nature or the ever more false and artiicially ward appeal to literal, express, or intended meaning. However,
cultivated manners, practices, and modes of expression that ig- he is equally insistent that those who claim to turn the page on
ure as mere supplements to an otherwise perfectly self-suicient philosophy always end up by just philosophizing badly, since it
natural state. Yet here also Derrida shows that the term supple- is a pointless (and in any case self-refuting) gesture that afects to
ment is subject to a kind of dislocating force, or logico-semantic have done with all those old philosophical concepts and catego-
torsion, that twists the operative sense of Rousseaus argument ries while, in fact, surreptitiously or involuntarily deploying them
against his avowed intent. In each case, the supplementary item at every turn (Derrida 1982; Norris 1989; Rorty 1982, 1989).
turns out to be not so much a mere supplement ( = add-on, hese issues receive their most explicit treatment in his essay
accessory, optional extra) but a supplement in the opposite, he Supplement of Copula, where Derrida ofers a full-dress
palliative sense: that which is required in order to complete or transcendental argument from the conditions of possibility for
make good an otherwise defective, non-self-suicient, or inad- thinking or reasoning in general against the idea put forward
equate mode of being. hus, it is strictly impossible a downright by the linguist Emile Benveniste that our entire stock of philo-
contradiction in terms to posit the existence of a social state sophical concepts and categories can be seen to derive from a
of nature that would somehow precede and contrast with those certain language (the ancient Greek) and its distinctive range of
subsequent states whose hallmark, according to Rousseau, was lexico-grammatical structures (Benveniste 1971; Derrida 1982,
their basis in various, increasingly complex forms of societal and 175205). On the contrary, Derrida maintains, Benveniste cannot
cultural distinction. his attempt to describe what can never in advance a single proposition in support of his linguistic-relativist
truth have existed a society unmarked by any of those struc- case without falling back upon those same conceptual resources,
tures (however primitive) that constitute the very conditions such as the distinction between language and thought. So there
of possibility for social life can in fact be seen to deine the is no making sense of Benvenistes claim to invert the received
245
Deconstruction
(philosophical) order of priorities by treating language as the historians of science and showing how they take for granted not
condition of possibility for thought, or in narrowly professional only the existence of certain prior philosophical concepts and
terms linguistics as the discipline now poised to occupy the high categories but also the necessity of bringing them to bear in the
academic ground. Rather, what emerges from a critical reading process of deconstructing the kinds of uncritical or prejudicial
of Benvenistes texts is the absolute necessity that any such argu- thinking that often go along with them. For if the concept of
ment should turn out to controvert its own leading premise by metaphor is itself what Coleridge dubbed a philosopheme a
taking for granted a whole range of indispensable distinctions distinctively philosophic notion then we can have no means
that derive from a prior philosophic discourse, in this case one of questioning the supposed priority of concept over metaphor
that inds its irst clear statement in Aristotles doctrine of the except by way of the discourse wherein that topic has received
categories. its most decisive statements and elaborations. hus, it makes no
here is a similar twist of argument, and again one that sense to proclaim, with postphilosophical adepts like Rorty,
is ignored by most commentators, in Derridas essay White that we should give up the old deluded quest for truth, clear and
Mythology: Metaphor in the Text of Philosophy (1982, 20771). distinct ideas, conceptual precision, and so on and henceforth
On the usual account, what he here purports to show taking a embrace the Derridean ideal of philosophy as just another kind
lead from Nietzsche is the saturation of philosophic discourse of writing that at best ofers new and adventurous modes of cre-
by various (predominantly visual or tactile) types of metaphor ative self-description (Rorty 1982). his involves not only a snip-
that cannot be expunged, as some philosophers would wish, or pety reading of Derrida but also a failure to grasp his point that
even brought within the bounds of rational acceptability through such gestures can amount to no more than a kind of rhetorical
a systematic treatment or method of classiication. hat is, they hand waving or a claim to have come out on the far side of phi-
are so pervasive and go so far toward deining the very nature, losophy, while in fact regressing to a prephilosophical stage of
self-image, and operative scope of that discourse that it is strictly unrelective immersion in language.
impossible for philosophy either to manage without them or Such is indeed the charge that Habermas brings against
come up with some rigorously theorized account that would Derrida, namely, his having leveled or annulled the crucial genre
inally reduce them to order on its own methodological terms. To distinction between philosophy and literature or language in its
suppose that philosophy has managed to resolve this problem constative (i.e., truth-based or logical) and its performative (sua-
to achieve a clear demarcation between concept and metaphor sive and rhetorical) modes (Habermas 1987; see also perfor-
or literal and igural language is to take it for granted that the mative and constative). It seems to me, on the contrary,
sense aimed at through these igures is an essence rigorously that Derridas most signiicant achievement will be seen to lie in
independent of that which transports it, which is an already phil- his contributions to philosophy of language and logic and, above
osophical thesis, one might even say philosophys unique thesis, all, his remarkably inventive and original yet none the less rigor-
the thesis which constitutes the concept of metaphor (Derrida ous rethinking of the relationship between these disciplines. (For
1982, 229). However, this conidence may appear ill-placed if some early indications, see Norris and Roden 2002.)
one considers the extent to which philosophy depends upon a
Christopher Norris
range of metaphorical terms and distinctions like that between
metaphor (etymologically a means of transport or carrying
away) and concept (grasping, comprehending, holding together
in thought) which make up its very element. Aristotle. 1990. Categories and De Interpretatione. Trans. J. L. Ackrill.
What I have said so far about White Mythology its in well Oxford: Clarendon.
enough with the received view among mainstream analytic phi- Benveniste, Emile. 1971. Problems in General Linguistics. Trans. Mary
Meek. Coral Gables, FL: University of Miami Press.
losophers: that Derridas approach has more in common with
Dasenbrock, Reed Way, ed. 1989. Re-Drawing the Lines: Analytic
literary criticism than with philosophy properly so called, that
Philosophy, Deconstruction, and Literary heory. Minneapolis:
is, the practice of rigorous conceptual analysis. However, this is University of Minnesota Press.
a partial and highly prejudicial reading, as soon becomes clear Derrida, Jacques. 1973. Speech and Phenomena and Other Essays
if one looks beyond the opening section where his approach on Husserls heory of Signs. Trans. David B. Allison. Evanston,
might plausibly be construed along these echt-Nietzschean IL: Northwestern University Press.
lines to later passages where Derrida goes out of his way to fore- . 1976. Of Grammatology. Trans. Gayatri C. Spivak. Baltimore: Johns
stall or disqualify that interpretation. His counterargument (as Hopkins University Press.
with the response to Benveniste) is that philosophy has provided . 1978. Writing and Diference. Trans. Alan Bass. London: Routledge
all the terms and categorical distinctions that must be seen as and Kegan Paul.
absolutely prerequisite to any discussion of these issues, among . 1982. Margins of Philosophy. Trans. Alan Bass. Chicago: University
of Chicago Press.
them most crucially the distinctions between concept and met-
. 1982. Margins of Philosophy. Trans. Alan Bass. Chicago: University
aphor, reason and rhetoric, or philosophy and literature.
of Chicago Press.
Hence, Derridas cardinal point: that this will require not only . 1989. Afterword: Toward an ethic of conversation. In Limited Inc,
the highest degree of conceptual precision but also a detailed ed. Gerald Graf, 11154. Evanston, IL: Northwestern University Press.
knowledge of their history and various stages of elaboration and Habermas, Jrgen. 1987. On levelling the genre-distinction between
reinement to date. philosophy and literature. In he Philosophical Discourse of
White Mythology makes good this claim by examining a Modernity: Twelve Lectures, trans. Frederick Lawrence, 185210.
great range of texts by philosophers, linguists, rhetoricians, and Cambridge: Polity Press.
246
Deinite Descriptions Deixis
Norris, Christopher. 1989. Philosophy as not just a kind of writ- on the verb, pronouns, or a combination of both. Many deixes
ing: Derrida and the claim of reason. In Dasenbrock 1989, 189203. add information in the pronoun on the referents number (sin-
. 1990. Deconstruction, postmodernism and philosophy: Habermas gular, paucal, plural, dualis, or trialis) and its classiication
on Derrida. In Whats Wrong with Postmodernism: Critical heory and (masculine, feminine, neuter, animate, inanimate, edible). In
the Ends of Philosophy, 4976. Hemel Hempstead: Harvester.
Asian and Native American languages, irst person plural pro-
. 2000. Deconstruction and the Uninished Project of Modernity.
nouns often distinguish whether or not the hearer is included
London: Athlone.
. 2002. Derrida on Rousseau: Deconstruction as philosophy of
in the narrated event (labeled inclusive or exclusive), as shown
logic. In Norris and Roden 2002, II: 70124. in Table 1. Languages may use a two-term or three-term system
Norris, Christopher, and David Roden, eds. 2002. Jacques Derrida. 4 vols. to localize the referent in space. his tripartite distinction also
London: Sage. applies to languages with elaborate deictic systems like Malagasy
Rorty, Richard. 1982. Philosophy as a kind of writing. In Consequences (Madagascar) and Venda (South Africa) (see Table 2).
of Pragmatism, 89109. Brighton: Harvester. In one-term systems, nouns and verbs may be used for a com-
. 1989. Two versions of logocentrism: A reply to Norris. In plete deictic reference as, for example, the verb ro seaward in
Dasenbrock 1989, 20416. the following Ewaw (Indonesia) sentence.
Rorty, Richard, ed. 1967. he Linguistic Turn. Chicago: University of
Chicago Press. 1. Om-liik=ken nung afa en-ho ded=i en-ro?
Searle, John R. 1977. Reiterating the diferences: A reply to Derrida. 2sg-see=hit my thing 3sg-pass road=DEM 3sg-seaward
Glyph 1: 198208. Baltimore: Johns Hopkins University Press. Did you see something of mine on that road?
Time deixis can be encoded by tense inlections on the verb or

DEFINITE DESCRIPTIONS
adverbs in which the speech moment is the deictic center. Maori
A phrase of the form the is a deinite description. In his 1905 (New Zealand) seems atypical in that it signals past, present, or
On Denoting, Bertrand Russell sought an account of deinite future tense by means of special locative markers (respectively, i,
descriptions that would speak to why it is worthwhile to produce kei, and hei in the following examples).
an identity statement with such a phrase (a = a is trivial, but
Jupiter = the largest planet is informative), why one cannot 2a. I tepoti rua inanahi.
substitute Jupiter for the largest planet in George Bush often LOC.past Dunedin 3d yesterday
wonders whether Jupiter is the largest planet, and why sen- hey were in Dunedin yesterday.
tences that contain deinite descriptions that appear to denote 2b. Kei raro te ngeru.
nonexistent objects (the present king of France) can be mean- LOC inside ART cat
ingful if they denote nothing. (For details, see Ludlow 2009.) he cat is inside.
James McGilvray 2c. Hei te ata tua haere ai.
LOC.fut ART morning 1d go PART
We will go in the morning.
Ludlow, Peter. 2009. Descriptions. In he Stanford Encyclopedia Of
Often, space deictics are used originally to locate a referent in
Philosophy, ed. Edward N. Zalta. Available online at: http://plato.
stanford.edu/entries/descriptions/.
the discourse (as, for example, in Leti (Indonesia): ptal=d
Russell, B. 1905. On denoting, Mind 14: 47993. (bottle=here) this bottle here; ptal=di (bottle=now) the bot-
tle we are discussing now; and ptal=d=di (bottle=here=now)
this bottle here that we are discussing now).
DEIXIS A special type of discourse deixis, labeled switch-reference in
By deixis (from the Greek to point) we mean here all the literature, occurs in Native American, Papuan, and Australian
cues provided by a language that localize a speech event and its languages where special verb inlections or pronouns signal
participants in space and time. By contrast, reference is based on whether or not the subject in a clause has the same referent as
the privative distinction related to the deictic center (origo) / the subject in the following clause (same subject [SS] or diferent
not related to the deictic center. here are two reference sys- subject [DS]).
tems to localize a referent. In a positional system, the speaker is,
3. U-hu ma or hari-k limu teyen ya-ha lafaura.
or both speaker and hearer are, the deictic center whose position
Do-SS man he died-DS they bench make-SS placed
is used to localize an entity. A dimensional system relies on the
hen the man died, people (in the village) made a bench and
speech participants orientation and understanding of the envi-
placed him there.
ronment, in which case the deictic center may be something else
(Mende, Papua Niugini, after Nozawa 2000).
instead of the speech participants. he choice for one or both of
these systems evokes major diferences among languages. In Algonquian languages (North America), third person pro-
here are six categories of deixis: person, space, time, dis- nouns signal whether their referent is more or less topical
course, emphathy, and social status (Levinson 1983). Person in the narration (for example, proximal bi versus obviative
deixis usually distinguishes the speaker (irst person) from yi in Navajo). Similarly, a languages deictic system may sig-
the hearer (second person) and the non-speech or narrated nal the speakers empathy toward the referent and its status
participant (third person). his can be encoded by inlection within society. In Javanese (Indonesia), for example, social
247
Deixis
Table 1. Pronoun systems in four languages
English Quechua (Peru) Tamil (India) Biak (Indonesia)

1st person singular I nuqa nn ai
2d person singular you qan n au
3d person singular he pay avan (masculine) I
she ava (feminine)
it atu (neuter)
1st person plural we nuqanchis nm u (dualis)
inclusive o (trialis)
1st person plural we nuqayku nka nu (dualis)
exclusive mo (plural)
2d person plural you qankuna nka mu (dualis)
mo (plural)
su (dualis)
3d person plural they paykuna avar so (trialis)
si (animate plural)
na (inanimate plural)
Table 2. Demonstratives in Malagasy (Madagascar)
Referents
Distance to speaker Proximal Distance-neutral
Boundedness Bounded Unbounded Medial Distal Bounded Unbounded
Visibility Number
Visible Singular Ito ~ ity Itsy Iry Io Iny
Plural Ireto Iretsy Irery Ireo ireny
Invisible Singular Izato Izaty - Izary Izao izany
Plural Izatero - - - - -
Source: After Imai 2003, 201.
deixis created separate low and high lexicons that permeate Duchan, J. F., G. A. Bruder, and L. E. Hewitt. 1995. Deixis in Narrative: A
the entire language. he low style is represented in 4a, the high Cognitive Science Perspective. Hillsdale, NJ: Lawrence Erlbaum.
style in 4b. Felson, Nancy, ed. 2004. he poetics of deixis in Alcman, Pindar and
other lyric. Arethusa 37.3 (Special Issue).
4a. Dewek-e sing kok tuko-ni iwak. Garry, Jane, and Carl Rubino, eds. 2001. Facts About the Worlds
self-POS REL you buy-APPL ish Language: An Encycplopedia of the Worlds Major Languages, Past and
Present. New York and Dublin: H. W. Wilson.
4b. Piyambak-ipun ingkang sampeyan tumbas-aken ulam.
Green, Keith, ed. 1995. New Essays in Deixis: Discourse, Narrative,
self-POS REL you buy-APPL ish
Literature. Amsterdam: Rodopi.
It is him whom you bought ish for. Imai, Shingo. 2003. Spatial deixis. Ph.D. thesis, State University of New
Deixis is a major research topic in typology, pragmatics, York at Bufalo.
Levinson, Stehen C. 1983. Pragmatics. Cambridge: Cambridge University
anthropological linguistics and, recently, in poetics.
Press.
Aone van Engelenhoven Levinson, Stephen C., and David P. Wilkins. 2006. Grammars of Space:
Towards a Semantic Typology. Explorations in Cognitive Diversity,
Language, Culture & Cognition 6. Cambridge: Cambridge University Press.
Nozawa, Michiyo. 2000. Participant Identiication in Mende. Available
Bhler, K. 1982 (1934). he deictic ield of language and deictic worlds. online at: http://www.sil.org/ paciic/png/abstract.asp?id=506
Speech, Place and Action: Studies in Deixis and Related Topics. Ed. Te Aka Mori-English, English-Mori Dictionary and Index. Available
R. J. Jarvella and W. Klain (eds). Chichester: John Wiley, 930. online at: http://www.maoridictionary.co.nz/index.cfm.
248
Descriptive, Observational, and Explanatory Adequacy
DESCRIPTIVE, OBSERVATIONAL, AND EXPLANATORY

ADEQUACY
In empirical science, the adequacy of a theory is determined by the
degree to which it gives insight into the real nature of certain aspects
of the world. he evaluation of a theory can concentrate on the cor-
respondence between the predictions of the theory and observed
phenomena, on the plausibility of the system described by the the-
ory as underlying these phenomena, or on the compatibility of the
theory with theories for adjacent ields. Ideally, a theory scores well
on all three of these accounts. In linguistics, a special set of terms for-
malizing these criteria was introduced by Noam Chomsky (1964).
Origin of the Terms

he terms of observational adequacy, descriptive adequacy, and
explanatory adequacy were introduced by Chomsky in his ple- Figure 1. Chomskyan linguistics and levels of adequacy.
nary address to the Ninth International Congress of Linguists in
1962. hese terms are not widely used outside linguistics. hey
are collectively referred to as the levels of adequacy. Chomsky adequacy, it is suicient that the observable facts are covered.
(1964, 289) describes them as follows: For descriptive adequacy, it is required that they be covered by
A grammar that aims for observational adequacy is concerned a grammar that describes the speakers competence. Although
to account for observed linguistic utterances. both observational and descriptive adequacy are described
A grammar that aims for descriptive adequacy is concerned as properties of grammars, only for descriptive adequacy does
to account for the speakers underlying system of intuitions. the grammar correspond to grammar in Figure 1. A grammar
A linguistic theory that aims for explanatory adequacy is con- that ignores the need to describe the speakers competence is a
cerned to provide a principled basis for selecting a descrip- grammar of a kind not represented in this igure. he opposition
tively adequate grammar. between descriptive and explanatory adequacy is characterized
by the fact that the latter is a property of a linguistic theory of a
higher level of abstraction than a grammar.
Position in Chomskyan Linguistics
he interpretation of the three levels of adequacy cannot be sepa- Discussion until the Emergence of Principles and
rated from Chomskys view of the nature of language and the way it Parameters
should be studied. Schematically, this view can be represented as he main purpose of the introduction of the concept of observa-
in Figure 1, which is based on ten Hacken (2007), where a detailed tional adequacy seems to have been to set of Chomskyan linguis-
discussion of and motivation for the elements of the diagram may tics from post-Bloomieldian linguistics. In post-Bloomieldian
be found. On the left-hand side, real-world entities are repre- linguistics, a grammar was not supposed to describe the speakers
sented. Observable facts are phenomena and events that can be competence because the speakers competence is a mental entity.
observed by the linguist, for example, grammaticality judg- Zellig S. Harris ([1951] 1960) and Charles F. Hockett (1954), for
ments. competence is the knowledge of language in the mind/ instance, reject any appeal to mental states because it is impossi-
brain of the speakers that enables them to produce these facts. he ble to observe them directly. Any recourse to mental entities was
language faculty is a set of genetically determined predispositions deemed unscientiic. In the framework of Figure 1, this is tanta-
of human beings that enable them to acquire this competence. mount to a rejection of descriptive adequacy as a legitimate goal
he gray arrows between them can be read as underlies. of linguistic theory. Of course, post-Bloomieldian linguists could
he rounded rectangles in the middle of Figure 1 represent not accept the allegation that they were aiming only for observa-
theoretical entities. An observation is a theoretical entity in the tional adequacy, as shown by Fred W. Householder (1965). As
sense that it imposes a certain structure on the world and selects analyzed by Pius ten Hacken (2007), post-Bloomieldian linguis-
relevant aspects. A grammar in Chomskyan linguistics is a thetics assumed a diferent set of criteria for the selection of gram-
ory of the speakers competence. universal grammar (UG) mars, which was not compatible with the framework of Figure 1.
is a theory of the language faculty. he grammar can be tested he criteria of descriptive and explanatory adequacy create a
by observations and can explain these observations because certain tension because descriptive adequacy is served by a weak
they correspond to consequences of the competence. UG can be UG and explanatory adequacy by a strong UG. he weaker the con-
tested by individual grammars because for each competence, a straints imposed by UG, the more diferent grammars it allows and
grammar must be available that is allowed by the language fac- the easier it is to ind one for a particular language. he stronger
ulty. UG explains the individual grammars in the sense that it the constraints imposed by UG, the smaller the range of grammars
describes the mechanism that makes the emergence of compe- a child has to choose from in language acquisition and the more
tence in the individual (i.e., language acquisition) possible. aspects of the grammar that are determined by genetic factors.
As indicated in Figure 1, the levels of adequacy correspond he tension between descriptive and explanatory adequacy is
closely to the levels of theoretical depth. For observational mentioned by Chomsky (1981, 3) as beneicial because it directs
249
Descriptive, Observational, and Explanatory Adequacy
linguistic theory to an optimal balance between the power of UG he higher-level indeterminacy problem raised by the prolif-
and the power of individual grammars. It provides the basis for a eration of parameters cannot be solved within the framework of
solution to the problem of selecting the grammar that corresponds Figure 1. Chomskys (1995) minimalist program (MP) (see
to the way competence is actually organized in the speaker. If we try minimalism) addresses this problem by considering the ques-
to devise a grammar for a language on the basis of a inite set of data, tions of the uniication of linguistics with biology and the emer-
there are indeinitely many candidates. No (inite) amount of addi- gence of the language faculty in evolution (see biolinguistics).
tional data can constrain the range of candidate grammars to a inite he latter aspect is elaborated by Marc D. Hauser, Chomsky,
set. A very similar problem was known to the Post-Bloomieldians and W. Tecumseh Fitch (2002). In the analysis proposed by ten
as the problem of non-uniqueness (cf. Chao [1934] 1957). Hacken (2007), the MP adds a new pair of a real-world entity and
By exploiting the tension between descriptive and explanatory a theoretical entity on top of the three represented in Figure 1.
adequacy, Chomsky hoped to solve the indeterminacy and, at the hese entities determine and explain evolution, respectively.
same time, reach a deeper level of explanation. In Chomskyan hey are not part of linguistics proper but operate more gener-
linguistics, it is assumed that there is a single correct grammar, ally. It is not necessary to specify their exact nature in order to
that is, the one describing the actual system in the speakers use them as constraints on the way the language faculty can have
mind. If this system can come into existence, it has to be learn- emerged and should be shaped.
able on the basis of the language faculty and a limited amount If the levels of adequacy are considered only in the way they
of input data from the environment. Whatever is contributed to are deined by Chomsky (1964), they have lost their relevance in
language acquisition by the language faculty must be common the MP, as stated by Chomsky (2002, 12933). If they are consid-
to all human languages. his reasoning implies that descriptive ered as relecting a wider-ranging concern in empirical science,
and explanatory adequacy can only be achieved simultaneously. they have to be reformulated in a more general way. Ten Hacken
Without a proper theory of the genetically determined language (2006) considers two alternative ways of adapting the levels of
faculty, it is not possible to ind a grammar describing the actual adequacy to the expanded framework arising from the MP. One
knowledge of language that a speaker has. is to add a new level of adequacy, directly connected with the
An alternative tradition in the approach to the problem of new pair of entities added on top of Figure 1. he other approach
non-uniqueness is to deny the relevance of the problem. Harris is to relativize descriptive and explanatory adequacy with respect
([1951] 1960) and W. V. Quine (1972), for instance, assume that to the level of the entities to which they are applied.
any grammar that covers the data is a correct grammar and that
Pius ten Hacken
there is no principled way to choose between alternative correct
grammars (cf. indeterminacy of translation). A common WORKS CITED AND SUGGESTIONS FOR FURTHER READING
assumption in this tradition is that explanatory adequacy can
only be achieved after descriptive adequacy has been achieved. Chao, Yuen-Ren. [1934] 1957. he Non-uniqueness of phonemic solu-
tions of phonetic systems. Bulletin of the Institute of History and
Gerald Gazdar and colleagues observe that a description of the
Philology 4: 36397. Repr. in Readings in Linguistics: he Development
relevant phenomena is a necessary precondition to explaining
of Descriptive Linguistics in America 19251956, ed. Martin Joos, 3854.
some aspect of the organization of natural languages (1985, 2). Chicago: University of Chicago Press.
Taken literally, their observation does not have a direct bearing Chomsky, Noam. 1964. Current Issues in Linguistic heory. Den
on the order in which descriptive and explanatory adequacy can Haag: Mouton. his book is the original source for the terms descrip-
be achieved. hey only mention description and explanation, not tive, observational, and explanatory adequacy.
descriptive adequacy and explanatory adequacy. For the applica- . 1981. Lectures on Government and Binding. Dordrecht: Foris.
tion of the latter pair of terms, it is necessary to conceive a gram- . 1995. he Minimalist Program. Cambridge, MA: MIT Press.
mar as a theory of a speakers competence. As this conception is . 2002. On Nature and Language. Cambridge: Cambridge University
generally foreign to theories in this tradition, they are all classiied Press.
as aiming only for observational adequacy in the original sense of Gazdar, Gerald, Ewan Klein, Geofrey Pullum, and Ivan Sag. 1985.
Generalized Phrase Structure Grammar. Cambridge: Harvard
the levels of adequacy introduced by Chomsky (1964).
University Press.
Harris, Zellig S. [1951] 1960. Methods in Structural Linguistics.
Recent Developments Chicago: University of Chicago Press. Repr. as Structural Linguistics, 1960.
Chomsky (1981) introduced the principles and parameters Hauser, Marc D., Noam Chomsky, and W. Tecumseh Fitch. 2002. he
(P&P) model. In this model, the language faculty is considered as faculty of language: What is it, who has it, and how did it evolve?
a set of genetically determined principles that are operative in all Science 298: 156979.
languages. In order to account for the diferences between lan- Hockett, Charles F. 1954. Two models of grammatical description.
guages, principles are assumed to have parameters. A parameter Word 10: 21031.
Householder, Fred W. 1965. On some recent claims in phonological
is a variable in the principle with a predetermined set of values.
theory. Journal of Linguistics 1: 1334.
Language acquisition is then analyzed as inding the right values
Quine, W. V. 1972. Methodological relections on current linguistic
for the parameters. he P&P model solves the tension between
theory. In Semantics of Natural Language, ed. Donald Davidson and
the demands of descriptive and explanatory adequacy because it Gilbert Harman, 44254. Dordrecht: Reidel. his article ofers an alter-
provides a basis for explaining language acquisition while describ- native conception of adequacy for grammars.
ing the range of attested languages. It does not provide an obvious ten Hacken, Pius. 2006. he nature, use and origin of explanatory ade-
constraint on the proliferation of parameters, however. his means quacy. In Optimality heory and Minimalism: A Possible Convergence?
that the indeterminacy problem arises again, but at a higher level. ed. Hans Broekhuis and Ralf Vogel, 932. Linguistics in Potsdam 25.
250
Dhvani and Rasa
his article ofers a version of descriptive and explanatory adequacy Just as the substantive body of Greek literature provided
that adapts them to the latest developments in Chomskyan linguistics. materials for Aristotles therapeutic idea of katharsis, the rasa
. 2007. Chomskyan Linguistics and Its Competitors. London: theorists also had literary materials available to them. hese
Equinox. included all of the epic material and most of the early classical
material that we now possess, and they were acquainted with a
substantial literature in Prkrit [vernacular], most of which is now
DHVANI AND RASA
lost (Ingalls 1990, 5). Although earlier Sanskrit poetics tended
he central concept of ancient Indian aesthetics is rasadhvani. A to be somewhat prescriptive, rasa and dhvani aesthetics is pri-
combination of two interrelated terms, rasa and dhvani, the term marily descriptive. No doubt, this is in part due to widely shared
refers to a complex theory of emotion-genres based on sugges- knowledge of canonical and local literatures, a fully developed
tiveness of language, allusion, and imagery. he theory developed literary culture, and the associated reception aesthetics. From
during the classical period of Sanskrit literature. More recently, the engagement of centuries, thus, emerged the nine emotion-
rasadhvani has drawn the attention of scholars in cognitive sci- genres, or rasas: the erotic (gra), the comic (hsya), the
ence, especially those who study representational emotion (Oatley tragic (karua), the furious or cruel (raudra), the heroic (vra),
2004, 152). he term rasa refers to a readers or viewers aesthetic the fearsome or timorous (bhaynaka), the gruesome or loath-
experience in relation to a work of art, music, or literature, his/her some (bbhats), and the wondrous (adbhuta). To these a ninth
enjoyment (or relish) of it. Dhvani refers to the verbal processes was added later, the rasa of peace (nta) (Ingalls, 1990, 16).
of suggestiveness, or vyajakatva (Ingalls 1990, 9). Vyajakatva Elaborate identiication of determinants (vibhvas) of emo-
of language and other representational signs is essential for trig- tion, its consequents (anubhvas), permanent mood-congruent
gering memory traces in the mind of the recipient. When this states of mind (sthybhvas), and the transient states of mind
happens, an intersubjective mirroring (see mirror systems, (vyabhicri, or sacr bhvas) involved the rasa theorists into
imitation, and language) aligns remembered emotion with making a distinction between two sorts of emotion, rasa and
the represented rasa (Hogan 1996, 17071). bhva. It is not uncommon for early theorists, as well as their
Given the emphasis on an essentially transactive relation- modern commentators, to give difering accounts of the distinc-
ship between the actual and the imaginary, rasadhvani can be tion between rasa and bhva, because this is the most controver-
conceived of as a theory of reader response. However, it is not sial area in rasadhvani studies (Pandit 2003, 16572). he general
merely a reader-response theory but also a systematic theory consensus, however, is that bhva, as indicated, is everyday emo-
of representation. Among the prominent rasa theorists whose tion grounded in self-interest and ego attachment, and rasa is what
treatises have been translated and reprinted are Bharat-Muni we feel empathically in relation to the objective determinants of
(second century b.c.e.), nandavardhana (ninth century c.e.) rasa, most often via characters in iction. English renditions of
and Abhinavagupta (tenth century c.e.). A major contribution the Nyastra translate bhvas as emotional tracts and states,
of these three, among many others, is the linking of emotion to providing long lists under each subtype of what would today be
genre in a systematic way (Oatley 2004, 153). Although each has called emotion categories, ranging from physical emotion to
his own pet project and theoretical (or practical) obsession, the mental states and thought trends (Bharata-Muni n.d., 87113).
common assumption is that a basic emotion insofar as one can Insofar as bhvas are already part of representation, the difer-
conceive of an emotion as basic, such as anger, fear, or love in entiation of rasa as a separate emotive entity assumes that within
its rasa format can be and often is the unifying principle for an the deictic ields of a narrative, characters emotions will func-
artwork, while ancillary emotional states elaborate the unifying tion, deictically, as raw emotions, rooted in egotism and various
rasa through antithesis and collaborative synthesis. forms of misrecognition, but for the reader the emotional experi-
In their account of aesthetic experience, the rasa theorists refer ence will be of rasa. Some of this applies to characters as well.
to physical processes, such as senses and sense perceptions, and In contrast to real people, characters as deictic subjects come to
body, hand, and facial gestures, with the same ease with which have ictionally complete lives (i.e., their stories resolve either
they refer to subtler processes, such as the pra (breath), mental in death or with lovers uniting in the happily ever after); hence,
entities like manas (the mind), citta (cognition), budhi (cognizing some of their emotive experience, especially toward the end, will
intelligence) and ahakra (ego consciousness). In his widely be that of rasa. In desiring, they will go beyond desire to achieve
known work Yogastra, Patajali (third century c.e.) searched for an emotional state based on recognition and understanding.
points of alignment between bodily processes that afect the mind he rasa versus bhava distinction becomes more deinitively
and higher cognition (Patajali 1971, 6694). His ideas on the the- clear in nandavardhanas focus on the importance of nta
ory and practice of yoga aim for personal development, as well rasa, which he considers the greatest happiness (Ingalls 1990,
as social practice based on compassion and non-violence. Later, 16). Citing a verse from the epic Mahabhrata as supporting evi-
nandavardhana and Abhinavagupta considered aesthetic expe- dence, nandavardhana says that nta (the peaceful) is char-
rience, too, as a form of yoga, approximating samdhi (contem- acterized by the full development of happiness that comes from
plative realization), though it remains grounded in the materiality the dying of desire (nandavardhana 1990, 520). Not invested
of experience. To this purpose, they reined the folk science of in the dying of desire but its aesthetic transformation, his suc-
emotions while trying to give an account of why and how literary cessor Abhinavagupta too considers nta an essential part of
works induce emotional states and thought trends that produce the rasa emotionality and, hence, a part of all rasas. Countering
aesthetic pleasure, why representations of fear, anger, horror, and objections that nta cannot be regarded as an emotion, he
so forth are enjoyable and deeply satisfying. asks: What shall we call the heroism of compassion? Is it the
251
Dhvani and Rasa
heroism of religion, or the heroism of generosity? It is neither; it by erasing the primary sense, mukhrtha, of her words, the lonely
is simply another name for the peaceful [nta] (Abhinavagupta wife makes an erotic suggestion through a literal negation of it.
1990, 525). his is a very simple example. Others from the epics Rmyaa
Upon careful examination of the primary texts on rasadhvani, and the Mahbhrata are much more complicated.
nandavardhanas Dhvanyloka and Abhinavaguptas com- In explicating some of these epic and classical examples from
mentary, one is led to believe that while bhva covers a range the rasadhvani perspective, Abhinavagupta developed a theory
of emotions, representational and nonrepresentational, rasa is of memory equipped with the notion of memory banks, storage
emotion aligned with the mind-steadying potentialities of nta. and retrieval processes. he connection to memory semantic,
he critics of this idea, who are rebufed by Abhinavagupta, mis- emotional, and episodic allowed the subsequent integration of
take steadiness of the mind for stillness and wonder if nta can rasadhvani into contemporary cognitive science (Hogan 1996,
be a rasa. 17076).
In addition to considering nta an emotion-genre as well as Following traditional theories of consciousness,
an overarching aesthetic for all rasas, Abhinava combines the Abhinavagupta believes that all experiences perceptual,
concept of rasa with the concept of dhvani. It is through pat- cognitive, emotional, etc. leave traces in the mind (Gnoli
terned verbal suggestion (dhvani) that the violent, the hateful, 1968, 79). Relecting on why representational grief is relished,
the horriic, the furious will give rise to aesthetic enjoyment: the Abhinavagupta explains that the basic emotion for grief is
rasa experience. here are various forms of dhvani, but Abhinava compassion. And, compassion consists of relishing (or aes-
considered rasadhvani the most important. he most basic form thetically enjoying) grief. hat is to say, where we have the basic
of dhvani is vastudhvani, which refers to suggestion of a thing, emotion of grief, a thought trend that its with the vibhvas and
or a fact. It is not necessarily emotive, but emotive dhvani can anubhvas of this grief, if it is relished (literally, if it is chewed
build on it. he dhvani movement began with a paradigm shift over and over), becomes a rasa and so from its aptitude [towards
away from the igures of speech (alakra) emphasis in earlier this end] one speaks of any basic emotion as becoming rasa
Sanskrit poetics. However, it did not abandon that idea. he con- (Abhinavagupta 1990, 117; insertions by Ingalls). Drawing a gen-
cept of alamkdhvani combined connotation through igures of eral conclusion, he adds: he basic emotion is put to use in the
speech with suggestion, though rasadhvani was still being con- process of experiencing the rasa of literary and art works, as
sidered more important. thought-trends are transferred from what one has already expe-
In this connection, Daniel H. H. Ingalls notes that in Greek rienced in ones own life to one which one infers in anothers life
rhetoric, signiicatio may seem like a close parallel to dhvani, but it (ibid.). his process involves distancing of ones own emotion
is the igure that draws attention to itself. He continues: [O]nly from self-interested concerns to something larger.
under allegory and irony does Greco-Latin rhetoric come to what Abhinavaguptas notions of memory-trace (saskra)
would qualify with nanda as dhvani, and at that only vastudh- and desire-trace (vsan) are not unrelated to his theologi-
vani (Ingalls 1990, 38). Unlike signiicatio, dhvani is not a trope; cal preoccupation with how to free the mind from egocentric
it does not draw attention to itself. It is a suggestive process, an attachments (and ephemeral satisfactions) to incline it toward
aesthetic strategy that erases the primary meaning, mukhyrtha, transcendental joy: nanda. he importance given to nta rasa
of a word, or phrase, to incline it toward suggestiveness within has the same origin. he materiality of aesthetic experience, for
the textual context of a representational schema. Abhinavagupta, is a means of moving toward the nonmateriality
It is important to keep in mind that the term dhvani was origi- of transcendental experience. Socially, the rasa theory focuses
nally borrowed from the grammarians for whom it had a techni- on education of emotions that would produce sahdaya citizens.
cal meaning. In Vkyapadya, Bharthari deines dhvani in the Abhinavagupta deines sahdayatva (literary sensitivity) as the
following manner: he true form [that is the semantic content] faculty of entering into the identity of the poet (Ingalls 1990,
in the word that is manifested by the dhvani is determined by a 72). In a more modern context, sahdaya is a person trained in
series of cognitions [viz., the cognitions of successive phonemes], rasadhvani-generated understanding of emotion and social obli-
which are unnameable [that is to say, each phoneme-cognition gation (Hogan 2003, 1217).
in itself is unassignable to this word or that], but favorable to the An instance from John Websters Duchess of Mali ([1623]
inal [word-identifying] cognition (quoted in Ingalls 1990, 170; 1961), a work clearly not known to the rasadhvani theorists, will
bracketed insertions from Ingalls). demonstrate to what extent applications of the rasadhvani aes-
In poetic theory, the technical, linguistic meaning of dhvani thetics are contingent neither on a shared origin nor on areal
is used only metaphorically. Graduated phoneme-cognition connectedness of traditions. Briely, Websters play revolves
stands for the temporality of the reading process, where one is around a duchesss secret marriage, upon her early widow-
engaged but is not sure what the dhvani meaning might be until hood, to her steward, against the wishes of her socially powerful
the narrative, musical, or other equivalent of the inal word- brothers. he marriage remains a secret for some time, but in the
identifying phoneme-cognition is registered on the mind. middle of the play the duchess is separated from her husband,
A brief example of the operations of dhvani used in the Antonio, and is imprisoned along with her children and servant
Sanskrit tradition is a Prkrit verse, where a wife gives sleeping woman, Cariola. Toward the end of the play, a loyal friend, Delio,
directives to the stranger who is a guest for the night (while the and Antonio approach the cardinals house in hopes of reconcil-
husband is away). She says Mother-in-law sleeps here, I there; ing Antonio to his wifes brothers. While the reader knows that
/ look, traveler, while it is light. / For at night when you cannot the duchess, her children, and Cariola have been murdered,
see, / You must not fall into my bed (Ingalls 1990, 14). Clearly, Delio and Antonio do not, at least not with certainty.
252
Dhvani and Rasa
he deserted point of entry they choose is a walkway around of this inner story is like a clear stream, resonant with muted
a fortiication near the ancient ruin of an abbey. A piece of the gra (the rasa of romantic love), and is constituted by vari-
old cloister around there Gives the best echo, says Delio to ous determinants, consequents, and permanent and transient
Antonio. he words repeated by the echo are so clear in their emotional states. For instance, one permanent mood-congru-
enunciation, Delio remarks, that it is regarded by some as a ent state of mind (sthy bhva) is the duchesss love for her
spirit (5.3.18). As one would expect, the echo repeats the last husband and her enjoyment of her secret marriage, despite the
words and phrases of what Antonio and Delio say to each other, dread. Another is her vtsalya, mother-love. At the moment of
and these repeated sentence fragments (italicized in many edi- her brutal murder in prison, when Cariola cries out that she will
tions of the play) compose a text of their own with a dhvani mes- die with her, the duchess pleads: I pray thee, look thou givst
sage to Antonio: a message that holds a mirror to his suspicions my little boy / Some syrup for his cold, and let the girl / Say her
and dread. prayers ere she sleep (4.2.18890). Some of the transient emo-
As the conversation continues, Delio remarks that like tional states (sacri bhvas) are her hopes, fears, worries, and
human beings, ancient monuments also become deceased anxieties as she conceals her pregnancies and plans for her fam-
and so Must have like death that we have (5.3.19). he echo ilys safety.
repeats: Like death that we have (5.3.20), changing the referent he precision and economy of Websters semantic technique
for we from humans in general to the duchess and her chil- presents the reverberated sentence fragments not as random
dren. For the reader, the echo-utterance is her utterance, not repetitions. Rather, they are put together like semantic paral-
because she has become a ghost but because we know that the lels to the sequence of phoneme-cognitions, where the inal
duchess would like to inform and warn her husband. On his part, phoneme, retrospectively, forms a meaningful unit the gram-
Antonio imagines that the echo spoke in A very deadly accent marians concept of dhvani, as mentioned earlier. Here, the inal
(23), as instantly it echoes back: Deadly accent (21). Delio, less semantic unit, like the inal phoneme, brings forth the full eicacy
illed with dread and more with anxious anticipation, thinks this of rasadhvani when the two auditors (Antonio and Delio) stand
utterance signiies some thing of sorrow (25). Expectedly, the awed by the inal words: Never to see her more (5.3.45), echoed
phrase, A thing of sorrow is reverberated and collated with the in answer to Antonios My duchess is asleep now, / And her lit-
repetition of hat suits it best (256). Drawn thus far into the tle ones, I hope sweetly, O heaven, / Shall I never see her more
echo-utterance, Antonio says: Tis very like my wifes voice; the (424). After his conditional statement is changed to an assertion
echo promptly repeats: Ay, wifes voice (27). he next few syn- by the echo, never see her more, Antonio says: I marked not
tactic and semantic fragments, similarly taken out of the stated one repetition of the echo / But that; and on the sudden, a clear
context when repeated by the echo are Do not (30) suggesting light presented me a face folded in sorrow (469). What Antonio
that they should not go in. his pattern continues for some time, has known subconsciously for some time, but has not dared to
till the text develops through echoes of Fly your fate (38); For believe, is communicated to him with concern, consolation, and
thou art a dead thing (41). ininite sorrow minutes before he will have no time for relec-
At the narrative level, clearly, the echo phenomenon fore- tion (on his fate), or for emotion of any kind.
shadows the end; yet there is no need for foreshadowing. he
Lalita Pandit Hogan
reader knows that nothing good can come out of Antonios going
into the house. It is equally clear that Webster is not using the
supernatural trope. he echo-text is like a code; it uses vastud-
hvani in the way the primary sense in the sentences spoken by Abhinavagupta. 1990. he Dhvanyaloka of nandavardhana with Locana
each speaker, their mukhyrtha, is erased. A suggested mean- of Abhinavagupta. Trans. Daniel H. H. Ingalls, Jefrey Moussaief
ing, or the dhvani meaning, is inserted systematically. Up to this Masson, and M. V. Patwardhan. London: Harvard University Press.
nandavardhana. 1990. he Dhvanyaloka of Anandavardhana with
point in the play, the plot line involving the duchess and her fam-
Locana of Abhinavagupta. Trans. Daniel H. H. Ingalls, Jefrey Moussaief
ily, though it has had moments of tragic grandeur, was mostly
Masson, and M. V. Patwardhan. London: Harvard University Press.
marked by abjection and horror. Into this general scenario, the Bharata-Muni. n.d. he Natya Sastra. Delhi: Satguru Publications
well-placed junctural scene inserts a relective moment of tran- Gnoli, Raniero. 1968. he Aesthetic Experience According to Abhinavagupta.
quility (of nta), with a muted lavor of retrospective and pro- Varanasi, India: Chowkhamba Sanskrit Series.
spective experience of grief. Hogan, Patrick. 1996. Towards a cognitive science of poet-
hrough the dhvani resonance of the echo scene, the tragic ics: Anandadhana, Abhinavagupta, and the theory of literature.
fate of this holy family (Antonio last parted from his wife near College Literature. 23.1: 16478.
the Loretto convent, where she was captured by her brother, the . 2003. Introduction: Tagore and the ambivalence of commit-
cardinal [3.5.1140]) is, thus, represented through karua rasa, ment. In Rabindranath Tagore: Universality and Tradition, ed. Patrick
the aesthetic feeling of compassionate sorrow. In contrast, what Colm Hogan and Lalita Pandit, 923. Madison, NJ: Fairleigh Dickinson
University Press.
happens to the brothers is represented through the emotion-
Ingalls, Daniel H. H., ed. 1990. he Dhvanyaloka of nandavardhana
genre of absurd horror. From the start, Webster walls of the
with Locana of Abhinavagupta. London: Harvard University Press.
inner story of the duchesss secret marriage, her motherhood, Oately, Keath. 2004. Emotions: A Brief History. Oxford: Blackwell
and marital happiness from the overall clamminess of a sordid Pandit, Lalita. 2003. he psychology and aesthetics of love: ringra,
world, where the most heinous sinner is the one who punishes Bhvan, and Rasadhvani in Gora. In Rabindranath
others for violation of social rules. Despite Websters many iro- Tagore: Universality and Tradition, ed. Patrick Colm Hogan and Lalita
nies (embodied in the character of Bosola), the emotional logic Pandit, 14174. Madison, NJ: Fairleigh Dickinson University Press.
253
Dialect Dialogism and Heteroglossia
Patajali. 1971. Yogasutra. Ed. and trans. J. R. Ballantyne and Govind community may lead to new dialects. Because dialect boundaries
Shastri Deva. Varanasi, India: Indological Book House. are fuzzy, contiguous regional dialects may form dialect chains.
Webster, John. Duchess of Mali. [1623] 1961. In Elizabethan Drama, ed. Within the chain, two adjacent dialects will display greater simi-
Leonard Dean, 271360. 2d ed. Englewood Clifs, NJ: Prentice-Hall. larity than two dialects that exist at a distance from one another.
Richard Cameron
DIALECT
In pursuit of a useful deinition of dialect, one may begin by SUGGESTION FOR FURTHER READING
distinguishing language from dialect. hink of a language as a Labov, William, Sharon Ash, and Charles Boberg. 2006. he Atlas of
set. A dialect is a set member of that set. Just as a set does not North American English: Phonetics, Phonology and Sound Change.
exist in the absence of its set members, so a language does not Berlin: Mouton/de Gruyter.
exist in the absence of its dialects. A dialect is often identiied
as a variety of a language spoken by a group of people. hus, a
DIALOGISM AND HETEROGLOSSIA
dialect involves the shared linguistic behavior and knowledge
of a group, not of the idiosyncratic individual. his variety may Dialogism and heteroglossia, as well as polyphony, chronotope,
be understood, to varying degrees, by people who speak other and an array of related terms used in literary criticism, cultural
contemporary varieties of the same language. his criterion of studies, and postcolonial studies, are most commonly asso-
mutual intelligibility is problematic as diferent languages can ciated with the writings of Mikhail Bakhtin (18951975). he
also show degrees of mutual intelligibility. (Consider Swedish concepts connected with these terms serve to question vari-
versus Norwegian.) Consequently, to speak of a dialect presup- ous forms of epistemologism or ixing of putative knowledge in
poses a prior identiication of the language to which the variety a timeless, generalized, absolute form, a tendency common to
belongs. Within this language, each variety will difer from others many European philosophical systems of the nineteenth cen-
in terms of pronunciation, vocabulary, and/or grammar. More tury and before (Holquist 1990, 18). Detailed elaborations of
precisely, each variety may difer in its phonetic implementation dialogic terminology by the original authors and commen-
rules, phonology, lexical items or semantics, morphology, tators suggest a general distrust of any transformation of tem-
syntax, and the pragmatic functions of discourse markers or poral experience into objectivist abstraction through a reifying
syntactic structures. Although diferences of speech-act real- vocabulary. Even though dialogical theory has accumulated its
izations, narrative types and topics, or conversational routines own privileged vocabulary, its deeper project is to challenge the
can also exist, less research has explored these issues. monologic discourses of authority in favor of internally per-
Generally, diferences between varieties of the same language suasive discourses, which are anchored in the polyglossia of
may be quantitative or qualitative in nature. What is a quantita- social life (Bakhtin 1981, 3345). From this perspective, dialogue
tive diference? Consider words like the or that. Most speakers of is essential to the generation, accumulation, and dispersion of
English will sometimes pronounce the initial consonant as a fric- knowledge.
ative [] and sometimes as a stop [d]. his alternation between Given this emphasis on dialogue, it is somewhat ironic that the
[] and [d] can be quantiied. In turn, the frequencies with which disputed authorship of various writings of the Bakhtin Circle has
individuals say [] versus [d] may correlate to categories of iden- given rise to a great deal of debate over the years (Holquist 1990,
tity, such as age, gender, class, or ethnicity. Alternating forms, 21011). In their deinitive biography, Katerina Clark and Michael
like [] ~ [d], are called variants of a sociolinguistic variable. his Holquist take pains to establish that the volumes attributed to
variable is represented as (dh) in the literature. Sociolinguistic Valentin Voloshinov and P. N. Medvedev were really authored
variables like (dh) show quantitative diferences across social by Bakhtin. His involvement in reverse plagiarism, the biog-
classes (upper favors [] more than lower), genders (women raphers attest, is a consequence of Bakhtins love of conversa-
favor [] more than men), and age (working adults favor [] more tion, the give and take of good talk, which is the keystone of his
than young children). Such frequency diferences have also been dialogism (Clark and Holquist 1984, 14670). Looking toward
termed group-preferential diferences. applications of dialogical theory to digital media, classroom
What is a qualitative diference? Consider the Spanish la casa pedagogy, cognitive theory, and empirical studies, others main-
(house) and la caza (hunt). In Puerto Rico, they are both pro- tain that for searching beyond early dialogism, it is important
nounced [kasa]. In Spain, casa is [kasa] but caza is [kaa]. hus, to keep in mind that there were important diferences in ide-
in one dialect we ind a feature that is absent from another. his ology among the members of the Bakhtin Circle (Bostad et al.
is a qualitative diference. Some researchers identify this as a 2004, 7). Nevertheless, they all shared a conviction that socio-
group-exclusive diference. historical embeddedness of symbolic tools implies that signs
Qualitative and quantitative dialect diferences may map onto carry their previous use with them without having entirely ixed
groups deined by geographical regions or by social characteris- meanings (ibid., 11). his provides suicient internal coherence
tics. hus, researchers speak of regional or social dialects. his to dialogical theory.
difers from the common belief that dialects are only regional.
Because diferences among the genders or age groups occur, we Some Basic Concepts
may ask if varieties associated with gender or age are dialects as In the course of discussing the novel as a polyphonic genre,
well. In practice, researchers reserve social dialects for class and Bakhtin deines heteroglossia as the diversity of speech genres
ethnic varieties, even though gender and age diferences in a that are rooted in social life. he novel, he says, orchestrates all
254
Dialogism and Heteroglossia
its themes, the totality of the world of objects and ideas expressed by continual mainstreaming of its message, but also by contin-
in it by means of the diversity of speech genres (razonreie) and ual and only partially successful marginalization of its residual
by difering individual voices that lourish under such condi- polyphony and heteroglossia.
tions (1981, 263). His preference for the novel over the epic is
based on the notion that high genres distance literary form from What Dialogism Is Not
the diversity of speech genres by choosing an elevated style, a One might reasonably ask here if dialogism is just another way of
unitary language, at the expense of the common language referring to Hegelian (or Marxist) dialectic or Saussurean bina-
and its polyphonic utterances. his joining of diversity must rism. In fact, there are striking diferences, even contradictions,
be simultaneous, not segmented into diferent times. In other among these concepts. Dialogism may, in fact, be seen as quar-
words, it must be heteroglossic simultaneity. hat simultaneity is reling with the tropes of dialectic and binary opposition. On the
neither unitary nor directed at a telos. Its only goal is interactive other hand, these quarrels never resort to confronting systems
self-understanding. with their elaborated antitheses because of the complicities this
Interaction is produced by dialogue, which brings us to dia- entails (Pechey 1999, 326). In other words, Bakhtinian dialo-
logism. Foregrounding utterance as the most salient unit of gism refuses an agonistic identiication with the aggressor, as it
language and dialogue as the social and biological condition in resists any kind of maximal codiication of its own discourse and
which language lourishes, Bakhtin states: We are taking lan- terminology.
guage not as a system of abstract grammatical categories, but Consider dialectical thinking. Although it allows for a thesis
rather language conceived as ideologically saturated, language and an antithesis, dialectical thinking resolves diferences in a
as world view, even as concrete opinion, insuring a maximum synthesis. Dialogue, on the contrary, advocates a copresence of
of mutual understanding in all spheres of ideological life (1981, diferences. More importantly, in dialogue, diferences are never
271). Just as language is not merely a system of abstract gram- only two that can be conveniently resolved into one. Diferences
matical categories, but concrete opinion and world view, are many and dispersed because the voices in a society are plural
truth (pravda) is ongoing dialogue, never-ceasing talk. Since and dispersed.
talk brings into play both the cognitive structure of the brain and Similarly, even though dialogism deines itself against mono-
ones immersion in the social life of a language, consciousness is logism and, thus, sets up an initial binary opposition, dialogical
necessarily participative consciousness, and the primary unit theory does not sponsor binaryism, nor can it be confused with
of polyphony in the novel is the author-hero dialogue that pro- antibinaryism. In its most productive forms, dialogism is asym-
duces an open-ended uninalizable hero (Reed 1999, 117). metric dualism (Holquist 1990, 52). Relecting on the shift
Although the speciic subject for these formulations is from bipolar thinking to triadic thinking in dialogism, Sigmund
Dostoevsky, the implications apply to literature, philosophy, Ongstad draws attention to the super-addressee as the invisible
linguistics, ethics, and social life in general. Similarly, the car- third party in dialogue. In Bakhtin words, any dialogue takes
nivalesque, as explored by Bakhtin in Rabelais, includes dia- place against the background of the invisible thirds responsive
logically signiicant speech genres, such as curses, oaths, slang, understanding (quoted in Ongstad 2004, 78). he role of the
humour, popular tricks and jokes, scatological forms, in fact all super-addressee is, it must be noted, participatory, not telic or
the low and dirty sorts of folk humor (Peter Stallybrass and ontological.
Allon White, quoted in Emerson 1999, 247). For Bakthin, how-
ever, the structures of addressivity and answerability the ways Dialogism, Heteroglossia, and Answerability: An Example
in which all speech is oriented toward an addressee and calls from King Lear
for a dialogic answer have a larger constitutive function than A brief example from King Lear will clarify this point. In the
one might at irst infer from the phrase speech genres. Within the opening scene of the play, Lear asks his daughters how much
frames of heteroglossic creativity and constraint, Baktin suggests, they love him. After the irst two have spoken, Lear asks what
Heroes are genres, and trends and schools are second and third Cordelia can say to draw a third more opulent share of royal
rank protagonists (1990, 78). largesse. Cordelias cryptic answer, Nothing, clashes violently
To say that some discourse is or is not dialogical itself risks with the extravagant professions of her sisters. he addressee for
reifying that discourse in a monological way, however. hus, all their speeches is Lear, also the others who are present. he
Bakhtin stresses, we may dialogize a discourse. Indeed, this per- super-addressee is the invisible viewer/auditor of Shakespeares
spective can be used to enter into dialogue with Bakhtins rele- time and ours. When Lear asks Cordelia to mend her speech,
gation of the epic to the discourse of authority. A Bakhtinian lest she mar her fortunes (1.1.945), it is clear that the king
approach entails that epics are utterances grounded in temporal has already decided to punish by withholding patrimony (and
social realities that, at some point in time, attain a degree of eval- love), but the father worries. His caution expects ilial answer-
uative inalization and absorb a degree of valorized perception. ability (of speech and action). Lears later characterization of his
he Indian epic Rmyaa, for instance, with its regional ver- loved daughter as Dowered with our curse and strangered with
sions still available, along with their link to lively speech genres, our oath (1.1.205) does not merely show that he is hiding his
can be understood as an ongoing societal conversation about shame in anger; it attests to the fact that, though the relationship
ideas of order in family and state, about right rule, duties of wife has fallen apart, the frames of addressivity and answerability
to her husband, brother to brother, ruler to subject, and so forth. are intact. Consequently, the dramaturgy of the scene is marked
However, it should not be forgotten that the Rmnyaas long by a violent clash between the discourse of authority shown
life in cultural memory comes from its lasting value determined in Lears dividing of the kingdom and the mutually struggling
255
Dialogism and Heteroglossia Diffusion
internally persuasive discourses of the ambitious, unloving . 1990. Art and Answerability: Early Philosophical Essays. Ed. and
daughters and the not-ambitious, loving daughter. trans. Michael Holquist, Vadim Liapunov, and Kenneth Brostorm.
he heteroglossic vitality of this moment in the play is Austin: University of Texas Press.
enhanced by Lears own utterances as they move back and forth . 1993. Towards Philosophy of the Act. Ed. Michael Holquist and
Vadim Liapunov, trans. Vadim Liapunov. Austin: University of Texas
from the discourse of authority to an internally persuasive dis-
Press.
course. When Cordelia does not mend her speech enough and
Bostad, Finn, Craig Brandist, Lars Sigfried Evenson, and Hege
chooses, instead, to mar her fortunes, risking her hurtful answer Charlotte Faber, eds. 2004. Bakhtinian Perspectives on Language
Lear asks: But goes thy heart with this? he poignant question and Culture: Meaning in Language, Art and New Media. New
betrays an involuntary admission that the heart does not need York: Macmillan, Palgrave.
to go with the words one speaks, as much as it is indicative of his Brandist, Craig. 2002. he Bakhtin Circle: Philosophy, Culture and Politics.
growing sense that the logic of the discourse (of love) he initiated London: Pluto Press.
has been undermined. Having used up his kinglike and fatherlike . 2004. Law and genres of discourse: he Bakhtin Circle theory
powers, Lear resorts to accusation: So young and so untender? of language and the phenomenology of right. In Bostad et al. 2004,
uttered, once again, to the answerability of Cordelias self-de- 2345.
fense: So young, my lord, and true (1.1.1058) Clark, Katerina, and Michael Holquist. 1984. Mikhail Bakhtin.
London: Harvard University Press.
his kind of responsiveness to the other is the deining feature
Emerson, Caryl, ed. 1999. Critical Essays on Mikhail Bakhtin. New
of heteroglossic simultaneity, perhaps its central ethical norm.
York: G. K. Hall.
At the beginning of the scene, Lear is not responsive to Cordelia Holquist, Michael. 1990. Dialogism: Bakhtin and his World. 2d ed. New
as his other. he super-addressee, embodied later in Lears York: Routledge.
Fool, senses that Lear is like a monologic author, and his daugh- Ongstad, Sigmund. 2004. Bakhtins triadic epistemology and ideologies
ters are his heroines and villains. As an unsurprising parallel, of dialigism. In Bostad et al. 2004, 6588.
Bakhtin, too, thinks of the hero as a model for a person in society Pechey, Graham. 1999. Boundaries versus binaries: Bakhtin in/against
and insists that a hero is not a voiceless object but a legal per- history of ideas. In Emerson 1999, 32137.
son with rights. Deictically imagined as someone who acts, talks, Reed, Natalia. 1999. he philosophical roots of polyphony: A
and exists in a world of heteroglossic simultaneity, heroes are Dostoevskian reading. In Emerson 1999, 11752.
Shakespeare, William. 2001. King Lear. Ed. R. A. Foakes. London: homas
formally equal subjects of law that is immanent to the relations
Learning.
between persons themselves (Brandist 2004, 30; see deixis). In
this case, Cordelia, though not the hero of this play, is intent on
establishing herself as a person with a voice and a legal person DIFFUSION
with rights, just as her sisters are invested in gaining access to
he study of linguistic difusion is concerned with describing and
social power.
explaining how languages or language features spread over time
In stylistic terms, Lears utterances conlate formal lan-
and space. On a macrolevel, difusion refers to the dispersion of
guage of the court with speech genres of familial conversation.
languages from a common point of origin. hrough migration
He changes nouns to unusual verbs, as in strangered and
and subsequent isolation, thousands of languages have devel-
dowered; repeats the plural irst person pronoun, our, twice
oped from a highly limited set of protolanguages. he physical,
in the same line, calling attention to himself as speaker; substi-
demographic, and social constraints on language dispersion can-
tutes curse and oath for authoritative command; and echoes
not be reduced to a simple algorithm, and the reconstruction of
Cordelias Nothing at the moment of irst shock that a daughter,
protolanguages, language family relationships, and patterns
who is also his subject, would be so insubordinate (1.1.8790).
of spatial dispersion remain a primary challenge in historical
In the course of the play, Lears unitary consciousness has
linguistics.
to become a participative consciousness. Most importantly, Lear
On a language-speciic level, difusion is concerned with the
has to acquaint himself with the consciousness(es) of his daugh-
spread of particular linguistic innovations across the varieties of
ters as others. In this way, the play seems to point directly
a language or, in some cases, across languages. Linguists, partic-
toward Bakhtinian conclusions. he raison detre for heteroglos-
ularly sociolinguists, dialectologists, and historical linguists,
sic simultaneity as an aesthetic-ethical norm is to sustain the
seek to identify the mechanisms of transmission and the factors
chaos of human experience and the noise of language within a
that promote or inhibit the spread of language traits. A linguistic
polyphonic system, not to underwrite it through monological
change is initiated in a particular locale at a given point in time
glossing.
and spreads outward from that point in progressive stages so that
Lalita Pandit Hogan earlier changes reach the outlying areas later. he wave model
assumes that a change spreads in concentric layers, as waves
WORKS CITED AND SUGGESTIONS FOR FURTHER READING radiate outward from a central point of contact when a pebble is
Bakhtin M. M. 1981. he Dialogic Imagination: Four Essays. Ed. dropped into a pool of water. Forms that follow this straightfor-
Michael Holquist. Trans. Caryl Emerson and Michael Holquist. ward time and distance relation follow the pattern of contagious
Austin: University of Texas Press. difusion (Bailey et al. 1993).
. 1984. Problems of Dostoevskys Poetics. Ed. and trans. Caryl Because of physical, social, and psychological factors, a model
Emerson. Introduction by Wayne C Booth. Minneapolis: University of that considers only time and distance is too simplistic to account
Minnesota Press. for the spread of linguistic forms. Difusion researchers cite at
256
Diffusion Digital Media
least ive factors that inluence the dispersion of customs, ideas, groups, thereby difusing throughout the group and to other
and practices: 1) the phenomenon itself, 2) communication net- groups or communities within a population (Labov 2001).
works, 3) distance, 4) time, and 5) social structure. Although lin- One important study of language difusion in the southern
guistic structures are inherently quite diferent from phenomena United States (Bailey et al. 1993) shows that although many
such as technological innovations, they are subject to many of linguistic innovations follow the more common hierarchical
the same social and physical factors that inluence the nature of pattern of cascade difusion, some features may display the
difusion in general. opposite difusion pattern. For example, the use of the special
A gravity model or hierarchical model of language (Trudgill intentional modal ixin to, as in heyre ixin to go now, once
1974) often provides a better proile of the difusion of linguistic heavily concentrated in rural areas of the American South, has
forms than a simple wave model. In the gravity model, which is now been adopted in some larger, urban population centers.
borrowed from the physical sciences, difusion is a function not he explanation for this contrahierarchical difusion pattern is
only of the distance from one point to another, as with the wave tied to its symbolic marking of traditional southern American
model, but of the population density of areas that stand to be speech. In the face of a large inlux of outsiders into the region,
afected by a nearby change. he interplay between the popu- native urban residents may seek to assert their southern iden-
lation density of two areas and the distance that separates them tity by adopting selected structures strongly associated with the
parallels the efects of density and distance on gravitational pull. regional South, showing that the social meaning attached to
Changes are most likely to begin in heavily populated cities that linguistic forms has to be considered along with geographical,
serve as cultural hearths. From there they radiate outward, but demographic, and interactional factors in explaining linguistic
not in a simple wavelike pattern; rather, innovations irst reach difusion.
moderate-size cities that fall under the area of inluence of some
Walt Wolfram
large, focal city, leaving nearby, sparsely populated areas unaf-
fected. Gradually, innovations ilter down from more populous,
denser areas to less densely populated areas, afecting rural
areas last, even if such areas are quite close to the original focal Bailey, Guy, Tom Wikle, Jan Tillery, and Lori Sand. 1993. Some patterns
area of the change. he spread of change is thus like skipping a of linguistic difusion. Language Variation and Change 5: 35990.
stone across a pond, rather than dropping a stone into a pond, as Labov, William. 2001. Principles of Linguistic Change. Vol. 2. Social
Factors. Malden, MA, and Oxford: Blackwell.
in the wave model. he model of change following this pattern is
Rogers, Everett M. 1995. Difusion of Innovations. 5th ed. New York: Free
referred to as cascade difusion.
Press.
One of the noteworthy examples of cascade difusion is a Trudgill, Peter. 1974. Linguistic change and difusion: Description and
vowel shift currently taking place in the northern cities of the explanation in sociolinguistic dialect geography. Language in Society
United States. Part of this elaborate rotation involves the shift 3: 21546.
of the vowel of thought so that it sounds more like the vowel of
lot. he lot vowel, in turn, sounds more like the vowel of trap,
which moves closer to the pronunciation of the vowel of dress.
DIGITAL MEDIA
his vowel shift proceeds from larger cities in the North to suc- Digital media are those media whose means of production and
cessively smaller ones, leaving in-between rural areas relatively distribution are digitized via computers; the term is commonly
unafected until the later stages of the change. used in contrast to older forms of media, such as print (for text)
Gravity models of change include factors of distance and or analog devices (for sound and images). In language studies,
communication networks as a function of population density, digital media most commonly refer to the Internet, the World
but they do not recognize the role of other social and psycho- Wide Web, mobile telephony, and other networked and wireless
logical factors. For example, changes do not spread evenly across technologies that support human communication known as
all segments of the population. Members of upwardly mobile computer-mediated communication (CMC) and the transmis-
social classes usually adopt linguistic innovations more quickly sion of information. Digital media can also refer to digital storage
than do members of other classes, and women and younger devices for data, sound, video, and graphics. Here, we are con-
people are often leaders in certain kinds of language change. It cerned primarily with the former sense, especially the impact of
is therefore essential to track changes not only across geographi- digital communication technologies on peoples individual and
cal space and population density but also across diferent age, collective use of and relation to language.
ethnic, gender, and social status groups (cf. age groups and
gender and language). History
In terms of social networks, the irst people to adopt changes Communication via digital media can be traced to the inven-
are those with loose ties to many social groups but strong ties to tion of packet switching technology in the 1960s, which enabled
none due to the face that strong ties inhibit the spread of change. messages to be exchanged among networked computers. he
In order for the changes to make their way into more close-knit ARPANET, the predecessor of the Internet, was implemented as
groups, they need to be picked up by people who are central ig- a United States defense department project in 1969; by the mid-
ures in these groups but who are willing to adopt change none- 1970s, it had become popular for human communication via
theless, perhaps for reasons of prestige. Because these early e-mail and mailing lists. In 1979, the USENET was created as an
adopters are well regarded in their social groups, the changes alternative, grassroots network; USENET newsgroups, along with
they adopt are likely to be picked up by other members of these various BBS (bulletin board systems) and networks hosted on
257
Digital Media
private servers during the 1980s, were eventually integrated into and communication technologies that were previously consid-
the Internet, the term used after 1983 for the collection of net- ered distinct. he incorporation of text chat into multiplayer
works that had grown around the ARPANET. By the late 1980s, online games and the ability to send text messages from mobile
the Internet ofered public real-time chat via Internet Relay Chat phones to interactive television (iTV) programs illustrate the lat-
and MUDs (Multi-User Dimensions), along with email, mailing ter trend.
lists, and newsgroups. Around the same time, Internet service heoretical debate has also centered around the efects of
providers (ISPs) were starting to make the Internet accessible digital technology on human communication. A strong techno-
to people in their homes, rather than just from businesses and logical determinism position holds that production and recep-
universities. tion constraints on CMC inevitably shape digitally mediated
he introduction of the World Wide Web in 1991 and the irst language and language use. Such a position inds support in
graphical browser in 1993 transformed the Internet by enabling research indings that technical constraints on message exchange
networked multimedia. By the mid-1990s, Internet telephony disrupt and reshape turn-taking patterns across a range of digital
and videoconferencing were available, along with graphical vir- genres (Herring 1999). A weaker version of technological deter-
tual worlds. Despite the increasing availability of bandwidth to minism holds that features of speciic technologies predispose
support multimedia, however, text retained its popularity. he users to communicate in certain ways, but that users may over-
late 1990s saw the emergence of several text-based applica- ride those predispositions. For example, the synchronicity of
tions: instant messaging, weblogs (blogs), and text messaging on CMC systems tends to afect message length, complexity, and
mobile phones (especially in Europe and Asia). formality (with messages in asynchronous modes being gener-
A more recent trend has been toward mobile media and lex- ally longer, more syntactically complex, and more formal than
ible access. Starting with external hard drives for external data in synchronous modes), although both formal and informal lan-
storage and continuing with laptops, personal digital assistants guage can be found, for example, in email (asynchronous) and
(PDAs), iPods, and smartphones, digital media have moved chat (synchronous), depending on the topic and purpose of the
away from desktop computing toward more distributed, light- communication.
weight, faster devices. he social construction of technology theory goes further to
assert that users shape technologies through their use as much or
Language-Related Issues more than their use is shaped by those technologies (Bijker and
he rapid rise in popularity of digitally mediated communication Law 1992). his view receives support from computer-mediated
over the past two decades has attracted considerable interest cooperative work and online education, where the nature of
from language scholars. he central debates have focused on the tasks structures communication in often predictable ways.
how to classify such communication relative to speech and writ- Further, many face-to-face social and interactional dynamics,
ing, the efects of technology on language and language use, the including gendered patterns of communication, are reproduced
purported anonymity of text-based CMC and its social and lin- in digital discourse, albeit diferently in academic discussion
guistic consequences, and the long-term efects of digital media forums than in chat. In an efort to account for such variation,
on individual languages and the global language ecology. a fourth position holds that there is no single way in which tech-
Computer-mediated communication is sometimes claimed nology inluences mediated language; rather, it depends on the
to constitute a third modality of language, alongside speech and particular constellation of technical and social variables that
writing. Text-based CMC, by far the most common manifestation characterizes a given sample of mediated discourse (Herring
of digital communication, blends the production and reception 2007). A desideratum for future research is a coherent theory
features of writing (typing on a keyboard or otherwise entering that can predict when speciic types of media will have particular
characters into an alphanumeric interface; reading messages on communicative efects.
a screen) with the structural and interactional features of spoken Another nexus of debate concerns the purported anonym-
conversation (e.g., informality, phatic content, relatively rapid ity of digitally mediated communication. Because social cues
exchange of messages), making it a hybrid modality with distinc- conveyed through prosody, facial expression, and physical
tive characteristics (Crystal [2001] 2006). Moreover, the personal appearance of message senders are iltered out in text-based
accessibility and wide public reach of the Internet have led some CMC, many early scholars believed that digitally mediated com-
to characterize it as fundamentally transformative of human munication was depersonalized and that users identities were
communication, a revolution as profound as that triggered by masked or irrelevant. his was thought to give rise to laming or
the printing press. hostile language (and antisocial behavior, in general); play with
At the same time, the novelty of digital language should not identity and liberatory (or inauthentic, depending on ones per-
be overstated. It is often possible to trace the roots of so-called spective) online self-presentations; and compensatory linguistic
emergent or digitally native CMC genres (Crowston and Williams strategies, such as creative spellings and emoticons (faces made
2001) to older written and oral genres. An example is the blog, out of ascii characters), in order to enhance ones social pres-
which, while arguably a historically unprecedented hybrid of ence and signal ones intentions. hese linguistic strategies have
personal, interpersonal, and mass communication, manifests been referred to as textspeak by David Crystal ([2001] 2006; for
continuities with handwritten diaries, phone calls to friends and examples, see Figure 1).
family, project logs, and letters to the editor. Ultimately, what may Alternative perspectives have also been advanced on these
be most unique about digital media is their tendency to support phenomena, however. True anonymity is infrequent, since most
a convergence of language features, genres of communication, people who communicate digitally use consistent identiiers, and
258
Digital Media
HK English: Hee hee . . . dunno why I always like to send u mails ar! Part is
becoz I wanna keep contact with u la!
French: Ca sera donc tjs 1 plaisir 2te revoir! :-) [So it will always be a pleasure to
Figure 1. Examples of textspeak (bolded) in
meet you again :-)] Hong Kong English, French, Romanized Arabic,
Arabic: w 3laikom essalaaam asoomah ^_^ [Hi there, Asoomah ^_^] and Japanese (from Danet and Herring 2007).
Japanese: (*^ ^*) [Congratulations on your comeback
(as if singing) That was good (*^ ^*)]
in the case of private communication (e.g., via e-mail, instant new morphological formatives such as e- and cyber- into the
messaging, or short message service [SMS]), the communica- English language; however, there is less evidence that digital
tors usually already know one another. Flaming may be better media are associated with syntactic changes, which typically take
explained by the lack of accountability characteristic of public place more slowly. he fears of some educators and journalists
Internet forums than by anonymity per se, given that many hos- that digital communication is accelerating language decline and
tile messages are sent by people with known identities. Play with interfering with childrens learning of standard written language
identity, while fashionable in some chat environments, occurs appear to have no basis in empirical fact (hurlow 2006).
less often in practice than was implied by early theorists, in part Digital media also have global implications for cross-cultural
due to the diiculty of maintaining a false identity over time. communication, multilingualism and language choice, and the
Recent years have also seen an increasing tendency for people to status of individual languages. Although still a small percentage
post photographs of themselves, for example, on social network- of the worlds languages, those used on the Internet are grow-
ing sites although false and digitally modiied photos can, of ing in number. Figure 1 gives examples of textspeak in four
course, be posted. Finally, textspeak is also shaped by the impe- languages.
tus to type quickly, especially in real-time message exchanges, here is debate, however, as to whether linguistic diversity
resulting in creative, often abbreviated, spellings. Nonetheless, equal to that in the oline world will eventually be achieved,
it remains the case that digital media aford new and increased or whether digital media are promoting and accelerating the
opportunities for selectively crafting ones self-presentation, dominance of English and other large languages. Evidence from
both linguistically and visually, and for deceptive communica- multilingual contact situations, such as cross-national Internet
tion to take place. discussion forums, suggests that English or the regional language
he scope and spread of digitally mediated communication, (e.g., Spanish, German, Russian) tends to be used as a lingua
both globally and over time, give rise to other language-related franca in order to ensure the widest comprehension; this trend
issues. Digital media enable unprecedented large-scale con- bodes ill for the use of minority languages in such forums. At
versations (e.g., in public discussion forums) and provide vast, the same time, many Internet forums have national rather than
potentially interactive audiences (e.g., for websites and blogs) in international audiences, and localization eforts are produc-
which many participants are unknown to one another and paring hardware and software in local languages. Some speculate
ticipation is open to a wide spectrum of society. Conversations that these trends are leading toward a global diglossia, with
involving hundreds (or thousands) of people raise new chal- English as the High (international) variety and local languages
lenges for maintaining interactional coherence, and unknown as the low, or colloquial, variety. he Internet has also been used
audiences constitute new kinds of addressees when the broad- with some success as a tool to support revitalization eforts for
cast content is personal, as is the case for many blogs. As ordi- endangered languages (Danet and Herring 2007).
nary language users come to grips with these challenges, new
media-speciic norms are emerging, much as people a century Current State of Research
ago evolved new interactional and pragmatic norms for speaking From the outset, scholarship on digital media was broadly inter-
over the telephone. disciplinary. In the irst two decades of CMC research, scholars
he Internet enables new kinds of social formations to arise trained in communication, rhetoric, social psychology, manage-
known as virtual communities which often develop character- ment, linguistics, humancomputer interaction, anthropology,
istic communicative practices; these, in turn, may spread. New and education came together in interdisciplinary fora to try to
lexical items, as well as textspeak features, have difused rap- meet the challenge of characterizing online communication,
idly across the Internet and have become integrated to varying and in recent years, new interdisciplinary ields have arisen in
degrees into everyday speech and writing, especially those of which digital media play a central role, such as new media studies
young people, giving rise to the claim that digital media are accel- and social informatics. At the same time, there is a trend toward
erating processes of language change. his includes introducing increasing disciplinary specialization, as new media become
259
Diglossia
accepted into mainstream disciplinary approaches. In language to be more standardized than the L variety: Grammars and
studies, new media currently provide application domains (e.g., dictionaries are written for the H variety, but not usually for the
for language learning) and sources of data for empirical analysis L variety. Fifth, while the L variety is the language of the home,
and, increasingly, for theorizing about language from cognitive, the H variety is not spoken natively by anyone in the commu-
social, and evolutionary perspectives. nity and has to be learned through schooling. Finally, although
the L variety may gradually replace the H variety due to such
Susan C. Herring
factors as more widespread literacy and broader communica-
WORKS CITED AND SUGGESTIONS FOR FURTHER READING tion among diferent social groups, a diglossic situation usually
persists for centuries or even millennia.
Bijker, Wiebe, and John Law, eds. 1992. Shaping Technology/Building
Diglossic situations are diferent from other commonly
Society: Studies in Sociotechnical Change. Cambridge, MA: MIT Press.
found language situations in several respects. In contrast to
Crowston, Kevin, and Marie Williams. 2001. Reproduced and emergent
genres of communication on the World-Wide Web. he Information
diglossic situations, many bilingual situations do not main-
Society 16.3: 20116. tain a clear functional compartmentalization of the two varieties
Crystal, David. [2001] 2006. Language and the Internet. (see bilingualism ). In Arabic-speaking countries, colloquial
Cambridge: Cambridge University Press. Arabic serves as the basic medium of interaction, but modern
Danet, Brenda, and Susan C. Herring, eds. 2007. he Multilingual standard Arabic is the preferred variety for formal purposes.
Internet: Language, Culture, and Communication Online. New However, in a bilingual community such as Flemish- and
York: Oxford University Press. French-speaking Belgium, both varieties are used to perform
Herring, Susan C. 1999. Interactional coherence in CMC. Journal of similar functions in formal and informal domains. Diglossic
Computer-Mediated Communication 4.4. Available online at: http:
situations are also diferent from standard-with-dialects situ-
//jcmc.indiana.edu/vol4/issue4/herring.html.
ations. In the German-speaking regions of Switzerland, the H
. 2004. Slouching toward the ordinary: Current trends in computer-
variety (Hochdeutsch) is learned through formal schooling and
mediated communication. New Media & Society 6.1: 2636.
. 2007. A faceted classiication scheme for computer-mediated is not used as the medium of everyday interaction. On the other
discourse. Language@Internet. Available online at: http://www. hand, in Italy (a standard-with-dialects situation), many people
languageatinternet.de/articles/761. speak standard Italian natively and use it in formal as well as
hurlow, Crispin. 2006. From statistical panic to moral panic: he informal settings.
metadiscursive construction and popular exaggeration of new Over the years, numerous scholars have reworked Fergusons
media language in the print media. Journal of Computer-Mediated deinition of diglossia. While maintaining the criterion of strict
Communication 11.3: article 1. Available online at: http://jcmc.indi- functional compartmentalization, Joshua Fishman (1967)
ana.edu/vol11/issue3/thurlow.html. broadened the deinition of diglossia to include genetically unre-
lated varieties. According to this broad deinition, Spanish- and
DIGLOSSIA Guaran-speaking Paraguay would be classiied as a diglossic
community, in that the two genetically unrelated varieties func-
In his seminal article, Charles Ferguson (1959, 435) deined
tion like H and L varieties in diglossic situations. However, some
diglossia as
have criticized this deinition as diluting the original meanings of
a relatively stable language situation in which, in addition to the diglossia. Although the Spanish-Guaran situation resembles the
primary dialects of the language (which may include a standard diglossic situation in the Arabic-speaking world, the two difer in
or regional standards), there is a very divergent, highly codiied their social origin and course of development. While the former
(often grammatically more complex) superposed variety, the came into being through the conluence of two sociolinguistic
vehicle of a large and respected body of written literature, either traditions as a result of colonial contact, the latter was derived
of an earlier period or in another speech community, which is from the internal functional diferentiation within a single
learned largely by formal education and is used for most written sociolinguistic tradition (see colonialism and language).
and formal spoken purposes but is not used by any section of the Furthermore, when language shift occurs in a bilingual com-
community for ordinary conversation. munity, it is usually the H variety that replaces the L variety.
Using the examples of Greek, Arabic, Haitian Creole, and Swiss In contrast, in the terminal stages of Fergusonian diglossia,
German, Ferguson discussed several characteristics common the L variety often displaces the H variety. More recently, Alan
across diglossic situations. First of all, there is a strict division Hudson (2002) argued that the absence of native H speakers
of labor between the two varieties: he superposed variety or distinguishes diglossia from bilingual situations like the one in
the H(igh) variety is used mostly in prestigious domains (e.g., Paraguay. his characteristic, Hudson maintains, enhances the
education; see prestige ), and the vernacular or the L(ow) stability of diglossia. Without a prestigious community of native
variety is restricted to informal domains (e.g., neighborhood). H speakers, L speakers lack the motivation to adopt H for every-
Second, although the two varieties are genetically related, the day communication.
H variety is structurally more complex than the L variety (e.g., Another point of contention is the discreteness of H and L. In
the H variety has more overt case markers than the L vari- many diglossic communities, there exists a continuum of forms
ety). hird, the H variety is more highly valued than the L vari- between the H and L varieties. In addition, speakers sometimes
ety: While there is a sizable body of literature written in the H mix H and L in the same functional domain and even in the same
variety, the L variety is rarely used in the written form except utterance. In the Arabic-speaking world, speakers sometimes
in dialect poetry and advertising. Fourth, the H variety tends engage in diglossic switching (Walters 2003). In this case, one
260
Discourse Analysis (Foucaultian)
variety (i.e., the matrix) provides the frame for an utterance, or discourses, with which the second and third deinitions are
while the other supplies lexical items that are inserted into the concerned. he second deinition that he gives an individu-
frame. In formal interviews, speakers may use modern standard alizable group of statements is one that is used by Foucault
Arabic as the matrix but draw on lexical items from the L vari- when he is discussing particular structures within discourse;
ety. In other cases, the L variety serves as the matrix. his may thus, he is concerned to be able to identify discourses, that is,
occur when Arabs who speak diferent Arabic varieties interact groups of utterances that seem to be regulated in some way and
with one another. hey use mostly their own varieties but with to have a coherence and a force to them in common. Within
lexical items from Modern Standard Arabic and other spoken this deinition, therefore, it would be possible to talk about a dis-
Arabic varieties. A closer look at diglossic switching is warranted course of femininity, a discourse of imperialism, and so on. His
because it may yield important insights into the nature of code third deinition of discourse is perhaps the one that has most res-
mixture in diglossic communities. onance for many theorists: a regulated practice which accounts
In his inal major statement on the subject, Ferguson (1991) for a number of statements. I take this to mean that he is inter-
lamented the fact that studies in the last few decades have focused ested less in the actual utterances/texts that are produced than
mostly on individual cases and examined whether or not they in the rules and structures that produce particular utterances
are instances of diglossia. He called for more cross-community and texts. It is this rule-governed nature of discourse that is of
studies that investigate the origins and developments of diferent primary importance.
diglossic situations, as well as research that examines diglossic Within most discourse theorists work, these deinitions are
situations during rapid social change. used sometimes almost interchangeably. One of the most pro-
ductive ways of thinking about discourse is not as a group of
Andrew Wong
signs or a stretch of text but as practices that systematically form
the objects of which they speak (Foucault [1969] 1972, 49). In
this sense, a discourse is something that produces something
Ferguson, Charles. 1959. Diglossia. Word 15: 32540. else (an utterance, a concept, an efect), rather than something
. 1991. Diglossia revisited. Southwest Journal of Linguistics that exists in and of itself and can be analyzed in isolation. A dis-
10.1: 21434. course is generally something that is airmed by an institution
Fishman, Joshua. 1967. Bilingualism with and without diglossia; diglossia
and, therefore, constitutes an intervention in power relations. A
with and without bilingualism. Journal of Social Issues 23.2: 2938.
discursive structure can be detected because of the systematicity
Hudson, Alan 2002. Outline of a theory of diglossia. International
Journal of the Sociology of Language 157: 148.
of the ideas, opinions, concepts, ways of thinking, and behaving
Walters, Keith 2003. Fergis prescience: he changing nature of diglos- that are formed within a particular context, and because of the
sia in Tunisia. International Journal of the Sociology of Language efects of those ways of thinking and behaving.
163: 77109. he theorists who have drawn on Foucaults work on dis-
course most extensively to develop a form of discourse analysis
have been critical discourse analysts. hey have tried to
DISCOURSE ANALYSIS (FOUCAULTIAN)
develop a form of linguistic analysis of texts that is openly politi-
he type of analysis of discourse developed by Michel Foucault cal and, therefore, draws on a more social model of discourse
has a complex history, and the term discourse itself is used in a than conventional linguistics generally does (Fairclough 1992;
range of ways by diferent theorists. Even Foucault himself draws hornborrow 2002; Wodak 1998; see Mills 2004 for a fuller dis-
attention to the diiculty of ixing on a particular type of analy- cussion). Very often, critical discourse analysts examine texts
sis that follows from his deinition of the term. As he comments, and utterances that seem to display extreme power diferentia-
Instead of gradually reducing the rather luctuating meaning of tion (see inequality, linguistic and communicative),
the word discourse, I believe I have in fact added to its mean- and they draw attention to some of the more troubling aspects
ings: treating it sometimes as the general domain of all state- of these texts in order to bring about change at a discoursal level
ments, sometimes as an individualizable group of statements, but also, more importantly, at a material level. For them, as for
and sometimes as a regulated practice that accounts for a num- Foucault, discourse is crucial for constructing a social identity
ber of statements ([1969] 1972, 80). and for resisting or airming the social roles that others construct
his quotation is crucial for understanding the range of mean- for us. By becoming aware of the systemic nature of some of the
ings that the term has accrued to itself within Foucaults work ways in which institutions position individuals through discourse,
and within that of other discourse theorists and, consequently, it is possible to challenge them and construct alternative modes
crucial to the type of analysis of discourse that they under- of representation. hese theorists often fuse linguistic analysis,
take. he irst deinition that Foucault gives is the most general such as systemic linguistics or conversation analysis, with
one: the general domain of all statements; that is, all utterances a more Foucaultian analysis of discourse. (Other theorists, such
or texts that have meaning and which have some efects in the as D. Smith 1990, use Foucaults work on discourse in a more
real world count as discourse. his is a broad deinition and is thoroughly social or cultural analysis, without focusing on lan-
generally used by Foucault in this way, particularly in his earlier guage as such they would be considered discourse theorists,
more structuralist work, such as Archaeology of Knowledge rather than discourse analysts). However, some theorists, such
([1969] 1972), when he is discussing the concept of discourse as J. Blommaert (2004), are critical of the use of Foucaults work
at a theoretical level. It may be useful to consider this usage within a broadly linguistic analysis, as for him, this constitutes a
to be more about discourse in general than about a discourse distortion of Foucaults overall project.
261
Discourse Analysis (Foucaultian) Discourse Analysis (Linguistic)
N. Fairclough (1992) draws on Foucaults conception of dis- the strategies used by those who are less powerful but who use lan-
course in order to develop a very systematic type of analysis of guage strategically to achieve what she terms local status, that is,
text. He provides working models and forms of practice from a form of interactional power achieved at a local level.
Foucaults theoretical interventions, together with a description hornborrow challenges a great deal of the work by critical dis-
of the efects of discursive structures on individuals. For him, course analysts who focus solely on the way that language is used
critical discourse analysis is not only concerned to describe dis- to oppress others, the standard example being the way that doc-
cursive structures but also shows how discourse is shaped by tors speak to patients (asking more questions, providing informa-
relations of power and ideologies, and the constructive efects tion, deciding the topic of the interaction, interrupting, and so on).
discourse has upon social identities, social relations and systems Instead, she examines interactions such as that of a woman who
of knowledge and belief, neither of which is normally apparent is being interviewed by police oicers in relation to a rape alle-
to discourse participants (Fairclough 1992, 12). Furthermore, gation that she had made. She focuses on the way that the police
Fairclough uses Foucaults conception of discourse because of oicers try to take control of the interaction by drawing on pow-
the stress that Foucault lays on the constitutive nature of dis- erful language resources, such as interruption. he woman inter-
course the fact that discourse structures the way that we per- viewee, however, does not simply submit to their interruptions but
ceive objects and reality. For Fairclough, critical discourse instead tries to structure the interaction from her own perspective
analysts can unpick commonsense knowledge and views of the and to meet her own needs. hrough the womans interventions in
world that present themselves as self-evident and natural, as all the interview, it is possible to see that she is not simply a victim of
of these types of knowledge will inevitably be profoundly ideo- oppressive linguistic strategies but that she employs a range of dis-
logical. By foregrounding the constructed and ideological nature cursive tools, such as persistently asking questions, to assert her
of this knowledge, it will be possible to suggest ways of seeing right to have her point of view considered.
that are more productive and egalitarian. Other discourse analysts, such as C. Walsh (2001), have
he inluence of Foucault can be seen in the emphasis that focused on the discursive structures that act upon women who
these theorists accord to the workings of power. Generally, enter the public sphere and that categorize the interventions
within critical discourse analysis, there is an emphasis on what of women as feminine or as trivial. She examines the way that
Foucault would term repressive power, that is, a view of power newspapers report on women in positions of power and the fact
relations that stresses the way that individuals are prevented that they often focus on their appearance, their sexuality, and the
from doing what they wish because of other individuals or insti- way they dress, rather than on the work that they do. She also
tutions. However, Foucault stresses that power is not simply examines the way that women are often represented as if they
the imposition of someones will upon another but, rather, that were in the private sphere instead of in the public sphere. She
power should be seen as a network of power relations among focuses on the systematic nature of this type of representation so
all members of a social group. Discourse is a key element in the that it can be seen to be a general trait, rather than a tendency in
working out of power relations since discourse not only marks certain newspapers (see also gender and language).
perceptions of power diference (one displays ones self or posi- he critical and analytical perspective of theorists and ana-
tion within a hierarchy through ones discursive choices) but also lysts such as Fairclough, Blommaert, C. Goodwin, hornborrow,
airms and contests those perceptions or power diferences. and Walsh is a signiicant reinterpretation of Foucaults work
In that sense, individuals engage in power relations even in the through the matrix of linguistics concern for veriiable, repli-
most mundane interactions. For example, within everyday conver- cable analyses.
sation, critical discourse analysts would draw attention to the way
Sara Mills
that only certain people consider it their role to sum up an inter-
action or to comment on the point of an interaction, which is a
very powerful position to construct for oneself through discourse.
J. hornborrows (2002) work draws on Foucaults notions of dis- Blommaert, J. 2004. Discourse. Cambridge: Cambridge University Press.
course and productive power, together with an analytical frame- Fairclough, N. 1992. Discourse and Social Change. London: Polity.
work from conversation analysis, to develop a form of discourse Foucault, M. [1969] 1972. Archaeology of Knowledge. Trans. A. M.
Sheridan Smith. New York: Pantheon.
analysis that focuses on the way power relations are efected within
Goodwin, C. 1994. Professional vision. American Anthropologist
institutions (see also Goodwin 1994). Analysis of discourse, here,
96: 60633.
focuses on the language resources available and the way that Mills, S. 2004. Discourse. 2d ed. London: Routledge.
they are used by those who have institutional power. Rather than Smith, D. 1990 Texts, Facts, and Femininity. London: Routledge.
assuming that certain elements of language are powerful in them- hornborrow, J. 2002. Power Talk: Language and Institutional Discourse.
selves, hornborrow considers that there are certain language Harlow, UK: Longman.
styles and procedures likely to be used by those who are in posi- Walsh, C. 2001. Gender and Discourse: Language and Power in Politics,
tions of power. Some of these styles, such as the use of indirectness the Church and Organisations. Harlow, UK: Longman.
or politeness, which appear to be relatively neutral styles, will Wodak, R. 1998. Disorders of Discourse. Harlow, UK: Longman.
be understood by others within a framework of power relations.
For example, if a manager makes an indirect request to an oice
DISCOURSE ANALYSIS (LINGUISTIC)
worker, that request will be understood as a command rather than
as a simple request. However, in addition to analyzing the strate- Although discourse analysis is variously deined (see examples
gies of those in positions of power, hornborrow also focuses on in the Introductions to Jaworski and Coupland [1999]; Schifrin,
262
Discourse Analysis (Linguistic)
Tannen and Hamilton [2001]; Johnstone [2006]), a generally of information within the text. In addition to function, parts of
accepted linguistic deinition of discourse itself is language above narratives (e.g., abstract, orientation, complicating action, coda)
and beyond the sentence. An advantage of this deinition is that were identiiable by linguistic (syntactic, semantic) properties.
it allows several diferent entry points into linguistic analyses of Likewise, syntactic modiications of a basic X did Y event struc-
discourse. Some discourse analysts, for example, focus on the ture on a clause-by-clause basis convey the subjective meanings
ways in which smaller language units (e.g. noun phrases, clauses, of the narrative the point of the story.
sentences) combine to create a coherent text that makes sense In contrast to the more formal approaches of Harris and Labov,
to others. Other discourse analysts focus on features that help Michael Halliday and R. Hasan (1976) focused on how language
to co-constitute the text. In other words, just as the structures, reveals cohesive connections within a text (see text linguis-
meanings, and functions of a text are continuously projected by tics) so that the reader (or listener) can understand not just the
the combinatory patterns of smaller units, so too are they a result meaning of each sentence but the meanings being conveyed
of those combinatory patterns of smaller units. Such features throughout the entire text. he following text, part of a recipe, is
might include the topic structure of the text or the various rela- annotated with subscripts for cohesive devices: Reference1, rep-
tionships across sentences (such as repetition, lexical colloca- etition2, substitution3, ellipsis4, conjunctions5, and lexical rela-
tions, or conjunctions) that help create cohesion among smaller tions6 provide cohesive ties.
parts. Still other discourse analysts focus on how sequences of
Apple1a pudding1b. First5 you1c peel6 and chop6 the fruit6 (with 1a).
language units (be they clauses or turns at talk) contribute to
hen5 __ 4 sprinkle it3 =1a with sugar and toss with the raisins6. __4
social meanings and functions. Interest in the functions of lan-
Bake the mixture3 (for 1a / 1b) for one hour. You1c + 2 may serve the
guage in social contexts leads to a range of other issues, for exam-
pudding3 (for 1a and 1b) with vanilla ice cream1.
ple, how repeated use of a particular noun, or distribution of
speech, can reproduce power or initiate resistance in social and Whereas the recipe is relatively dense in cohesion (roughly half
political spheres (see, e.g., entries on inequality, linguistic of the words in the excerpt are linked in cohesive ties), other
and communicative; politics of lanugage; gender types of texts may be less dense.
and language). Discourse analyses thus address features of In sum, key works in early discourse analysis focused, rela-
language within text, context, qualities of texts, and how lan- tively separately, on patterns of sentence-internal forms, clause
guage in texts is related to contexts. sequences, and cross-sentence meanings those that arise
After a brief history of key works in early discourse analysis, across sentences. Recent approaches continue the search for
I show how current approaches address phenomena and pro- various types of patterns, but add an interest in how those pat-
cesses of discourse and close with some general principles. terns emerge in texts, often as a dialogic process (see dialogism
and heteroglossia), in relation to context.
Early Approaches to Discourse
Although many linguists in the mid-1900s stopped their analyses Contemporary Approaches to Discourse
at the level of sounds, words, and then sentences, some moved his section illustrates diferent approaches by means of a brief
toward the next level of discourse by examining morphological analysis of two examples: Both are the opening phase of a lon-
patterns across sentences in written texts, the structure of rela- ger discourse, the irst, a classroom, the second, an oral history
tionships across clauses within spoken narrative, and aspects of interview.
language that display connections across sentences.
Zellig Harris (1952) derived procedures for analyzing OPENING A CLASSROOM LESSON. he discourse analyzed in (1)
arrangements of morphemes across sentences by building on is from a ifth grade class during parent visitation day. he class-
the tools of descriptive linguistics. Consistent with models of the room was overlowing with parents squeezed into the crowded
time (predating Chomskys turn to the sentence), Harris took a room. After greeting the parents and students, Mr. Clark (the
bottom-up approach that viewed discourse as the next level in a teacher) proceeded to what we see in the following:
hierarchy of morphemes. His procedure examined morphemes
(1) Mr. Clark: (a) Okay, lets get started.
in terms of their co-occurrence with (or distribution in relation
(b) First well review the problems from last night.
to) other morphemes (or sets of morphemes). Included were not
only actual sequences of equivalent morphemes but also chains In applying the approaches to the language of this short dis-
of morphological equivalencies that were seen as representa- course, we focus on some aspects of text (the information in,
tive of diferent genres or registers. Yet nothing but linguistic structure of, and relationships between successive clauses) and
structure within a given text was included: [T]he analysis of the context (social identities, relationships, and institutions). We
occurrence of elements in the text is applied only in respect to progress from a focus on knowledge (how to communicate infor-
that text alone and not in respect to anything else in the lan- mation and take social action) to the use of the linguistic code
guage (Harris 1952, 1). to convey a variety of situated meanings by people in particu-
William Labov and J. Waletzky (1967) developed a formal lar roles that are embedded within (and sustain) larger cultural
model of oral narrative (see narratives of personal experi- practices and social structures.
ence) that was based on temporal relationships among clauses pragmatics analyzes how we communicate more than
that had diferent functions in the verbalization of experience. the semantic content of language by depending upon our abil-
Later work (Labov 1972) focused not just on formal relation- ity to draw inferences based on such general principles as the
ships among clauses but also on the distribution and function cooperative principle (appropriate quantity, quality,
263
manner, and relevance of information), as well as semantic and continuation, then, Mr. Clarks statement could have been fol-
logical meaning, adjacent features of text, and the social con- lowed by others actions. Indeed, Lets get started and the utter-
text. For example, we can infer to whom Mr. Clark is speak- ance that follows are parts of a particular pair of sequentially
ing and therefore who will get started and review even related actions: hey sequentially implicate another action, the
though he does not explicitly designate their identity. First per- students response to the summons (see adjacency pair).
son plural pronouns include the speaker and someone else, but Mr. Clarks ability to maintain his turn and develop the
the other can be inclusive or exclusive of the hearer. Once we sequence of classroom activities is one way that his role in the
know about the context and shared schematic knowledge of who situation is established and reinforced. Interactional sociolin-
typically does what, when, and where, we can infer that the irst guistics reveals how numerous features of language provide
we includes the students and their parents, but the second we clues to (or indices of) the social situation, activities, participant
includes only the students. identities, and relationships that may actually have a role in cre-
Another aspect of communication that goes beyond what is ating the context of interaction. he use of lets, for example, sug-
literally meant and said actions performed through speech is gests that the speaker has authority over the hearer, thus evoking
the focus of speech-act analysis and theory. Actions can be a situation in which participants have an asymmetrical power
accomplished through language (e.g. requests, promises, warn- relationship (e.g., doctor/patient, parent/child). We can narrow
ings, assertions, thanks) only when speciic conditions (involv- down the nature of the authority and the situation by noting that
ing linguistic knowledge, assumptions about speakers/hearers the activity being started by Mr. Clark (review), and the object of
needs and wants, and background situations) are appropriate to the review (problems), indicates a learning environment or one
the realization of that speciic action. Okay, lets get started, for in which an expert is instructing a novice. hat the problems
example, is a directive, a general class of actions (including com- were from last night reveals a cyclical pattern, a structured rou-
mands, e.g., Begin! and hints, e.g., Its getting late) through which tine often found in formal institutions.
a speaker directs hearer(s) to take a future (not a past) action that Language choices inferences about meaning, actions, roles,
is something the speaker wants, is not likely to be done by the relationships, and participation are all embedded in broader
hearer otherwise, but is within the hearers ability. cultural matrices of recurrent practices, knowledge, and mean-
Linguistic alternatives also appear in how we pronounce, ings, which include beliefs about who should do what and how
select, or arrange our words. Alternatives that maintain semantic they should do so, as well as the evaluations based on larger
meaning are studied by variation analysis. Instead of saying lets values and ideologies (see ideology and language) of
(a) and well, for example, Mr. Clark could use the full forms let us particular outcomes of what is said and done. Ethnography of
and we will. Although these two variants have the same semantic communication elucidates these connections. For example,
meaning, they have diferent possible social meanings. Because the collaborative review of the problems portrays a cultural
full forms provide more explicit information, and allow stress on belief system in which learning and attaining information arises
us and will, they can emphasize the need for particular (possibly when novices work on their own (problems from last night)
reluctant) people to take an undesired action. Lexical variants and then, at a given point in time, present and review their solu-
would have diferent efects. Repetition of go in Lets get going tions with an expert.
and Well go over the problems would create a cohesive tie high- Just as language, inferences, actions, roles, relationships,
lighting the continuity of the actions and grouping them together. and participation are all embedded in culture, so too are they
Another alternative concerns how to organize information intertwined with social processes and structures that sustain (or
and arrange sentences: What should be irst? What are the con- restrict) power and privilege. critical discourse analysis
sequences of diferent orders? Narrative analysis focuses on the (see also discourse analysis [foucaultian]) explores how
organization of information across sentences by analyzing dif- ways of speaking can put those processes into place and reinforce
ferent ways of verbalizing past experiences in textual units that (or challenge) received means of authority. Mr. Clarks ability to
are also attuned to their contexts. One common feature of narra- manage the use of time, select the activity in which to engage,
tives is to present events in temporal order. Although Mr. Clark and organize the way in which information becomes distributed
is not telling a story, he is anticipating the future actions, and as knowledge is consistent with a school setting in which his role
the way he does so relects some of the underlying features of is not challenged. he power created by the institutional setting
narrative: He presents upcoming actions roughly in the order in links his discourse to broader social, cultural, and civic agendas.
which they will occur (rather than saying well review after we get What Mr. Clark says and more fundamentally, his ability to do
started) and highlights the transition from one event to another so thus positions him as one who can reinforce social structural
with language that focuses on the beginnings of activities (get norms (who teaches whom? how? when?) and as an arbiter of the
started). oicial set of values, beliefs, and ideologies that are sanctioned
Sentences appear not just within one persons turn at talk but means of maintaining a stock of received knowledge. And the
also across diferent peoples turns at talk. How social order is fact that he is speaking on a special day that happens only once
constructed through sequences of both grammatical units (sen- a year when parents are permitted to visit en masse to observe
tences) and other units of speech production (e.g., clauses, into- irsthand their childrens education highlights the public and
nation units) on a turn-by-turn basis in talk-in-interaction is a civic function of his role.
major focus of conversation analysis. Mr. Clarks Okay, lets
get started has three features (syntactic closure, inal intonation, OPENING AN ORAL HISTORY INTERVIEW. he discourse analyzed
semantic wholeness) common to turn transitions. Instead of a in (2) is from the beginning of an oral history interview. After
264
Figure 1. List structure.
greeting and introducing the Interviewee (IVee), the Interviewer Interviewers mmhmm (lines 3, 5, 12) and uh huh (line 10)
(IVer) asks a question. Rather than illustrate diferent approaches allow Interviewee to continue to the end of her intonation
to discourse, we use (2) to show how forms, structures, and and information units (line 13).
meanings are co-constructed. Key features are annotated with Interviewees mmhmm (line 3) or yes (line 15) opens a turn
subscripts: question/answer (Q/A) pairs1, turn-taking devices2, that will answer a question.
and the use of and3 to build and indicate topic structure (see Reciprocal uses of mmhmm (lines 16, 17) open an opportu-
Figure 1). In (2), Q and A indicate question and answer, respec- nity for turn exchange;
tively; lowercase letters (e.g., (Qa) indicate the successive Q/A and in initial or medial position in a turn (lines 7, 11) con-
pairs. Dual numbers (e.g., 2/3 in line 2) indicates multiple fea- nects lateral items in the list (as in Figure 1).
tures of organization. Interviewer and Interviewee co-construct a hierarchical topic
structure in which information is organized on the basis of
(2) 1. IVer: 1Q (a)Id like you to tell me a li- something about
lexical relationships (e.g., family, husband) and ad hoc cat-
yourself now.
egories (e.g., things I enjoy).
2. Your family and2/3
3. IVee: Mmhmm2. In sum, key works in early discourse analysis focused, rela-
4. 1AaUh Ive been living in Cleveland for the last 36 years. tively separately, on patterns of sentence-internal forms, clause
5. IVer: mmhmm2 sequences, cross-sentence meanings that arise across sen-
6. I uh at the present time uh I am a housewife, tences. Recent approaches continue the search for various types
7. and3 uh uh occupy myself uh uh sometimes helping my of patterns, but add an interest in how those patterns emerge in
husband with his oice, when needed. texts produced by more than one person in relation to context.
8. IVer: 1Q (b)What does he do?
9. IVee: 1A(b) Hes a podiatrist.
Conclusion
10. IVer: uhhuh2
Discourse analysis provides a range of methodologies that are
11. IVee: 1A (a) And3 uh other times, I pursue, uh really uh
applicable to diferent facets of language in text and context.
umthings that I enjoy um going to the museum, and3
Although we have been able to consider only some components
swimming, and3 uh visiting ill people, and3 uh um spend-
of discourse analysis, our discussion and sample analyses help
ing time uh decorating my home,
us extract several general principles (Schifrin 1994):
12. IVer: Mmhmm2
13. IVee: and3 thats about2/ 1A(a) (1) Analysis of discourse is empirical: Data are based on peo-
14. IVer: 1Q(c) May I ask how old you are? ple using language, not linguists thinking about how people
15. IVee: 1A(c)Yes2, Im sixty years old. use language.
16. IVer: Mmhmm.2 Sixty2 1A(c). (2) Analyses are accountable to the data: hey have to explain
17. IVee: Mmhmm.2 the data in both sequential and distributional terms.
Space prohibits discussion of each feature, but note the (3) Analyses are predictive: hey produce hypotheses that
following: can be falsiied or modiied by other data.
A multifaceted question (line 1) receives answers that occupy (4) Discourse is not just a sequence of linguistic units: Its coher-
several turns, during which short question/answer pairs ence (see coherence, discourse) cannot be understood if
(lines 8 and 9, 14 and 15) are embedded. attention is limited just to linguistic form and meaning.
Question forms (compare lines 1, 14, 18, and 19 to line 8) are (5) Resources for coherence jointly contribute to participant
more complex when they shift topic or level of information. achievement and understanding of what is said, meant, and
265
Discrete Ininity
done. In other words, linguistic forms and meanings work or words) that can be combined into ininitely many possible
together with social and cultural meanings, and interpretive sentences. Marc D. Hauser, Chomsky, and W. Tecumseh Fitch
frameworks, to create discourse. (2002) argue that humans are the only species whose language
(6) he structures, meanings, and actions of everyday spo- is characterized by discrete ininity, and there are debates con-
ken discourse are interactively achieved. cerning the evolution of the property.
he opposition discrete versus continuous applies to sys-
(7) What is said, meant, and done is sequentially situated;
tems, models, domains, and variables. A digital clock represents
that is, utterances are produced and interpreted in the local
time as discrete; an (idealized) analog clock represents time as
contexts of other utterances.
continuous.
(8) How something is said, meant, and done speakers A set is inite if its size (cardinality) is some natural number
selection among diferent linguistic devices as alternative (0,1,2, ), otherwise ininite. he set of all natural numbers and
ways of speaking is guided by relationships among the the set of all points on a line are ininite. It is standardly argued
following: that the set of all possible sentences in English is ininite, even
(a) speaker intentions; though only initely many sentences have ever been uttered. If
(b) conventionalized strategies for making intentions the set were inite, there would be a longest sentence. But from
recognizable; any sentence we can construct a longer one, for instance, by add-
ing an and-clause; so the set is ininite.
(c) the meanings and functions of linguistic forms in rela-
Some nonhuman communication systems (see animal com-
tion to the text and context in which they appear;
munication and human language) are arguably ininite, but
(d) the sequential context of other utterances; because of continuous variation on one or a few parameters, thus
(e) properties of the textual mode, for example, narrative, not discrete. he honeybees waggle dance language involves
description, exposition; continuous variations in tempo, body orientation, and intensity,
(f) the social context, for example, participant identities indicating distance, direction from the hive, and quality of food
and relationships, structure of the situation, the setting; source. If these continuous variables can take any real-number
value within some range, then the resulting language is nondenu-
(g) a cultural framework of beliefs and actions.
merably ininite, a cardinality greater than standardly attributed
When brought together, this set of heuristic tools leads to the to human languages, even though each message has only three
following principle: Our uses of language, and the functions words. In Chomskys terminology it is a nondiscrete ininity.
that it accomplishes, are interactively constructed by people he enterprise of generative grammar aims to account
using language together (e.g., taking turns at talk, drawing infer- for the discrete ininity (and other properties) of language; all
ences about communicative intentionality) and drawing versions of generative grammar employ recursion in some
upon properties of language and its ability to join smaller units form to provide a inite description of a denumerably ininite set
(clauses, sentences) into larger units (texts) that both relect and built from discrete building blocks.
create the social contexts in which they emerge. D. T. Langendoen and Paul M. Postal (1984) argue that sen-
tences need not be inite in length and that the class of sentences
Deborah Schifrin
of a language is nondenumerably ininite. Would this also be a dis-
crete ininity? here is no known deinition of discrete in this context
that would settle the issue (the phrase is rare outside Chomskyan
Halliday, Michael, and R. Hasan. 1976. Cohesion in English. contexts). he conventional answer is no, apparently because
London: Longman. typical examples of nondenumerably ininite sets, such as the set
Harris, Zellig. 1952. Discourse analysis. Language 28: 130. of points on a line, involve continuous domains. But if discrete
Jaworski, Adam, and N. Coupland, eds. 1999. he Discourse Reader.
in discrete ininity is meant to characterize the building blocks
London: Routledge.
of the system, the answer would be yes. Some mathematicians
Johnstone, Barbara. 2006. Discourse Analysis. Oxford: Blackwell.
Labov, William. 1972. he transformation of experience in narrative
think the phrase is unclear and best avoided altogether.
syntax. In Language in the Inner City. Philadelphia: University of Barbara H. Partee
Pennsylvania Press, 35496.
Labov, William, and J. Waletzky. 1967. Narrative analysis. In Essays
on the Verbal and Visual Arts, ed. June Helm. Seattle: University of WORKS CITED AND SUGGESTIONS FOR FURTHER READING
Washington Press, 1244. Chomsky, Noam. 1988. Language and Problems of Knowledge: he
Schifrin, Deborah. 1994. Approaches to Discourse. Oxford; Blackwell. Managua Lectures. Cambridge, MA: MIT Press.
Schifrin, Deborah, D. Tannen, and H. Hamilton, eds. 2001. Handbook of Hauser, Marc D., Noam Chomsky, and W. Tecumseh Fitch. 2002. he
Discourse Analysis. Oxford: Blackwell. faculty of language: What is it, who has it, and how did it evolve?
Science 298: 156979.
DISCRETE INFINITY Langendoen, D. T., and Paul M. Postal. 1984. he Vastness of Natural
Languages. Oxford: Basil Blackwell.
his locution was brought into linguistics by Noam Chomsky Studdert-Kennedy, M., and L. Goldstein. 2003. Launching language: he
(for instance, Chomsky 1988, 170) to characterize the fact that gestural origin of discrete ininity. In Language Evolution, ed. M.
human languages are built up from discrete units (morphemes Christiansen and S. Kirby, 23554. Oxford: Oxford University Press.
266
Disorders of Reading and Writing
DISORDERS OF READING AND WRITING Even when we restrict our focus to the linguistic bases of written
language impairments, it is clear that in both developmental and
literacy is one of our most important cultural tools. It allows us acquired cases, reading and writing can go wrong for a variety of
to communicate our thoughts and ideas across space and across
reasons.
time in a manner that seems as natural and efortless as speak-
ing and listening. It is easy to forget that like cars and musical
Developmental Disorders of Written Language
instruments, writing systems are inventions that need to be
As developmental disorders of written language are disorders of
learned, much as one learns to drive a car or play an instrument.
development, it is important to consider them within the con-
Much has been learned about the reading and writing process by
text ofered by models of typical development (see spelling
studying people who have impairments in reading or writing.
and writing and reading, acquisition of). It is useful to
Disorders of written language can be split into two broad
make a distinction between impairments that afect word-level
camps: developmental disorders and acquired disorders. Recent
processes (recognizing words, spelling) and impairments that
years have seen a tendency to use the term neurodevelopmen-
afect the higher level processes involved in comprehend-
tal disorder, rather than developmental disorder, relecting
ing text and producing written narrative. As children learn
the growing consensus that atypical development is often the
to read, there is generally a strong association between decod-
product of genetic and/or environmental inluences on early
ing (deined as the ability to read a word aloud) and compre-
brain development. In contrast, acquired disorders are a con-
hension: Children who are good at decoding tend to have good
sequence of brain damage, typically caused by disease or head
comprehension, and children who are poor at decoding tend
injury. he diference between developmental and acquired
to have weak comprehension. For some children, however, the
disorders is not analogous to disorders that afect children ver-
two sets of skills develop out of step. In dyslexia, a developmen-
sus adults: Developmental dyslexia is a lifelong condition
tal disorder experienced by 310 percent of children, decoding is
that continues to manifest itself throughout adulthood, and an
slow, efortful, and error prone, yet their actual comprehension
acquired disorder of written language can arise following brain
of what they have read can be impressive (Snowling 2000). In
damage inlicted during childhood. However, there are impor-
contrast, approximately 10 percent of children can be described
tant diferences between acquired and developmental disorders.
as poor comprehenders: Despite having well-developed decod-
As developmental disorders afect developing systems, they
ing skills, they are poor at understanding what they have read
are very rarely sharply deined, and one tends to see associated
(Nation 2005).
deicits across a range of behaviors; in contrast, acquired disor-
ders relect selective damage to what (one assumes) was a fully
DEVELOPMENTAL DYSLEXIA. Developmental dyslexia is typically
working system. Consequently, diferent patients with damage
diagnosed when a child experiences profound diiculty read-
to diferent subsystems can show remarkably diferent and
ing and spelling words, despite normal educational opportunity
remarkably speciic types of reading impairment (Bishop
and normal-range general intelligence. Dyslexia runs in families
1997).
and there is good evidence from behavioral genetics demon-
strating genetic heritability of the disorder. Although genes are
Deining Reading and Writing yet to be identiied, regions of interest have been implicated on
Consider what you are doing as you read a text. Letters and chromosomes 1, 2, 3, 6, 15, and 18, suggesting that patterns of
words are processed visually (see word recognition, inheritance are complex and polymorphic (Nation and Coltheart
visual) at a rate of many items per minute, their forms recog- 2006; Pennington and Olson 2005). It seems likely that genetic
nized and meanings decoded or inferred. Words are only part of factors (in interaction with environmental factors) inluence
the story: Phrases and sentences need to be interpreted, rel- the development of brain areas implicated in the neural cir-
evant background knowledge activated, and inferences gener- cuitry that underpins reading and spelling (Price and McCrory
ated as information is integrated during the course of reading. 2005; see also writing and reading, neurobiology of).
Control processes are needed to monitor both ongoing compre- However, our understanding of how genetic factors inluence
hension and the internal consistency of text, allowing the reader brain development and lead to developmental dyslexia is rela-
to initiate repair strategies (for example, rereading) if compre- tively unspeciied.
hension breakdown is detected. In short, readers need to form a Cognitive explanations of developmental dyslexia are more
mental model of the text they are reading. To some extent, one speciied. It is widely accepted that many people with dyslexia
can think of writing being the reverse of reading, with the writer have underlying impairments in processing phonological
beginning with a conceptual message that he or she wishes to aspects of oral language. According to the phonological dei-
communicate and ending with inkmarks on a page. cit hypothesis (Snowling 2000), children with dyslexia have
Although written language clearly involves visual and motor diiculty representing and processing phonological informa-
processes (identifying letters, scanning text, handwriting), the tion. his leads to diiculties on tasks that tap phonological
cognitive psychology of reading and writing has been most con- processing, including aspects of speech perception and
cerned with an understanding of the language bases of reading speech production and, most notably, diiculties with
and writing. hus, visuo-motor aspects of reading and writing will phonological awareness. In an alphabetic language, at
not be considered here. Instead, we focus on reading and writing least, learning to read and spell places heavy demands on pho-
as linguistic skills, skills that have their routes in our biological nological skills inasmuch as children need to learn to make
endowment for spoken language (see genes and language). ine-grained mappings between phonology and orthography.
267
Children with poor phonological skills ind this process more development of good decoding and word recognition. Clearly,
diicult, as evidenced by the well-replicated inding that people however, adequate decoding and strengths in the phonological
with dyslexia are poor at reading novel words: Young children skills that underpin decoding are not suicient to guarantee ade-
with dyslexia ind nonword reading extraordinarily diicult, and quate comprehension.
even well-compensated adults whose more obvious diiculties A similar dissociation exists between word-level versus
with reading have resolved are slower and often less accurate at higher-level aspects of writing. L. Cragg and K. Nation (2006)
reading nonwords a lasting legacy of their dyslexia. found that poor comprehenders spell at age-appropriate levels.
here is some debate as to whether there are subtypes of However, when asked to write a story from a series of picture
developmental dyslexia. According to the dual-route model prompts, the same children produced narratives that captured
(Coltheart 2005), words can be read via one of two independent less of the story content and contained a less sophisticated story
routes: a sublexical route, mediated by phonological rules dictat- structure. hese indings are consistent with what we know about
ing mappings between graphemes and phonemes, and a lexical poor comprehenders oral language skills, with strengths in pho-
route, mediated by visual-orthographic mappings. he sublexi- nological skills promoting adequate spelling but weaknesses in
cal route is needed to read nonwords, whereas the lexical route is language comprehension constraining the more compositional
needed to read words that do not obey grapheme-phoneme cor- aspects of narrative production.
respondence rules (i.e., exception words, such as yacht, chaos,
and enough). he majority of children with dyslexia have pho- Acquired Disorders of Written Language
nological impairments, and therefore in dual-route terms they Acquired disorders of reading and spelling are observed in
have phonological dyslexia, caused by an impaired sublexical patients with aphasia following stroke, head injury, or pro-
route. In contrast, a small proportion of children with dyslexia gressive brain disease (see brain and language ). Some
have less severe phonological deicits. For these children, typi- patients show deicits that are a consequence of impairments
cally referred to as having developmental surface dyslexia, their in visual processing and letter recognition (e.g., pure alexia;
greatest diiculty is with reading and spelling exception words, Behrmann, Plaut, and Nelson 1998). In line with our discus-
caused by an impaired lexical route. sion of developmental disorders, however, we focus here on
heorists disagree about the validity and stability of the sur- acquired disorders of written language that have their bases in
face subtype. More generally, they debate the need to evoke spoken language. he majority of work on acquired disorders
two separate routes to word reading. An alternative account is of reading and writing has focused on patients ability to read
provided by the triangle model, a connectionist model in single words aloud or spell single words to dictation. Some of
which reading aloud is accomplished via sets of interactive con- this work is reviewed very briely here; rather surprisingly, few
nections between three sets of units: phonological, orthographic, studies have investigated aspects of reading comprehension
and semantic. Unlike the dual-route model, individual difer- and narrative production, although diiculties in discourse-
ences in reading are not a consequence of an impaired lexical level processing have been noted in patients with right hemi-
or sublexical route. Instead, diferences in the quality of repre- sphere brain damage.
sentations (phonological, orthographic, or semantic) or in the
strength of mappings between diferent representations, bring SURFACE AND PHONOLOGICAL DYSLEXIA: TRADITIONAL COGNITIVE
about diferent patterns (or subtypes) of reading behavior (see NEUROPSYCHOLOGY AND THE DUAL-ROUTE MODEL OF READING
Plaut, 2005 for discussion of the triangle model and how it difers ALOUD. he study of patients with acquired dyslexia has played
from the dual-route model; see Snowling 2000 for a discussion of a central role in the ield of cognitive neuropsychology. In
how developmental dyslexia can be accommodated by the tri- particular, the dissociation between patterns of intact and
angle model). impaired behaviors in two types of acquired dyslexia, namely,
surface dyslexia and phonological dyslexia, provided important
POOR COMPREHENDERS. Unlike children with dyslexia, poor support for the dual-route model of reading aloud (Coltheart
comprehenders do not have diiculty with decoding and pro- 2005).
ducing words; however, they are poor at understanding what Patients with surface dyslexia are poor at reading exception
they read. In particular, they are poor at making inferences words (words that have irregular mappings between orthog-
when reading text, and they are less able to integrate informa- raphy and phonology), which they tend to regularize (e.g.,
tion across sentences in order to resolve anomalies (Oakhill and reading pint to rhyme with mint). According to the dual-route
Yuill 1996). It is also clear that poor comprehenders diiculties framework, this is a consequence of damage to the lexical route,
are not restricted to reading comprehension. hey are also poor meaning that patients overrely on the sublexical route; hence,
at listening comprehension and they have relative weaknesses in they produce overregularization to irregular forms. he
oral vocabulary and word knowledge, understanding igurative term phonological dyslexia is used to describe the condition of
language, and with aspects of grammar (Nation et al. 2004). hus, patients who show particular impairments in decoding novel
poor comprehenders diiculties with reading comprehension words. Traditionally, this has been interpreted within a dual-
should be seen against a backdrop of more general diiculties route framework as a consequence of damage to the sublexical
in processing and comprehending language, leading to diicul- route, responsible for translating graphemes to phonemes via
ties in building a mental model of text or discourse. In contrast phonological rules. hus, nonwords tend to be lexicalized, with
to these deicits, poor comprehenders show strengths in phono- patients reading a nonword as if it were a visually similar familiar
logical processing and phonological awareness, facilitating the word (e.g., reading bem as ben).
268
SURFACE AND PHONOLOGICAL DYSLEXIA: THE PRIMARY SYSTEMS Summary and Conclusions
HYPOTHESIS. he dual-route model is a model of the reading sys- Reading and writing are complex processes, and it is clear that
tem, relatively divorced from the underlying cognitive and lin- they may be impaired for a variety of reasons. Written language
guistic skills that subserve reading. An alternative approach is to is parasitic upon spoken language, and, therefore, it is no sur-
consider the extent to which acquired disorders are a consequence prise to ind that oral language weaknesses are associated with
of impairment to one or more of those underlying primary skills impairments of reading and writing in both developmental and
upon which reading is parasitic (for example, phonology, seman- acquired disorders. More speciically, diferent aspects of the oral
tics, and visual processing). his perspective, termed the primary language system (e.g., phonology and semantics) appear to be
systems hypothesis, is reviewed in detail by M. A. Lambon Ralph more or less associated with diferent aspects of reading or writ-
and K. E. Patterson (2005). It draws heavily on connectionist mod- ing failure. A challenge for future work is to understand how oral
els of reading, especially the triangle model described previously. language skills interact with each other and with orthographic
Space precludes a full description of the model (see Plaut 2005 for factors to produce diferent patterns of written language impair-
detailed review), but it difers fundamentally from the dual-route ment. In addition, many challenges remain for understanding
model in a number of key ways. As a model of the language sys- how genetic and environmental risk factors interact to inluence
tem rather than the reading system, it predicts that patients with brain development so as to cause a developmental disorder of
reading problems should also show concomitant weaknesses in reading or writing.
aspects of language processing more generally.
Kate Nation
How does the triangle model account for surface and phono-
logical dyslexia? Surface dyslexia (i.e., poor exception word read-
ing) is proposed to be a consequence of reduced activation from WORKS CITED AND SUGGESTIONS FOR FURTHER READING
semantic representations impacting on the connections between Behrmann, M., D. C. Plaut, and J. Nelson. 1998. A literature review and
phonology and orthography. Lambon Ralph and Patterson new data supporting an interactive account of letter-by-letter reading.
(2005) provide technical details underpinning this proposal, and Cognitive Neuropsychology 15: 751.
explain the balance of evidence supporting it. According to the Bird, H., M. A. Lambon Ralph, M. S. Seidenberg, J. L. McClelland, and
primary systems hypothesis, if surface dyslexia is a consequence K. Patterson. 2003. Deicits in phonology and past tense morphology.
Journal of Memory and Language 48: 50226.
of impaired semantics, then patients should exhibit semantic
Bishop, D. V. M. 1997. Cognitive neuropsychology and develop-
impairments that is, impairments on nonreading tasks that
mental disorders: Uncomfortable bedfellows. Quarterly Journal of
require knowledge of, or access to, word meanings. In support of
Experimental Psychology 50A: 899923.
this, patients with semantic dementia show a variety of seman- Coltheart, M. 2005. Modeling reading: he dual-route approach. In
tic impairments and show a surface dyslexia reading proile Snowling and Hulme 2005, 623.
(Graham, Hodges, and Patterson 1994). Cragg, L., and K. Nation. 2006. Exploring written narrative in chil-
In contrast to the semantic weaknesses considered to under- dren with poor reading comprehension. Educational Psychology
pin surface dyslexia, the triangle model proposes that impair- 21.1: 5572.
ments in phonology underpin phonological dyslexia. Inasmuch Graham, K., J. R. Hodges, and K. E. Patterson. 1994. he relationship
as reading nonwords places heavy demands on the connections between comprehension and oral reading in progressive luent apha-
between phonology and orthography (as nonwords have no sia. Neuropsychologia 32: 299316.
Lambon Ralph, M. A., and K. E. Patterson. 2005. Acquired disorders of
meaning, contributions from semantic knowledge is minimal),
reading. In Snowling and Hulme 2005, 41330.
if patients have weaknesses in the phonological domain, these
Nation, K. 2005. Childrens reading comprehension diiculties. In
should be exhibited as relatively stronger nonword than word
Snowling and Hulme 2005, 24866.
reading deicits exactly the pattern seen in patients with pho- Nation, K., P. Clarke, C. M. Marshall, and M. Durand. 2004. Hidden
nological dyslexia. And, consistent with the primary systems language impairments in children: Parallels between poor reading
hypothesis, patients with acquired phonological dyslexia also comprehension and speciic language impairment. Journal of Speech,
show weaknesses on nonreading tasks that tap phonological Hearing and Language Research 47: 199211.
skills, including word and nonword repetition and phonological Nation, K ., and M. Coltheart, eds. 2006. he genetics of reading.
awareness (Bird et al. 2003). Journal of Research in Reading 29. Special issue containing a number
of articles exploring the heritability of reading and related issues.
ACQUIRED DYSGRAPHIA. Often, patients with acquired dyslexia Oakhill, J. V., and N. Yuill. 1996. Higher order factors in comprehension
show associated impairments in spelling words to dictation disability: Processes and remediation. In Reading Comprehension
(dysgraphia); however, some patients show selective impair- Diiculties, ed C. Cornoldi and J. V. Oakhill, 6992. Mahwah, NJ:
ments in spelling. As with reading impairment, damage to difer- Lawrence Erlbaum.
Pennington, B. F., and R. K. Olson. 2005. Genetics of dyslexia. In
ent aspects of the language system produces diferent patterns
Snowling and Hulme 2005, 45372.
of spelling impairment. Some patients are very poor at using
Plaut, D. C. 2005. Connectionist approaches to reading. In Snowling
spelling-sound conversion rules to spell novel words, similar and Hulme 2005, 2438.
to the pattern of reading behavior seen in patients with phono- Price, C. J., and E. McCrory. 2005. Functional brain imagining studies of
logical dyslexia; others tend to make regularization errors when skilled reading and developmental dyslexia. In Snowling and Hulme
spelling exception words, akin to surface dyslexia (see Romani, 2005, 47396.
Olson, and Di Betta 2005 for discussion of these and other types Romani, C., A. Olson, and A. M. Di Betta. 2005. Spelling disorders. In
of acquired [and developmental] dysgraphias). Snowling and Hulme 2005, 43148.
269
Division of Linguistic Labor Dyslexia
Snowling, M. J. 2000. Dyslexia. Oxford: Blackwell. DYSLEXIA

Snowling, M. J., and C. Hulme, eds. 2005. he Science of Reading.
Oxford: Blackwell. Introduction: What Is Dyslexia?
Dyslexia is a speciic learning diiculty afecting literacy
development. Children and adults with developmental dyslexia
DIVISION OF LINGUISTIC LABOR show diiculties in reading and spelling that are not explica-
According to Hilary Putnams (1975) division of linguistic labor, ble in terms of their age, intelligence, or educational experience.
speakers routinely use terms whose extension (see intension Children with dyslexia typically have marked diiculties in learn-
and extension, reference and extension) they would not ing to read and spell words, though their understanding of what
be able to ix. For example, most of us cannot tell the diference they read may be good. hese diiculties are often accompanied
between gold and fools gold. Nevertheless, we know that the two by diiculties in short-term memory and organization. In adult-
are diferent, and when we use the word gold, we mean to refer to hood, the word-reading diiculties may resolve, but spelling and
the real thing to the material that experts who can distinguish other underlying diiculties remain.
between gold and fools gold call gold. If there is ever a dispute
about whether our use of the word is appropriate, we can consult Behavioral Manifestations of Dyslexia
one of these experts. Using examples like this one, Putnam pro- Reading development depends on two foundation skills, let-
posed that knowledge of word meaning is not a private mental ter-sound knowledge and phonological awareness, the
property. Instead, it is the responsibility and achievement of the ability to identify the small sounds in speech (Byrne 1998). A
collective linguistic community: Metallurgists can ix the extension childs ability to establish mappings between the letter strings
of the word gold, botanists can ix the extension of the word elm, of printed words and these speech sounds (phonemes) allows
and so on. he average speakers use of such terms depends upon printed words to be decoded and is the basis for the acquisition
an implicit structured cooperation between that person and the of later and more automatic reading skills. hus, individual dif-
experts in the relevant domains (see socially distributed ferences in phonological awareness predict diferences in the
cognition). ability of children to learn to read. he most common pattern
Putnam (1975) proposed the division of linguistic labor as of reading deicit in dyslexia in English is poor nonword read-
part of a seminal argument against traditional accounts of word ing, a task that requires the decoding of unfamiliar words. To
meaning. Many of these accounts hold that knowing the mean- some extent, spelling draws on the same processes as decoding;
ing of a word is a function of being in a particular psychologi- however, English words cannot be spelled solely on the basis of
cal state. In fact, he argued, two speakers can share the same sound-letter mapping rules but also require knowledge of gra-
psychological state (neuron for neuron) but mean diferent pho-tactic or morphological rules and sometimes rote learn-
things. For example, imagine two speakers who know exactly ing. hus, for children with dyslexia, spelling poses even more of
the same things about beech and elm trees: hey know that a signiicant challenge than reading.
both are large deciduous trees, but they cannot tell them apart An important issue is whether dyslexia has the same symp-
(for that, they defer to experts). If one speaker uses the word toms in more consistent or transparent languages than English.
beech to refer to an elm and the other uses the word to refer to Findings from a variety of transparent languages show that
a beech, the two speakers share the same psychological state, orthographic consistency of grapheme-phoneme correspon-
but they mean diferent things. As Putnam (1975, 144) famously dences afects the rate at which children acquire reading skills.
put it, Cut the pie any way you like, meanings just aint in the Speciically, when correspondences between letters and pho-
head! nemes are regular, children quickly learn the phonological skills
he notion that speakers do not know much about many of the required for reading and spelling. hus, children with dyslexia
words they use is not controversial. However, there has been vigor- learning to read in transparent orthographies have less serious
ous debate about which words are subject to a division of linguistic diiculties than their English-speaking counterparts; for them,
labor and whether a division of linguistic labor necessarily implies the main behavioral feature of dyslexia is a problem in read-
that meanings aint in the head (see Pessin and Goldberg 1996). ing luency (Caravolas 2005). Conversely, in languages such as
For example, J. Searle (1983) argued that knowing that there are Mandarin Chinese, in which the orthography does not consis-
experts who can be called upon to ix a words extension should be tently signal the corresponding phonology, one might expect
considered part of knowing the meaning of a word. that the relationship between dyslexia and phonological aware-
ness difers again. To date, there has been little research on this
Vikram K. Jaswal
issue (Hanley 2005), but the extant literature suggests that both
phonological and morphological processing skills are associated
with reading diiculties in Chinese.
Pessin, A., and S. Goldberg, eds. 1996. he Twin Earth Chronicles: Twenty
Years of Relection on Hilary Putnams he Meaning of Meaning.
Theories of Dyslexia
New York: M. E. Sharp.
Current theories of dyslexia are cast at either the biological or cog-
Putnam, Hilary. 1975. he Meaning of Meaning. In Minnesota Studies
in the Philosophy of Science. Vol. 7: Language, Mind, and Knowledge.
nitive levels of explanation. he predominant cognitive account
Ed. Keith Gunderson, 13193. Minneapolis: University of Minnesota of dyslexia views the primary cause as a phonological process-
Press. ing impairment (Vellutino et al. 2004). According to this hypoth-
Searle, J. 1983. Intentionality. Cambridge: Cambridge University Press. esis, children with dyslexia have phonological deicits that cause
270
Dyslexia
a wide range of symptoms, not all of which are directly related the preschool years (e.g., Snowling, Gallagher, and Frith 2003).
causally to the reading deicits (e.g., verbal short-term memory hese studies highlight a wide range of diferent literacy out-
problems and word-inding diiculties). As far as is known, such comes. Although many are slow in the early stages of reading,
symptoms are equally common among children learning to read some recover from this slow start to go on to be normal readers,
in all languages. whereas others have persistent problems.
Many other theories of dyslexia accept phonological diicul-
ties as a proximal cause of reading problems but cite more low- NEUROBIOLOGICAL BASES. Most children with speciic reading
level deicits as their distal cause. For example, the automization diiculties do not have any detectable neurological abnormality.
deicit hypothesis (Nicolson and Fawcett 1990) proposes that However, evidence suggests that atypical brain development is
diiculties in the cerebellum in dyslexic children place sim- implicated (Leonard et al. 2001). Other symptoms that co-occur
ilar constraints on learning of all skills, including phonology, with dyslexia may also be important in deining subtypes of dys-
naming abilities and basic motor skills. he proposal of William lexia, and the neuroanatomical markers of diferent forms may
Lovegrove, Frances H. Martin, and Walter L. Slaghuis (1986) that difer.
people with dyslexia have impairments of the magnocellular In addition to studies of brain structure, much recent work
system (the division of the visual system that responds to rapid has focused on functional abnormalities in the brains of people
changes) has also generated much research. Findings are mixed, with dyslexia. Typically, people with dyslexia have been reported
with some studies reporting no evidence of abnormal sensitiv- to show less activation than controls in the left temporal and
ity and others suggesting that group diferences between people parietal lobes (Price and McCrory 2005). However, it remains
with dyslexia and normal readers may be related to uncontrolled unclear whether diferences in brain activation are a sign of
diferences in IQ. Research investigating visual attention prob- some constitutional limitation of brain processing or whether
lems in dyslexia is also inconclusive. they simply relect activation of a persons inability to read words
An inluential hypothesis is that dyslexia stems from a dei- using a phonological approach (a task that uses these language
cit in basic auditory processing. Speciically, a rapid audi- regions).
tory processing deicit found with both speech and nonspeech
sounds would afect the perception of consonants distinguished ENVIRONMENTAL FACTORS. School, home, and broader environ-
by rapid changes in the speech signal, and further, poor speech mental factors contribute to a childs risk of developing reading
perception would afect the development of phonological problems. At the broadest level, reading disorders show social
processing skills (Tallal 2004). Investigation of auditory deicits class diferences, and direct literacy-related activities in the
in dyslexia has extended to such tasks as frequency discrimina- home are also important, though evidence suggests that these
tion, frequency modulation, binaural processing, and backward activities primarily afect reading comprehension via vocabulary
masking. However, as with indings on visual impairments, the growth (Phillips and Lonigan 2005).
literature is replete with conlicting results, and an alternative It is important to note that genes and the environment inter-
suggestion is that the deicit is not a general auditory impairment act, and there is evidence that children with dyslexia tend to
but is speciic to the processing of speech sounds. Investigations avoid reading activities, such that their reading problems may
of speech perception in dyslexia have highlighted subtle impair- become magniied over time. Where parents themselves have lit-
ments, although again there are conlicting results. he lack of eracy problems, home literacy experiences may also be less than
consensus in the ield regarding sensory impairments has led to optimal. In addition, comparisons of children from the same
the proposal that they frequently occur in dyslexia but are not area attending diferent schools have emphasized that school-
causally linked to it (Ramus 2004). Further investigation of this ing can make a substantial diference to reading achievement
complex issue is needed. (Rutter and Maughan 2002). Over time, the cumulative impact
of environmental processes can have a very signiicant efect on
Etiology of Dyslexia reading progress.
GENETIC FACTORS. It has long been known that dyslexia runs in In keeping with the relevance of both genetic and environ-
families; however, because families share genes as well as envi- mental factors, there is currently a move away from single-deicit
ronments, it is important to attempt to disentangle genetic and models toward multifactorial models that explain the nature and
environmental inluences. Twin studies have been helpful in this causes of dyslexia (Pennington 2006).
regard (Pennington and Olson 2005). Most twin studies of read-
ing and reading disability report that both reading and phono- Comorbidity
logical awareness are heritable skills, and thus it can be inferred Dyslexia shows some similarities with specific language
that dyslexia has a genetic basis. Furthermore, molecular genetic impairment, and there is some debate as to whether they
studies have found gene markers of dyslexia as well as some can- should be characterized as the same disorder (Bishop and
didate genes, though it is far from clear what the genetic mecha- Snowling 2004). here is also evidence of comorbidity between
nisms are (Fisher and Francks 2006). dyslexia and various emotional and behavioral problems. Most
It is important to note that the genes implicated in dyslexia strikingly, dyslexia is highly comorbid with attention-deicit
indicate a susceptibility to reading diiculties but not that read- hyperactivity disorder (ADHD) and, in particular, attention dif-
ing problems are fully genetically determined. he interaction iculties (Willcutt and Pennington 2000). Recent research sug-
of diferent skills in determining reading outcomes can be seen gests shared genetic risk factors as a possible cause. Children
in studies of children at family risk of dyslexia followed from with dyslexia also show an increased risk of developing clinically
271
Dyslexia Ellipsis
signiicant emotional diiculties, possibly as a result of their Nicolson, Rod I., and Angela J. Fawcett. 1990. Automaticity a new
reading diiculties (Carroll et al. 2005). framework for dyslexia research. Cognition 35: 15982.
Pennington, Bruce F. 2006. From single to multiple deicit models of
developmental disorders. Cognition 101: 385413.
Reading Intervention Pennington, Bruce F., and Richard K.Olson. 2005. Genetics of dyslexia.
heoretical knowledge of the relationship between phonological In Snowling and Hulme 2005, 45372.
skills and learning to read has led to the development of efective Phillips, Beth M., and Christopher J. Lonigan. 2005. Social correlates of
reading intervention programs that promote phonological skills emergent literacy. In Snowling and Hulme 2005, 17387.
in the context of reading (National Reading Panel, 2000) (see Price, Cathy J., and Eamon McCrory. 2005. Functional brain imaging
teaching reading). Such interventions are efective both for studies of skilled reading and developmental dyslexia. In Snowling
diagnosed dyslexics and for children who are at risk of reading and Hulme 2005, 47396.
problems. An underresearched issue is the problem of children Ramus, Franck. 2004. Neurobiology of dyslexia: A reinterpretation of the
who, despite high quality intervention, do not respond to teach- data. Trends in Neurosciences 27: 7206.
Rutter, Michael, and Barbara Maughan. 2002. School efectiveness ind-
ing and continue to have reading impairments. hese children
ings 19792002. Journal of School Psychology 40: 45175.
are often socially disadvantaged and may show additional emo-
Snowling, Margaret J. 2000. Dyslexia. 2d ed. Oxford: Blackwell.
tional and behavioral diiculties.
Snowling, Maragaret J., Alison Gallagher, and Uta Frith. 2003. Family
risk of dyslexia is continuous: Individual diferences in the precursors
Conclusions of reading skill. Child Development 74: 35873.
Dyslexia is a highly researched developmental disorder. here Snowling, Margaret J., and Charles Hulme, eds. 2005. he Science of
is now clear evidence that diiculties in phonological skills are Reading: A Handbook. Oxford: Blackwell.
Tallal, Paula. 2004. Improving language and literacy is a matter of time.
a major proximal cause of reading diiculties across languages.
Nature Reviews Neuroscience 5: 7218.
here is also evidence that reading is a complex skill inluenced
Vellutino, Frank R., Jack M. Fletcher, Margaret J. Snowling, and Donna
both by genetics and by the environment. However, outstand-
M. Scanlon. 2004. Speciic reading disability (dyslexia): What have
ing issues remain. Notably, models of the disorder are moving we learned in the past four decades? Journal of Child Psychology and
toward a multiple deicit model, and it is unknown which is the Psychiatry 45: 240.
most appropriate support for children who do not respond to Willcutt, Erik, and Bruce Pennington. 2000. Psychiatric co-morbidity
standard phonics-based reading intervention. in children and adolescents with reading disability. Journal of Child
Psychology and Psychiatry 41: 103948.
Margaret J. Snowling and Julia M. Carroll

Bishop, Dorothy V. M., and Margaret J. Snowling. 2004. Developmental
dyslexia and speciic language impairment: Same or diferent?
E
Psychological Bulletin 130: 85888.
Byrne, Brian. 1998. he Foundation of Literacy: he Childs Acquisition of ELLIPSIS
the Alphabetic Principle. Hove, UK: Psychology Press.
Caravolas, Marketa. 2005. he nature and causes of dyslexia in diferent Ellipsis is the nonexpression of some lexical material specii-
languages. In Snowling and Hulme 2005, 33657. cally, a word or words forming a syntactic constituent that
Carroll, Julia M., Barbara Maughan, Robert Goodman, and Howard is needed for the full interpretation of a sentence but is not
Meltzer, 2005. Literacy diiculties and psychiatric disorders: he expressed because it can be recovered from the linguistic or real-
case for comorbidity. Journal of Child Psychology and Psychiatry world context. Under a traditional syntactic deinition of ellip-
46: 52432.
sis, elliptical gaps must be able to be illed with overt material,
Cunningham, Anne, and Keith Stanovich. 1990. Assessing print
thus distinguishing them from other types of gaps, like traces
exposure and orthographic processing skill in children: A quick
of moved constituents. All natural languages permit ellipsis,
measure of reading experience. Journal of Educational Psychology
82: 73340. but they difer with respect to which constituents can be elided
Fisher, Simon E., and Clyde Francks. 2006. Genes, cognition and dys- in which conigurations. Ellipsis falls within the larger ield of
lexia: Learning to read the genome. Trends in Cognitive Sciences reference resolution.
10: 2507. Most studies of ellipsis concentrate on formalizing the licens-
Hanley, J. Richard. 2005. Learning to read in Chinese. In Snowling and ing and recoverability conditions for elided constituents. he
Hulme 2005, 31635. former must account for what makes ellipsis grammatical in
Leonard, Christine M., Mark A. Eckert, Linda J. Lombardino, homas given conigurations whereas the latter concerns the ways in
Oakland, John Kranzler, Cecile M. Mohr, Wayne M. King, and Alan which the meaning of the elided material can be understood
Freeman. 2001. Anatomical risk factors for phonological dyslexia.
from the context. When the meaning of an elided constituent is
Cerebral Cortex 11: 14857.
understood by coreference with a previously introduced linguis-
Lovegrove, William, Frances H. Martin, and Walter L. Slaghuis. 1986.
he theoretical and experimental case for a visual deicit in speciic
tic constituent, that constituent is called the antecedent.
reading disability. Cognitive Neuropsychology 3: 22567. A cross-linguistic sampling of the many types of constituents
National Reading Panel. 2000. Report of the National Reading that are subject to ellipsis includes arguments of a verb (1); head
Panel: Reports of the Subgroups. Washington, DC: National Institute of nouns in noun phrases with an overt quantiier, modiier, and so
Child Health and Human Development Clearing House. on (in [2], laps); main verbs in so-called gapping constructions
272
Embodiment
(in [2], swam); verb phrases selected by an overt auxiliary (3); accessible to consciousness and consists of our awareness of
and main verbs in sentences containing two or more overt argu- our own mental states, our bodies, our environment, and our
ments or adjuncts (4). he elided categories in the examples are physical and social interactions.
indicated by []. Textual antecedents, when present, are shown Scholars opinions about the proper locus of embodiment in
in boldface. cognition and language tend to privilege their own methodolog-
ical preferences. For instance, neuroscientists tend to privilege
1. [] Pomoe mne? [Russian; the subject is elided]
the brain and some peripheral aspects of the nervous system in
[] Help 2.SG.FUTURE meDATIVE
their studies of thought, language, and emotion; anthropologists
Will you help me?
focus on cultural-speciic behaviors and generally explore how
2. Jack swami 20 lapsj and Beth [i] 25 [j]. culture both is written onto bodies and gives cultural meanings
to bodily experiences and behaviors; cognitive linguists, and
3. Greg is almost inished swimming but Bruce has just started
some literary theorists, concentrate on the embodied nature
[].
of linguistic structure and behavior, as well as on the embod-
4. Kuda ty []? [Russian; the main verb is elided] ied nature of speaking/listening and writing/reading; and psy-
whereDIRECTIONAL youNOM chologists tend to study the role of diferent bodily actions on
Where are you going? various cognitive activities. Despite these difering approaches,
many agree that an embodied understanding of mind and lan-
Although ellipsis is generally deined syntactically, syntactic
guage requires attention to all three levels of embodiment and
approaches to the study of ellipsis (e.g., Lobeck 1995) are, by
their interaction.
necessity, partial because ellipsis decisions can be afected by
here is now a large body of linguistic research demonstrat-
nonsyntactic factors like the semantics of the utterance, the
ing that the existence and speciic meanings of many words and
potential for ambiguity, the physical context of the speech situ-
phrases emerged from recurring patterns of bodily experience.
ation, and so on (McShane 2005).
For instance, peoples frequent experiences of taking physical
Certain types of ellipsis, like gapping, either require or are
journeys (i.e., beginning at some source, moving along a path,
promoted by syntactic and/or semantic parallelism.
and reaching some destination) appears to inluence the devel-
Ellipsis is particularly challenging for natural language pro-
opment of metaphorical ways of talking about abstract ideas
cessing (NLP) systems since parsers (see parsing, machine)
and events, such as achieving a personal goal (e.g., I inally
must be able to detect the virtual presence of elided constituents,
am getting close to my Ph.D.) or having diiculties in personal
and language generators must be supplied with rules of ellipsis
relationships (e.g., Our marriage has hit a dead-end street). In
usage that go beyond the relatively broad generalizations found
this way, peoples bodily experiences of taking journeys is meta-
in theoretical treatments.
phorically extended to conceive of many ideas related to LIFE IS
Marjorie J. McShane A JOURNEY (Lakof and Johnson 1999).
Embodied experience may also directly inluence contem-
WORKS CITED AND SUGGESTIONS FOR FURTHER READING porary speakers understandings of many words and phrases.
Lobeck, Anne. 1995. Ellipsis: Functional Heads, Licensing and
Neuroscience research demonstrates that perceptual and motor
Identiication. New York: Oxford University Press. systems are speciically activated during immediate language
McShane, Marjorie J. 2005. A heory of Ellipsis. New York: Oxford processing. hus, areas of motor and premotor cortex associated
University Press. with speciic body parts are activated when people hear language
referring to those body parts. Listening to diferent verbs asso-
ciated with diferent efectors (i.e., mouth/chew, leg/kick,
EMBODIMENT
hand/grab) leads to diferent iring rates in diferent regions of
Embodiment refers to the ways in which persons bodies and motor cortex (i.e., areas responsible for appropriate mouth/leg/
bodily interactions with the world shape their minds, actions, hand motions exhibit greater activation) (Hauk, Johnsrude and
and personal, cultural identities. Embodied accounts of mind Pulvermuller 2004).
and language embrace the idea that human symbols are Psycholinguistic studies also demonstrate the automatic
grounded in recurring patterns of bodily experience, and there- recruitment of perceptual and motor systems in immediate
fore reject traditional dualistic, disembodied views of human language understanding. For instance, people are slower to
cognition and linguistic meaning. he study of embodiment understand a phrase like aim a dart when they irst form a ist
demands recognition that thought and language arise from the than when they shape their hand into a dart-throwing position,
continuous dynamic interactions among brains, bodies, and which suggests that semantic comprehension may engage rel-
the world. here are, in fact, three levels of embodiment that evant motoric processes (Klatzky et al. 1989). People also more
together shape the embodied mind (Lakof and Johnson 1999). quickly understand a statement like grasp the concept when
Neural embodiment concerns the structures that characterize they irst make, or imagine making, a grasping motion than
concepts and cognitive operations at the neurophysiological when no grasping motion is made (Wilson and Gibbs 2007).
level. he cognitive unconscious consists of the rapid, evolu- hus, people need not necessarily inhibit the physical meanings
tionarily given mental operations that structure and make pos- of certain metaphorically used words, like grasp, because these
sible conscious experience, including the understanding and meanings are recruited during the on-line construction of met-
use of language. he phenomenological level is conscious and aphorical meanings, such as when concepts are metaphorically
273
Emergentism
understood as things that can be grasped. Studies also show properties that amount to more than the sum of its parts. he
that people understand metaphorical ictive motion sentences, physical world ofers many examples of this, as Mill observes
such as he road runs along the coast, in terms of implicit, (p. 243):
imaginary sensations of movement implicit in these sentences
he chemical combination of two substances produces, as is well
(Matlock 2004). People are not aware of these simulations, and
known, a third substance with properties diferent from those of
so language processing is not dependent on deliberate thought
either of the two substances separately, or both of them taken
about motion. In general, psycholinguistic studies provide addi-
together. Not a trace of the properties of hydrogen or oxygen is
tional support for the broad claim, also now made in compu-
observable in those of their compound, water.
tational modeling research, known as simulation semantics
(Feldman and Narayanan 2004) that language use is closely tied Mills insight is relevant to the study of so-called complex sys-
to embodied imagination. tems ranging from atoms to the weather whose dynamic
he empirical work in cognitive science on embodiment in nonlinear behavior involves many interacting and intercon-
language and thought (see Gibbs 2006) mirrors other debates in nected parts. (A system is dynamic if it is constantly in lux; it
philosophy and literary studies on the role of embodied imagi- is nonlinear if efects are out of proportion to causes, as when
nation in literary and aesthetic experience. Readers emotional a neglected candle causes a ire that destroys an entire city.
involvement with iction, for instance, may arise from their simu- See self-organizing systems.) However, the question of
lations of themselves as the characters they read about and their whether and to what extent language is an emergent phenom-
ictional actions (Nichols 2006). In this manner, reading may not enon remains controversial.
be an abstract, purely mental process with little engagement of
the bodily imagination but is fundamentally tied to our powers Linguistic Emergentism
to recreate what it must be like to be and move like the people Although it is widely agreed that emergentist approaches to lan-
we are reading about. Debate about this, and about other issues guage necessarily stand in opposition to theories of the language
related to embodiment in thinking and language, is central in faculty that posit an innate universal grammar, other tenets
much contemporary scholarship in the humanities and cogni- of linguistic emergentism are less well deined, and there is no
tive sciences. consensus within the ield as to how precisely the standard prob-
lems of linguistic analysis should be confronted. Nonetheless,
Raymond W. Gibbs, Jr.
the starting point for a substantial portion of emergentist work
seems to involve a commitment to the emergentist thesis for
language:
Feldman, J., and S. Narayanan. 2004. Embodied meaning in a neural
theory of language. Brain and Language 89: 38592. he phenomena of language are best explained by reference to
Gibbs, R. 2006. Embodiment and Cognitive Science. New York: Cambridge more basic nonlinguistic (i.e., nongrammatical) factors and
University Press. their interaction.
Hauk, O., I. Johnsrude, and F. Pulvermuller. 2004. Somatotopic rep-
An appealing tag line for linguistic emergentism comes from
resentation of action words in human motor and premotor cortex.
Neuron 41: 3017.
Elizabeth Bates and Brian MacWhinney (1988, 147): language,
Klatzky, R. L., J. W. Pellegrino, B. P. McCloskey, and S. Doherty. 1989. Can they say, is a new machine built out of old parts. While there
you squeeze a tomato? he role of motor representations in semantic is no general agreement concerning just what those parts might
sensibility judgments. Journal of Memory and Language, 28: 5677. be, the list is relatively short, ranging from features of physiology
Lakof, G., and M. Johnson. 1999. Philosophy in the Flesh: he Embodied and perception, to processing and working memory, to prag-
Mind and Its Challenge to Western hought. New York: Basic Books. matics and social interaction, to properties of the input and of
Matlock, T. 2004. Fictive motion as simulation. Memory & Cognition the learning mechanisms.
32: 13891400. A signiicant amount of emergentist work within linguis-
Nichols, S., ed. 2006. he Architecture of he Imagination: New Essays on tics adopts the techniques of connectionism, an approach to
Pretense, Possibility, And Fiction. Oxford: Oxford University Press.
the study of the mind that seeks to model learning and cogni-
Wilson, N., and R. Gibbs. 2007. Real and imagined body movement
tion in terms of networks of (assumedly) neuron-like units. In its
primes metaphor comprehension. Cognitive Science, 31: 72131.
more extreme forms, connectionism rejects the existence of the
sorts of symbolic representations (including syntactic structure)
EMERGENTISM that have played a central role in explanatory work on human
language. Gary Marcus (1998, 2001) and Kevin R. Gregg (2003)
A signiicant body of linguistic research can be situated in the ofer a critique of this sort of eliminativist program while Paul
philosophical and scientiic tradition known as emergentism. Smolensky (1999) and Mark Steedman (1999) discuss ways to
his entry ofers a brief overview of this work, with a focus on its reconcile it with traditional symbolic approaches to language,
guiding principles and on the proposals it makes concerning the including the possibility that representations might be
nature of human language. abstract, higher-level descriptions that approximate the patterns
of neuronal activation that connectionist approaches seek to
The Emergentist Tradition model.
he roots of emergentism can be traced to the work of John Although connectionist modeling provides a useful way to
Stuart Mill ([1843] 1930), who proposed that a system can have test various predictions about language acquisition, processing,
274
Emergentism
change, and evolution, the eliminativist position is far from uni- rather than subjects, an SRN would no doubt learn just this
versally accepted within emergentism. Symbolic representations sort of pattern, even though it is not found in any known human
of one form or another are evident in the work of many emergen- language.
tists (e.g., Goldberg 1999; Tomasello 2003; OGrady 2001, 2005), here is clearly something missing here. Humans dont just
who nonetheless reject the view that the properties of those rep- learn language; they shape it. Moreover, these two facts are surely
resentations should be attributed to innate grammatical prin- related in some fundamental way, which is why hypotheses about
ciples (see innateness and innatism). how linguistic systems are acquired need to be embedded within
a more comprehensive theory of why those systems (and there-
Language Acquisition fore the input) have the particular properties that they do. here
To date, emergentist work within linguistics has focused most is, simply put, a need for an emergentist theory of grammar.
strongly on the question of how language is acquired (see, e.g.,
the many papers in MacWhinney 1999). he impetus for this Emergentist Approaches to Grammatical Theory
focus stems from opposition to the central claim of grammati- A substantial amount of analytic work has addressed the tradi-
cal nativism, which is that the principles underlying a good deal tional concerns of linguistic analysis, including core phenom-
of linguistic knowledge are underdetermined by experience and ena in the major areas of traditional grammatical theory.
must therefore be innate.
Emergentism is not opposed to nativism per se the fact that SYNTAX. It is possible to identify several strands of emergentist
the brain is innately structured in various ways is not a matter of work on syntax, each devoted to explaining the structural prop-
dispute. However, there is opposition to representational nativ- erties of sentences without reference to inborn grammatical prin-
ism, the view that there is direct innate structuring of particular ciples. Difering views have been put forward by MacWhinney
grammatical principles and constraints (Elman et al. 1996, 369 f; (2005) and William OGrady (2001, 2005), both of whom address
Bates et al. 1998), as implied by many of the proposals associated a series of issues that lie at the heart of contemporary syntactic
with universal grammar. analysis the design of phrase structure, coreference, agree-
Contemporary emergentism often includes a commitment ment, the syntax-phonology interface, and constraints
to explaining linguistic development by reference to the opera- on long-distance dependencies. MacWhinney seeks to explain
tion of simple learning mechanisms (essentially, inductive gen- these phenomena in terms of pragmatics, arguing that grammar
eralization) that extract statistical regularities from experience. emerges from conversation as a way to facilitate accurate track-
It is interesting that there is as yet no consensus as to what form ing and switching of perspective. In contrast, OGrady holds that
the resulting knowledge might take local associations and syntactic phenomena are best understood in terms of the opera-
memorized chunks (Ellis 2002), constructions (Goldberg 1999; tion of a linear, eiciency-driven processor that seeks to reduce
Tomasello 2003), or computational routines (OGrady 2001, the burden on working memory in the course of sentence forma-
2005). In addition, there is variation with respect to the exact tion and interpretation.
relationship that is assumed to hold between learning and rel- Still other work, such as that done within construction
ative frequency in the input. Some work implies a quite direct grammar, seeks to reduce syntax to stored pairings of form and
relationship (e.g., Ellis 2002), but other work suggests something function (constructions). Some of this work has a strong emer-
less direct (e.g., Elman 2002). gentist orientation (e.g., Goldberg 1999; Tomasello 2003), but
Emergentist work on language acquisition often makes use some retains a commitment to universal grammar (Goldberg
of computer modeling to test hypotheses about development. and Jackendof 2004, 563).
Jefrey Elman and his colleagues (e.g., Elman 2002) have been
able to show that a simple recurrent network (SRN) can achieve MORPHOLOGY. Very early connectionist work on morphol-
at least some of the milestones associated with language acqui- ogy called into question the existence of morphological rules
sition in children, including the identiication of category-like and representations, even for phenomena such as regular past
classes of words, the formation of patterns not observed in the tense inlection. Instead, it was suggested, a pattern-associator
input, retreat from overgeneralizations, and the mastery of sub- network learns the relationship between the phonological form
jectverb agreement. (An SRN learns to produce output of its own of stems and that of past tense forms (run~ran, walk~walked,
by processing sentences in its input; it is speciically designed to etc.), gradually establishing associations (connections) of dif-
take note of local co-occurrence relationships or transitional ferent strengths and levels of generality between the two sets of
probabilities given the word X, whats the likelihood that the elements the most general and strongest involving the -ed past
next word will be Y?) tense form. James McClelland and Karalyn Patterson (2002) ofer
Emergentist modeling has yielded impressive results, but it a succinct overview of this perspective.
raises the question of why the particular statistical regularities More recent work has raised important questions about the
exploited by the SRN are in the input in the irst place. In other nature of morphemes in general. A key claim of this research
words, why does language have the particular properties that it is that morphological structure emerges from statistical regulari-
does? Why, for example, are there languages (such as English) in ties in the formmeaning relationships between words. (Hay and
which verbs agree only with subjects, but no language in which Baayen 2005 ofers an excellent review of this research.)
verbs agree only with direct objects? Intriguing experimental work by Jennifer Hay (2003) suggests
Networks provide no answer to this sort of question. In fact, that the internal structure of an aixed word is gradient rather
if presented with data in which verbs agree with direct objects than categorical, relecting its relative frequency compared to
275
Emergentism
that of its base. he words inadequate and inaudible are a case in of English: Because the palatalization process has been sup-
point. Because adequate is more frequent than the aixed form pressed, the [] in words such as [i] she must be interpreted
inadequate, its presence in the derived word is relatively salient, as a sound in its own right, rather than as a process-induced
leading to a high native-speaker rating for structural complex- variant of /s/.
ity. In contrast, inaudible, which is more frequent (and therefore Crucially, this conclusion is drawn without the need for com-
more salient) than audible, receives a low rating for structural parison of minimal pairs or similar distributional analysis; the
complexity. phonemic inventory emerges in response to a much simpler
If this is right, then morphological structure exists but not and more basic phenomenon the suppression of processes
in the categorical form commonly assumed. Rather, what we based on exposure to particular individual words. Boersma
think of as morpheme boundaries emerge to varying degrees (1998) and Hayes, Kirchner, and Steriade (2004) discuss a broad
of strength from the interaction of more basic factors, such as range of other phonological phenomena from an emergentist
frequency, semantic transparency, and even phonotactics. perspective.
(he low-probability sequence in inhumane creates a sharper
morpheme boundary than the high-probability sequence in Concluding Remarks
insincere.) here is currently no comprehensive emergentist theory of lan-
guage or its acquisition, but there are various emergentist-inspired
THE LEXICON. here have been various attempts to develop an research programs devoted to the construction of such a theory.
emergentist approach to the lexicon, which is traditionally seen For the most part, this work is based on the simple thesis that the
as the repository of information about morphemes and words. core properties of language are best understood by reference to
One possibility, suggested by Joan Bybee (1998), among others, the properties of quite general cognitive mechanisms and their
is that the lexicon emerges from the way in which (by hypothesis) interaction with one another and with experience. he viability
the brain responds to and stores linguistic experience by creat- of this idea can and must be measured against its success in con-
ing units whose strength and productivity is determined largely fronting the classic empirical challenges of linguistic analysis
by frequency of occurrence. Some of these units correspond to iguring out how language works and how it is acquired.
words, as in a traditional lexicon, but many are phrases and other William OGrady
larger units of organization, including possibly abstract construc-
tions (see usage-based theory). WORKS CITED AND SUGGESTIONS FOR FURTHER READING
Elman (2005) also argues against a pre-structured lexicon,
proposing instead that lexical knowledge is implicit in the efects Bates, Elizabeth, Jefrey Elman, Mark Johnson, Annette Karmilof-Smith,
Domenico Parisi, Kim Plunkett. 1998. Innateness and emergentism.
that words have on the minds internal states, as represented
In A Companion to Cognitive Science, ed. W. Bechtel and G. Graham,
in the activation patterns created by an SRN. Because an SRN
590601. Oxford: Blackwell.
focuses on co-occurrence relationships (see above), these efects
Bates, Elizabeth, Judith Goodman. 1999. On the emergence of grammar
are modulated by context a words meaning, like its syntactic from the lexicon. In he Emergence of Language, ed. B. MacWhinney,
category, emerges from the contexts in which it is used rather 2979. Mahwah, NJ: Erlbaum.
than from an a priori vocabulary of linguistic primitives. Bates, Elizabeth, Brian MacWhinney. 1988. What is functionalism?
Papers and Reports on Child Language Development 27: 13752.
PHONOLOGY. Pioneering work on emergentist phonology was Boersma, Paul. 1998. Functional Phonology: Formalizing the Interactions
carried out by Patricia Donegan (1985), who noted the unhelp- between Articulatory and Perceptual Drives. he Hague: Holland
fulness to language learners of classic distributional analysis. As Academic Graphics.
she observed, it is implausible to suppose that children record Bybee, Joan. 1998. he emergent lexicon. In Proceedings of the 34th
Regional Meeting of the Chicago Linguistic Society: he Panels, 42135.
sets of phonetic representations in memory and then compare
Chicago: Chicago Linguistic Society. An excellent example of emergen-
them in the hope of determining which phonetic contrasts are
tist thinking about the lexicon.
distinctive and which are predictable from context (see speech Donegan, Patricia. 1985. How learnable is phonology? In Papers on
perception in infants and speech production). Instead, Natural Phonology from Eisenstadt, ed. W. Dressler and L. Tonelli,
Donegan suggests, children begin with a set of processes (nasal- 1931. Padua: Cooperativa Libraria Editoriale Studentesca Patavina.
ization, devoicing, and so forth) that emerge as responses to the Ellis, Nick. 2002. Frequency efects in language processing. Studies in
physical limitations of the human vocal tract and the auditory Second Language Acquisition 24: 14388.
apparatus. (hese limitations are inborn, of course, but are not Elman, Jefrey. 1993. Learning and development in neural networks: he
inherently linguistic in character, despite their linguistic con- importance of starting small. Cognition 48: 7199. A much-cited and
sequences.) A languages phonemic inventory and allophonic widely admired illustration of the value of computational modeling in
patterns then emerge as speciic processes are suppressed in the study of language acquisition.
. 2002. Generalization from sparse input. In Proceedings of the
response to experience.
38th Regional Meeting of the Chicago Linguistic Society, 175200.
A simple example involves the process that palatalizes /s/ in
Chicago: Chicago Linguistic Society. A highly readable summary of
front of a high front vowel, giving the pronunciation [i] for /si/ several important SRN-based studies of language acquisition.
in many languages (e.g., Japanese). A child learning English is . 2004. An alternative view of the mental lexicon. Trends in
forced to suppress this process upon exposure to words such Cognitive Science 8: 3016.
as see, which is pronounced [si], without palatalization. his, in Elman, Jefrey, Elizabeth Bates, Mark Johnson, Annette Karmilof-Smith,
turn, results in the admission of // to the phonemic inventory Domenico Parisi, Kim Plunkett. 1996. In Rethinking Innateness: A
276
Emergent Structure
Connectionist Perspective on Development. Cambridge, MA: MIT checkout lines. he term emergent refers especially to an open-
Press. ended process in which systematicity is partial and incomplete
Goldberg, Adele. 1999. he emergence of the semantics of argu- and in which a system is in a constant course of (re)formation.
ment structure constructions. In he Emergence of Language, ed. B.
In the study of language, the expression emergent grammar
MacWhinney, 197212. Mahwah, NJ: Erlbaum.
was coined by Paul Hopper (1987) as a methodological pro-
Goldberg, Adele, and Ray Jackendof. 2004. he English resultative as a
posal for approaching the relationship between grammar and
family of constructions. Language 80: 53268.
Gregg, Kevin R. 2003. he state of emergentism in second language
the local structure of natural discourse. Logically prior, ixed
acquisition. Second Language Research 19: 4275. grammar was, Hopper argued, inconsistent with the kinds of ad
Hay, Jennifer.and 2003. Causes and Consequences of Word Structure. New hoc linguistic decisions made by speakers. he notion of emer-
York: Routledge. gent grammar inverts the standard picture of grammar, as well
Hay, Jennifer, and R. Harald Baayen. 2005. Shifting paradigms: Gradient as the generally accepted logical priority of structure over text.
structure in morphology. Trends in Cognitive Science 9: 3428. An Linguistic structure is thus to be seen as a product of, rather than
excellent survey of work on emergentist morphology. a prerequisite to, discourse. Since discourse is ongoing, structure
Hayes, Bruce, Robert Kirchner, and Donca Steriade, eds. 2004. Phone- is emergent, that is, continually in a process of formation accord-
tically Based Phonology. Cambridge: Cambridge University Press.
ing to the current needs of the interaction. (See Weber 1997 for
MacWhinney, Brian. 1998. Models of the emergence of language.
further discussion.)
Annual Review of Psychology 49: 199227.
he database for the study of language from this perspec-
. 2002. Language emergence. In An Integrated View of Language
Development: Papers in Honor of Henning Wode, ed. P. Burmeister, tive is a corpus of transcribed texts, usually oral, and, recently,
T. Piske, and A. Rohde, 1742. Trier, Germany: Wissenshaftliche Verlag. conversational (Ochs, Scheglof, and hompson 1996). In this
. 2004. A multiple process solution to the logical problem of lan- respect, too, emergent grammar difers from structural and cog-
guage acquisition. Journal of Child Language 31: 883914. nitive grammar, in which conclusions are normally based on
. 2005. he emergence of grammar from perspective. In Grounding isolated constructed sentences. he explanation for grammar,
Cognition: he Role of Perception and Action in Memory, Language and according to this theory, lies in frequency (Bybee and Hopper
hinking, ed D. Pecher and R. Zwaan, 198223. Cambridge: Cambridge 2001) and the associated routinization of forms (Haiman 1994).
University Press. High-frequency forms tend to become phonetically reduced
MacWhinney, Brian, ed. 1999. he Emergence of Language. Mahwah,
and to be restructured (Bybee 2001). Typical examples are the
English pronoun+modal sequences like Ill, youre, weve, and so
Marcus, Gary. 1998. Rethinking eliminative connectionism. Cognitive
on. Emergent grammar is thus relevant to the more general study
Psychology 37: 24382.
. 2001. he Algebraic Mind. Cambridge, MA: MIT Press. of grammaticalization.
McClelland, James, and Karalyn Patterson. 2002. Rules or connections Incipient structure that is, looking backward at the histor-
in past-tense inlection: What does the evidence rule out? Trends in ical origins of a structured system or forward to the predicted
Cognitive Science 6: 46572. An update and survey of connectionist course of events leading to a structured system, as in the study
work on inlection. of irst language acquisition and of most varieties of cogni-
Mill, John Stuart. [1843] 1930. A System of Logic Ratiocinative and tive linguistics is more properly described as emerging
Inductive. London: Longmans, Green, and Co. than as emergent. he noun emergence is ambiguous in this
OGrady, William. 2001. An emergentist approach to syntax. Available respect.
online at: http://www.ling.hawaii.edu/faculty/ogrady/. A summary of
the detailed arguments for an emergentist theory of syntax found in Paul J. Hopper
OGrady (2005).
. 2005. Syntactic Carpentry: An Emergentist Approach to Syntax. WORKS CITED AND SUGGESTIONS FOR FURTHER READING
Mahwah, NJ: Erlbaum. An emergentist approach to syntax that seeks
Bybee, Joan. 1998. he emergent lexicon. Papers of the Annual Meeting
an understanding of many of the classic problems of syntactic theory
of the Chicago Linguistic Society 34: 42135.
in terms of processing.
. 2001. Phonology and Language Use. Cambridge: Cambridge
Palmer-Brown, Dominic, and Jonathan Tepper, Heather Powell. 2002.
University Press.
Connectionist natural language parsing. Trends in Cognitive Science
Bybee, Joan, and Paul Hopper, eds. 2001. Frequency and the Emergence of
6: 43742.
Linguistic Structure. Amsterdam: Benjamins.
Smolensky, Paul. 1999. Grammarbased connectionist approaches to
Haiman, John. 1994. Ritualization and the development of language.
language. Cognitive Science 23: 589613.
In Perspectives on Grammaticalization, ed. William Pagliuca, 328.
Steedman, Mark. 1999. Connectionist sentence processing in perspec-
tive. Cognitive Science 23: 61534.
Hopper, Paul. 1987. Emergent grammar. Papers of the Annual Meeting
Tomasello, Michael. 2003. Constructing a Language: A Usage-Based
of the Berkeley Linguistic Society 13: 13957.
heory of Language Acquisition. Cambridge: Harvard University
Lewin, Roger. 1992. Complexity: Life at the Edge of Chaos. New
Press. A widely cited example of a usage-based approach to language
York: Macmillan. A readable account of chaos theory and the emer-
acquisition.
gence of structured systems.
Ochs, Elinor, Emanuel Schegloff, and Sandra Thompson, eds.1996.
EMERGENT STRUCTURE Interaction and Grammar. Cambridge: Cambridge University
Press.
his term is used in many ields, including scientiic and social Weber, hilo. 1997. he emergence of linguistic structure: Paul
disciplines, where it has been applied to a variety of adaptive Hoppers emergent grammar hypothesis revisited. Language Sciences
self-organizing systems from termite mounds to grocery 19.2: 17796.
277
Emotion and Language
EMOTION AND LANGUAGE are gender, age, psychological history, personality factors, social
context, political and religious ailiation, and cultural factors
A vast domain of research on emotion and language cuts across
(see culture and language), all of which powerfully inlu-
many disciplines, methodologies, and theoretical frameworks.
ence peoples perception of emotion-linked words (Jay 2000).
To render this topic coherent and manageable, we focus on the
current resurgence of research on emotional words. Emotional
SELF-REPORT AND FIELD STUDIES. Field studies of taboo word use
words (e.g., lower, shit) contrast with connotatively neutral
indicate that emotional language is learned early and persists
words (e.g., toaster, being) and include subcategories such as
well into old age (Jay 2000). Self-report studies suggest that pun-
taboo words (insults, scatological references, and swearing or
ishment for cursing fails to alter the actual likelihood of swear-
curse words), threatening words (e.g., negative valence words
ing but nevertheless serves a function because the same people
referring to menacing situations such as murder and abuse),
admit that they would punish their own children for cursing (Jay,
and some emotion words (e.g., terror, disgust). In a con-
King, and Duncan 2006).
tinuum of vocal emotional expression ranging from nonverbal
(e.g., screams) to abstract verbal (e.g., igurative language; see
NEUROPSYCHOLOGICAL STUDIES. Neuropsychological studies
idioms, irony, metaphor, verbal humor), T. B. Jay (2003) have focused on two primary dimensions of emotions: arousal
argues, taboo words constitute the strongest form of emotional
(excitement) and valence (positivenegative). A primary neu-
language: Taboo words are more arousing than igurative lan-
ropsychological measure of arousal and unconscious auto-
guage and yield reliable and robust emotional efects more often
nomic activity is the skin conductance response (SCR; see,
than do threatening words.
e.g., LaBar and Phelps 1998). For emotional words presented
We review research on emotional words from historical,
to bilinguals, the SCR decreases as a function of the order in
methodological, and theoretical perspectives.
which a language is learned (Harris, Aycicegi, and Gleason
2003). he SCR also varies with the estimated emotional force
Historical Perspectives of aversive words (Dewaele 2004) and occurs even when
Historical perspectives illustrate the multiple domains and meth- presentation times are too brief for word identiication (Silvert
odologies of research on emotional words. In the mid-1800s, et al. 2004).
neuropsychological case studies of Hughlings Jackson (1958) amygdala activity also indexes arousal: hreatening words
and others helped shape current ideas concerning automatic trigger increased amygdalar activation (Isenberg et al. 1999),
or uncontrollable production of emotional words (see, e.g., Van and amygdalar damage impairs recognition of arousal but not
Lancker 1987). Carl Jungs (1910) work with emotional words valence characteristics of emotional words (Adolphs, Russell,
in free association tasks also shaped procedures for diagnos- and Tranel 1999; see also Lewis et al. 2007 for the role of other
ing clinical disorders such as schizophrenia (see also psycho- subcortical structures in arousal). Some cortical and subcorti-
analysis and language). From 1950 to 1975, experimental cal areas respond only to valence, some respond only to arousal,
psychologists used classical conditioning concepts to analyze and some respond to an interaction of valence and arousal,
the learning of emotional words (e.g., Staats 1968) and adopted particularly when valence is negative (Lewis et al. in press).
perceptual defense paradigms to determine whether ego-pro- Finally, some cortical areas respond to valence per se, while
tective processes shield threatening stimuli (taboo words) from others respond selectively to either positive or negative valence
awareness (e.g., Dixon 1971). However, both lines of research (Maddock, Garrett, and Buonocore 2003).
were largely abandoned: perceptual defense because of method- Relative activity in the left hemisphere (LH) versus right
ological laws and the learning of emotional words because com- hemisphere (RH) also indexes emotional processing, albeit
puter metaphors dominated the study of language and cognition less consistently across studies, and the nature and scope of
and downplayed emotion during the period of 1975 to 1990 emotion-linked processing in the RH is an ongoing issue (see
(Jay 2003). Borod, Bloom, and Haywood 1998). RH brain damage is associ-
ated with emotional blunting (Gainotti 1972) and diiculties in
Methodological Perspectives identifying emotional words or the emotion they represent, in
RATING STUDIES. Rating studies provide a method for deter- matching words and emotions, in interpreting emotional con-
mining the emotional qualities of words. A classic example is tent, in describing emotional autobiographical information,
the semantic diferential (Osgood, Suci, and Tanenbaum 1957), in self-expression with emotional words (Borod, Bloom, and
where ratings of words on bipolar connotative scales relect three Haywood 1998), and in comprehending and expressing humor
underlying dimensions: evaluation (the valence component, (Blake 2003). he corpus callosum that links the RH and LH
e.g., negativepositive); activity (e.g., fastslow); and potency also plays a role in comprehending emotion-linked prosody,
(e.g., strongweak). L. H. Wurm and D. A. Vakoch (1996) argued humor, and igurative usages (Brown et al. 2005; Paul et al. 2003).
that evolutionary considerations and relations between process- he frontal lobe seems to regulate or inhibit socially inap-
ing time data and the evaluation, activity, and potency ratings for propriate uses of emotional words, with links between frontal
words indicate an afective lexicon (for avoiding threats) that dif- lobe damage and verbal aggression, such as excessive cursing
fers from the general lexicon (for obtaining valuable resources). (e.g., Grafman et al. 1996).
Other rating studies involving the afective lexicon include
Bellezza, Greenwald, and Banaji (1986), Bradley and Lang CLINICAL AND INDIVIDUAL DIFFERENCE STUDIES. Clientpatient
(1999), and Jay (1992). Unrepresented in current rating studies interactions focus on emotions, and an inability to express ones
278
emotions in words may relect a serious psychiatric problem factors. American males are more likely to curse than females
known as alexithymia. Alexithymic individuals have few words both as children and as adults, although women also learn a
for describing their feelings and communicating emotional range of taboo words, whether they use them or not. Similarly,
distress, are unable to identify and describe subjective states, Americans with high sexual anxiety but no religious training are
and have diiculty interacting with others, including therapists less likely to use sex-linked curse words than profanity or blas-
(Taylor, Bagby, and Parker 1997). phemy, especially in conversations with same-sex others (see Jay
Clinical studies have developed strategies for facilitating ther- 1992, 2000, 2003).
apeutic communication and emotional interactions in general, W. Buccis (1997) multiple code theory (MCT) of emo-
for example, use of metaphor (see Stine 2005). Clinical studies tional information processing links Freudian and connec-
have also developed new ways of using emotion-linked words to tionist concepts via the concept of referential activity (RA).
diagnose psychopathology. An example is the emotional Stroop RA is an index of the ability to link primary (e.g., emotional,
task where clients name the font color of words while attempting unconscious) and secondary (e.g., verbal, conscious) levels
to ignore their meaning: Longer color naming times for speciic of processing within a connectionist network. Applied in the
word classes (e.g., web, spider) are associated with clinical prob- domain of clinical psychology, MCT has provided explana-
lems such as phobias (e.g., arachnophobia), anxiety and depres- tions for negative psychological states, such as repression, in
sive disorders, alexithymia, eating disorders, drug abuse, and a terms of the nature or quality of connections between these
range of other psychopathologies (see Williams, Mathews, and fundamentally linguistic versus emotional levels of process-
MacLeod 1996 for a review). ing. Under MCT, people with high versus low RA differ in their
ability to express and describe their emotions, in the structure
EXPERIMENTAL STUDIES. Recent experimental studies have made and organization of their narratives, and in their therapeutic
extensive explorations of the efects of emotional words on cog- success rates.
nitive processes such as memory and attention. For example, Resource theories of emotion and attention (e.g., Wells
in a variant of the emotional Stroop task known as the taboo and Matthews 1994) perhaps provide the broadest conceptu-
Stroop (MacKay et al. 2004), people name the font color of taboo alization of emotion and cognitive processes. Under resource
and neutral words (equated for length, familiarity, and category theories, threatening stimuli attract limited-capacity cognitive
coherence) while ignoring the meaning of the words and their resources, thereby reducing resources available for processing
screen location. hey then receive a surprise memory test for the and responding to other stimuli, for example, font color in clini-
words, the font color of particular words, or the screen location cal, emotional, and taboo Stroop tasks. his hypothesis readily
of particular words, and the results indicate better memory for describes phenomena such as the taboo Stroop efect (longer
taboo than neutral words and better memory for the font colors times for naming the font color of taboo than of neutral words)
and screen locations of taboo than of neutral words (see, e.g., but cannot describe other phenomena, for example, superior
MacKay et al. 2004; MacKay and Ahmetzanov 2005). hese and memory for the font color and screen location of taboo than of
other results suggest that taboo words facilitate recall of con- neutral words (see MacKay et al. 2004).
textual details in the same way as do lashbulb memories for Two exceptions to the descriptive or post hoc approach that
traumatic events such as the September 11, 2001 tragedies, characterizes resource theories are noteworthy. One is arousal
after which people vividly recall contextual details associated theory (e.g., LeDoux 1996) as applied to emotional words (e.g.,
with the emotion-linked event, for example, how and when Kensinger and Corkin 2003). Under arousal theory, low-level
they irst learned of the event, where they were, what they were sensory aspects of emotional stimuli, such as taboo words,
doing, and who else was present (see MacKay and Ahmetzanov directly engage an emotional reaction system (in the amygdala)
2005). independently of other stimulus factors, such as context and
Other results indicate that taboo words impair immediate presentation rate. he emotional reaction system then triggers
recall of prior and subsequent neutral words in rapidly presented enhanced skin conductance and facilitates memory consolida-
mixed lists containing taboo and neutral words (e.g., MacKay, tion for the emotional stimuli and their context of occurrence (in
Hadley, and Schwartz 2005), without impairing recall of neigh- the hippocampus).
boring words in pure, all-taboo lists (Hadley and MacKay 2006). What makes arousal theory attractive is its generality and test-
However, lexical decision times (the time to identify a stimulus ability. For example, arousal theory explains lashbulb memories
as a word) do not difer for taboo versus neutral words (MacKay under the hypothesis that arousal tends to induce storage of per-
et al. 2004). We discuss theoretical perspectives on this pattern ceptual images that include both the emotional stimulus and
of results next. its context of occurrence. However, arousal theory as applied to
emotional words has not fared well in recent tests: Contrary to
Theoretical Perspectives arousal theory, if presented in mixed taboo-neutral lists at rela-
Current research on emotional words illustrates a gamut of the- tively slow rates (e.g., 2,000 ms/word) or if presented in pure (all-
oretical perspectives that difer in their scope and goals and in taboo or all-neutral) lists at rapid rates (e.g., 200 ms/word), taboo
the nature and speciicity of the predictions they make. Jays words are no better recalled than neutral words equated for
(2000) neuro-psychosocial theory of cursing summarizes likeli- familiarity, length, and category coherence (Hadley and MacKay
hood estimates of various forms of cursing, based on neurolog- 2006). Also contrary to arousal theory, recent data indicate that
ical (e.g., conscious state, brain damage), psychological (e.g., taboo words do not trigger imagelike memories (MacKay and
personality, age, history), and social context (e.g., culture, class) Ahmetzanov 2005).
279
he second notable exception to the summary-description Dewaele, J. 2004. he emotional force of swearwords and taboo words
approach is node structure binding theory, or binding theory in the speech of multilinguals. Journal of Multilingual and Cultural
for short (e.g., Hadley, and MacKay 2006). Under binding the- Development 25: 20422.
ory, emotion-linked stimuli, such as taboo words, engage the Dixon, N. F. 1971. Subliminal Perception: he Nature of a Controversy.
London: McGraw-Hill.
emotional reaction system, which delays activation of binding
Gainotti, G. 1972. Emotional behavior and the hemispheric side of the
mechanisms (located in the hippocampus) for linking concur-
lesion. Cortex 8: 4155.
rent neutral stimuli to their context of occurrence. As a result, Grafman, J., K. Schwab, D. Warden, A. Pridgen, H. R. Brown, and
(less important) neutral stimuli only form links to their context of A. M. Salazar. 1996. Frontal lobe injuries, violence, and aggression: A
occurrence after links to context for (more important) emotion- report of the Vietnam Head Injury Study. Neurology 46: 12318.
linked stimuli have been formed. Hadley, C. B., and D. G. MacKay 2006. Does emotion help or hinder
hese binding theory assumptions have generated counter- immediate memory? Arousal versus priority-binding mechanisms.
intuitive predictions that subsequent experimental tests have Journal of Experimental Psychology: Learning, Memory, and Cognition
veriied. For example, unlike other theories, binding theory 32: 7988.
correctly predicted impaired recall of neutral neighbors before Harris, C. L., A. Aycicegi, and J. B. Gleason. 2003. Taboo words and rep-
and after a taboo word if and only if mixed (taboo-neutral) word rimands elicit greater autonomic reactivity in a irst language than in a
second language. Applied Psycholinguistics 24: 56179.
lists are presented rapidly (Hadley and MacKay 2006). Binding
Isenberg, N., D. Silbersweig, A. Engelien, S. Emmerich, K. Malavade, B.
theory also correctly predicted no diference in recall of taboo
Beattie, A. C. Leon, and E. Stern. 1999. Linguistic threat activates the
versus neutral words in pure (all-taboo or all-neutral) lists prehuman amygdala. Proceedings of the National Academy of Science
sented rapidly or slowly (Hadley and MacKay 2006). Unlike 96: 104569.
other theories, binding theory also correctly predicted no dif- Jackson, H. 1958. Selected Writings of Hughlings Jackson. Vol 2. New
ference in lexical decision times (the time to identify a stimulus York: Basic Books.
as a word) for taboo versus neutral words (MacKay et al. 2004). Jay, T. B. 1992. Cursing in America. Philadelphia: John Benjamins.
. 2000. Why We Curse. Philadelphia: John Benjamins.
. 2003. Psychology of Language. Upper Saddle River, NJ: Prentice-
Conclusion Hall.
Both historical and contemporary research on emotional Jay, T. B., K. King, and T. Duncan. 2006. Memories of punishment for
words relects a wide variety of theoretical and methodological cursing. Sex Roles 32: 12333.
approaches in ields ranging from neuroscience to psycholin- Jung, C. G. 1910. he association method. American Journal of
guistics to cognitive and clinical psychology. Further research is Psychology 31: 21969.
required to piece together these multiple domains and to develop Kensinger, E. A., and S. Corkin. 2003. Memory enhancement for emo-
a general understanding of emotional words and their relation to tional words: Are emotional words more vividly remembered than
other cognitive processes. However, emotional words currently neutral words? Memory and Cognition 31: 116980.
seem poised to resume their central position in the language sci- LaBar, K., and E. Phelps. 1998. Arousal-mediated memory consolida-
tion: Role of the medial temporal lobe in humans. Psychological
ences and related disciplines.
Science 9: 4903.
Kristin L. Janschewitz and Donald G. MacKay LeDoux, J. 1996. he emotional brain: he mysterious underpinnings of
emotional life. New York: Simon and Schuster.
Lewis, P. A., H. D. Critchley, P. Rotshtein, and R. J. Dolan. 2007. Neural
correlates of processing valence and arousal in afective words.
Adolphs, R., J. A. Russell, and D. Tranel. 1999. A role for the human Cerebral Cortex 17: 7428.
amygdala in recognizing emotional arousal. Psychological Science MacKay, D. G., and M. V. Ahmetzanov. 2005. Emotion, memory, and
10: 16771. attention in the taboo Stroop paradigm: An experimental analog of
Bellezza, F. S., A. G. Greenwald, and M. R. Banaji 1986. Words high and lashbulb memories. Psychological Science 16: 2532.
low in pleasantness as rated by male and female college students. MacKay, D. G., C. B. Hadley, and J. H. Schwartz. 2005. Relations between
Behavior Research Methods, Instruments, and Computers 18: 299303. emotion, illusory word perception, and orthographic repetition blind-
Blake, M. L. 2003. Afective language and humor appreciation after ness: Tests of binding theory. Quarterly Journal of Experimental
right hemisphere brain damage. Seminars in Speech and Language Psychology 8: 151433.
24:10719. MacKay, D. G., M. Shafto, J. K. Taylor, D. Marian, L. Abrams, and J. Dyer
Borod, J. C., R. L. Bloom, and C. S. Haywood. 1998. Verbal aspects 2004. Relations between emotion, memory and attention: Evidence
in emotional communication. In Right Hemisphere Language from taboo Stroop, lexical decision, and immediate memory tasks.
Comprehension: Perspectives from Cognitive Neuroscience, ed. M. Memory and Cognition 32: 47488.
Beeman and C. Chiarello, 285307. Mahwah, NJ: Erlbaum. Maddock, R. J., A. S. Garrett, and M. H. Buonocore. 2003. Posterior cin-
Bradley, M. M., and P. J. Lang. 1999. Afective Norms for English Words gulate cortex activation by emotional words: fMRI evidence from a
(ANEW): Instruction Manual and Afective Ratings. Technical Report valence decision task. Human Brain Mapping 18: 3041.
C-1, he Center for Research in Psychophysiology, University of Florida. Osgood, C. E., G. J. Suci, and P. H. Tannenbaum. 1957. he Measurement
Brown, W. S., L. K. Paul, M. Symington, and R. Dietrich. 2005. of Meaning. Urbana: University of Illinois Press.
Comprehension of humor in primary agenesis of the corpus callo- Paul, L. K., D. Van Lancker-Sidtis, B. Schiefer, R. Dietrick, and W. S.
sum. Neuropsychologia 43: 90616. Brown. 2003. Communicative deicits in agenesis of the corpus callo-
Bucci, W. 1997. Symptoms and symbols: A multiple code theory of som- sum: Nonliteral language and afective prosody. Brain and Language
atization. Psychoanalytic Inquiry 17: 15172. 85: 31324.
280
Emotion, Speech, and Writing
Silvert, L., S. Delplanque, H. Bouwalerh, C. Verpoort, and H. Sequeira. In her critical essay, J. T. Irvine (1990) wrote that many lin-
2004. Autonomic responding to aversive words without conscious guists tend to get cold feet when it comes to considering how
valence discrimination. International Journal of Psychophysiology emotions are expressed verbally. Accordingly, if we use the ter-
53: 13545. minology of Ferdinand de Saussure, we can say that emotion
Staats, A. W. 1968. Language, Learning, and Cognition. New York: Holt,
is accepted as integral to the parole, which is linguistically less
Rinehart and Winston.
meaningful than langue language in its broadest sense. hus,
Stine, J. J. 2005. he use of metaphors in the service of the therapeutic
alliance and therapeutic communication. Journal of the American
examination of emotion is pushed to the periphery of linguis-
Academy of Psychoanalysis and Dynamic Psychiatry 33: 53145. tics. Irvine also remarks that though there are languages with
Taylor, G. J., R. M. Bagby, and J. D. A. Parker. 1997. Disorders of phonological and morphological units that indicate
Afect Regulation: Alexithymia in Medical and Psychiatric Illness. emotional states in speech, linguists frequently tend to combine
Cambridge: Cambridge University Press. such elements with general descriptions of grammar, rather than
Van Lancker, D. 1987. Nonpropositional speech: Neurolinguistic stud- emphasizing such verbal expressions of emotion. However, she
ies. In Progress in the Psychology of Language, ed. A.W. Ellis, 49118. notes two important linguistic texts that also deal with emotion
London: Erlbaum. in language, namely, Edward Sapirs lexicon of emotions as mir-
Wells, A., and G. Matthews. 1994. Attention and Emotion: A Clinical roring culture and Roman Jakobsons work relating to the emo-
Perspective. Hove, UK: Lawrence Erlbaum.
tive function of language.
Williams, J. M. G., A. Mathews, and C. MacLeod. 1996. he emotional
C. Cai and R. W. Janney (1994) examined the rhetorical
Stroop task and psychopathology. Psychological Bulletin 120: 324.
Wurm, L. H., and D. A. Vakoch. 1996. Dimensions of speech percep-
strategies for expressing emotion by comparing psychological
tion: Semantic associations in the afective lexicon. Cognition and categories of emotions with linguistic categories. hey deine
Emotion 10: 40923. emotional communication as directed strategies for imparting
emotional information in speech or writing, insisting that such
expressions must be analyzed linguistically, because language
EMOTION, SPEECH, AND WRITING
spoken or written is the means for conveying emotion. heir
In our everyday life, we are frequently exposed to expressions such model comprises linguistic markers, including speciic emotion
as a thousand words cannot express a single emotion, what I words, obligatory words, syntax markers, and spoken language
feel is something that is beyond words, and so on. his kind of mechanisms (i.e., tones and intonation, prosody, length of
utterance, relecting the diiculty of expressing emotions, evokes syllables, etc.).
special interest: Can speech and writing really express emotions? Cai and Janneys writings infer that there are signiicant
In todays world, with the increasing awareness of emotion connections between textual linguistic usage and emotions,
as part of the self and the importance of expressing emotion as as evidenced also in diaries, letters, and other autobiographi-
part of human communication, language becomes vital to the cal writings. For instance, language was used to measure emo-
understanding and analysis of emotions. Lexical choices relect tion in a study by G. Collier, D. Kuiken, and M.E. Enzle (1982).
how people experience the world around them and, thus, con- he researchers noted that when describing negative emotions,
stitute mediators between individuals emotions, which are people use more complex constructions than in describing posi-
internal and subjective, and external entities, such as society and tive emotions, and this also applies to expressions of negative
environment. as opposed to positive personal qualities. he positive is always
heorists of emotion stress that language is the most con- more clearly expressed. Assessing descriptions of levels of posi-
venient channel for approaching research into emotions and tivity or negativity of emotions or traits indicated that the feel-
that emotion words are the best way to relect the emotional ings or traits described in more complex detail tended to be more
experience. Psychologists and psychoanalysts (Freud and his negative, that is, the more complex the descriptions, the more
followers) recognize that in spite of the importance of nonver- likely they were to be negative.
bal behavior, words are the natural way of exteriorizing the inner An earlier study by C. E. Osgood (1958) dealt with the connec-
emotional world (see psychoanalysis and language). tion between emotion and language, establishing the link
heorists of emotion (e.g., Ortony, Clore, and Foss 1987) also between the lingual characteristics of a text and the motivation
stress that language ofers the most convenient access for research- level of the author when writing it. he research studied suicide
ing emotions, and that emotion words are the best way of relect- notes, written under the inluence of very strong emotion the
ing emotional experiences. Linguists such as N. J. Enield and A. last letter as compared with ordinary correspondence with
Wierzbicka (2002) went further, stating that it would be impossi- family or close friends.
ble to examine peoples emotions without putting language at the In her chapter How and why is emotion communicated?
center, both as the object of the research and as the research tool. S. Planalp (1999) writes that verbal expression of feelings is
One reason for the complexity of such studies, according to sev- endemic to the process of communication, even though peo-
eral researchers, is linguistic usage that confuses emotion terms. ple do not always use words. hey do not, as a rule, announce
Criticism of psychological research into emotions focuses mainly that Im angry or Im feeling depressed at the moment, but
on the fact that most research in this ield relies largely on linguis- there are other verbal indications, like swearing or extravagant
tic labels and not on direct measurement of the emotion itself. If outbursts such as I could kill him!
this is, in fact, the case, it is particularly important to investigate J. W. Pennebaker and M. E. Francis (1996) analyzed per-
the language of emotions as a discrete issue, with tools exterior to sonal texts describing thoughts about commencing higher stud-
the emotion itself such as those of linguistics. ies by irst-year students at college. Linguistic and cognitive
281
Emotion, Speech, and Writing
parameters were classiied according to speciic verbal catego- for example, words relating to the self, social words directed
ries. his included classifying emotion words used by the sub- to others, words expressing positive or negative emotion, cog-
jects (in particular, positive and/or negative expressions), while nitive words, long words (more than six letters), and others.
the cognitive parameters included clarity, accessibility to the He believes that constant use of positive expressions indicates
reader, and schematic organization of the text. he connection optimism, whereas negative expressions indicate depression.
between these linguistic and cognitive variables and mental Cognitive terms (In my opinion, It seems to me, I think,
health was then examined, as were the academic achievements etc.) indicate that the writer does a lot of considering and prepa-
of the subjects in their ongoing studies. ration when writing and is thus more thoughtful and self-aware.
A. Boals and K. Klein (2005) examined how the words used Constant use of long words suggests that the writer is alienated,
in a narrative can convey stress or distress in regard to levels of keeping his/her distance.
pain after a negative emotional event. heir subjects were more Another group of researchers (Pennebaker, Slatcher, and
than 200 students who had undergone a romantic crisis or the Chung 2005) attempted to learn from the speeches of U.S. presi-
breaking up of a relationship. he students were asked to write dential candidates about their personalities and emotions. It was
about both the relationship and the efects of the separation. shown that in spite of advice received by political candidates from
he researchers found that there was conspicuously more use their advisors about using words correctly (e.g., using the irst
of negative emotion expressions, both of physical words and person plural instead of singular), they sometimes speak more
of irst person utterances, as compared with descriptions of the freely, revealing more about their personalities. Pennebaker
relationship before breaking up. It was possible to pinpoint lin- and his colleagues emphasized the use of functional words that
guistic diferences between rejection/repression and intensive indicate the ability to absorb and organize thoughts and ideas.
internalization of an experience. he rejecters tended to use Using the program developed for this study, they also examined
more casual language, negative emotion words, and the irst positive and negative expressions of emotion, cognitive words,
person singular, as well as pronouns when referring to others, exclusives, singular and plural expressions, etc.
but also used fewer cognitive words. Cognitive expressions imply From all these diverse studies, we learn that language, the
actively searching for meaning and comprehension of a stressful dominant aspect of intercommunication between individuals,
event and of depression, so that using them is characteristic of is the simplest method of revealing a diferent system, one that
people who thoroughly work through such an event. has its own attributes and inluences all aspects of our lives, that
Psychotherapy also ofers sources for researching emotions is, the emotional system. One can, as a rule, consciously control
via language. he therapists diagnoses and therapeutic meth- the content of the story one tells, but it is more diicult to control
ods are frequently based on patients language choice of words, the exact choice of each word. When it comes to what to write
slips of the tongue, narratives in their stories, repetitions when or say, we are aware of what we are doing, but this is not always
describing a trauma, and other markers (see, for example, Bucci, the case with how we do it. Even the most practiced speaker inds
2001). it diicult to monitor all the words he or she selects in order to
D. D. Danner, D. A. Snowdon, and W. V. Friesen (2001) exam- communicate. hus, linguistic markers are, in fact, the building
ined autobiographies of nuns as part of research known as the blocks that must be used as the foundation for researching emo-
Nun Study. hey examined the link between writings about tion in language.
positive emotional incidents, expressed in positive terms, and
Osnat Argaman
the life span of the writers. Emotional stability, measured accord-
ing to the use of emotion words, was found to have a signiicant WORKS CITED AND SUGGESTIONS FOR FURTHER READING
positive efect.
Use of nonautobiographical texts not produced under labora- Boals, A., and K. Klein. 2005. Word use in emotional narratives about
failed romantic relationships and subsequent mental health. Journal
tory conditions are derived from Internet chat, blogs, and e-mails
of Language and Social Psychology 24: 25268.
in various frameworks. For example, M. A. Cohn, M. R. Mehl, and
Bucci, W. 2001. Pathways of emotional communication. Psychoanalytic
J. W. Pennebaker (2004) examined changes in the language use Inquiry 21.1: 4070.
of American citizens after September 11, 2001. he researchers Cai, C., and R. W. Janney. 1994. Toward a pragmatics of emotive com-
studied randomly selected blogs over a period of four months munication. Journal of Pragmatics 22: 32573.
two months prior to and two months after the traumatic event. Cohn, M. A., M. R. Mehl, and J. W. Pennebaker. 2004. Linguistic mark-
Linguistic markers to psychological diferences were studied. ers of psychological change surrounding September 11, 2001.
Two weeks after the attack, the writers had returned to baseline Psychological Science 15: 68793.
in regard to use of lexical expressions of emotion. Pennebaker, Collier, G., D. Kuiken, and M. E. Enzle. 1982. he role of grammatical
who collaborated both with Francis in researching academic qualiication in the expression and perception of emotion. Journal of
achievement according to linguistic parameters and in Cohns Psycholinguistic Research 11: 63150.
Danner, D. D., D. A. Snowdon, W. V. Friesen. 2001. Positive emotions
work on linguistic markers in blogs written after 9/11, devel-
in early life and longevity: Findings from the Nun Study. Journal of
oped a tool for categorizing various types of text according to
Personality and Social Psychology 80: 80413.
style, called the LIWC (linguistic inquiry and word count), which Enield, N. J., and A. Wierzbicka. 2002. Introduction: he body in descrip-
is available online (Pennebaker 2007). Pennebaker maintains tion of emotion. Pragmatics and Cognition 10.1/2: 125.
that his computer program collates words from various cate- Irvine, J. T. 1990. Registering afect: Heteroglossia in the linguistic expres-
gories and translates them, according to their relative number sion of emotion. In Language and the Politics of Emotion, ed. C. A. Lutz
in the text, to psychological meanings. Main categories include, and L. Abu-Lughod, 12661. Cambridge: Cambridge University Press.
282
Emotion Words
Ortony, A., G. L. Clore, and M. A. Foss. 1987.he referential structure of related words (Shaver, Wu, and Schwartz 1992). Recent studies
the afective lexicon. Cognitive Science 11: 34164. on the Italian (Zammuner 1998) and the French (Niedenthal et
Osgood, C. E. 1958. Some efects of motivation on style of encoding. al. 2004) emotion lexicons suggest that prototypicality ratings are
In Style in Language, ed. T. A. Sebeok, 293306. Cambridge, MA: MIT driven by valence, intensity, duration, familiarity, age of acquisi-
Press.
tion, and frequency in the corpus.
Pennebaker J. W. 2007, he world of words. Available online at: http://
www.liwc.net/. How Are Emotion Words Represented in the Mind?
Pennebaker, J. W., and M. E. Francis. 1996. Cognitive, emotional and
psycholinguistics distinguishes abstract and concrete words
language processes in disclosure. Cognition and Emotion 10: 60126.
as separate classes of words, and recent work suggests that emo-
Pennebaker, J. W., R. B. Slatcher, and C. K. Chung. 2005. Linguistic
markers of psychological state through media Interviews: John Kerry
tion words may form yet a third class of words. In general, con-
and John Edwards in 2004, Al Gore in 2000. Analyses of Social Issues crete versus abstract words are easier to imagine, more quickly
and Public Policy 5: 197204. recalled, and more easily recognized. In addition, concrete words
Planalp, S. 1999. Communicating Emotion: Social, Moral and Cultural are more easily associated with a context, perhaps because of
Processes. Cambridge: Cambridge University Press. prior association with those contexts (Schwanenlugel, Akin, and
Luh 1992). When approached as a third class of words, emotions
words are rated as less concrete and lower in context availability
EMOTION WORDS
than abstract and concrete words. Nevertheless, they are rated
What Counts as an Emotion Word? higher in imageability than abstract words, perhaps because
Languages difer in the size and range of their emotion vocabu- of some connection to scripts or typical situations in which
laries. here are, for example, more than 500 words in English, they are experienced. Further, when participants give the irst
750 in Taiwanese Chinese (Russell 1991), and 256 in Filipino word that comes to mind in response to concrete, emotion, and
(Church, Katigbak, and Reyes 1996). In addition, translation abstract words, emotion words garner the highest number of
equivalents often cover overlapping but not identical semantic diferent associates (Altarriba and Bauer 2004). If associates are
space (Wierzbicka 1999). Clearly, the investigation of the emo- stored together (as an associative model of memory suggests),
tion lexicon requires the careful delimitation of what counts as then emotion words would seem to be linked to a richer concep-
an emotion word. tual base than either of the other two word types. It is interest-
Empirical approaches to this question are driven by ing that when Spanish-English bilinguals perform these tasks in
prototype theory (Fehr and Russell 1984; Rosch 1978), accord- Spanish, ratings of context availability in Spanish are higher than
ing to which semantic categories are recognized not by lists of in English. his raises the possibility that emotion words might
necessary and sufficient features but in terms of a gestalt be encoded in language-speciic ways (Altarriba 2006).
or conigurational whole. his approach suggests that emotion
Robert W. Schrauf
is a fuzzy category, and emotion words it the category in a
graded manner.
A number of taxonomies have been proposed. G. L. Clore, WORKS CITED AND SUGGESTIONS FOR FURTHER READING
A. Ortony, and M. A. Foss (1987) distinguished eight categories Altarriba, J. 2006. Cognitive approaches to the study of emotion-laden
in English: 1) pure afective states (e.g., happy), 2) afective-be- and emotion words in monolingual and bilingual memory. In Bilingual
havioral states (e.g., cheerful), 3) afective-cognitive states (e.g., Minds: Emotional Experience, Expression, and Representation, ed. A.
encouraged), 4) cognitive states (e.g., certain), 5) cognitive- Pavlenko, 23256. Clevedon, UK: Multilingual Matters.
behavioral states (e.g., cautious), 6) bodily states (e.g. sleepy), Altarriba, J., and L. M. Bauer. 2004. he distinctiveness of emotion con-
7) subjective evaluations of character (e.g., attractive), and 8) cepts: A comparison between emotion, abstract, and concrete words.
objective conditions (e.g., abandoned). Analyses of prototypi- American Journal of Psychology 117: 389410.
cality ratings of 585 candidate emotion words conirmed the Church, A. T., M. S. Katigbak, J. A. S. Reyes. 1996. Toward a taxonomy of
trait adjectives in Filipino: Comparing personality lexicons across cul-
empirical discriminability of the eight categories, and words in
tures. European Journal of Personality 10: 324.
the irst three (afective) categories had the highest typicality
Clore, G. L., A. Ortony, M. A. Foss. 1987. he psychological foundations
ratings. of the afective lexicon. Journal of Personality and Social Psychology
Phillip Shaver and colleagues (1987) used cluster analysis of 53: 75166.
prototypicality ratings of English emotion words to display a pro- Fehr, B., and J. A. Russell. 1984. Concept of emotion viewed from a
totype hierarchy, with two superordinate categories encompass- prototype perspective. Journal of Experimental Psychology: General
ing positive versus negative terms and ive basic-level terms: love, 113: 46486.
joy, anger, sadness, and fear. he rest of the terms are subor- Niedenthal, P. M., C. Auxiette, A. Nugier, N. Dalle, P. Bonin, M. Fayol.
dinates under these basic terms (Shaver et al. 1987; Storm and 2004. A prototype analysis of the French category emotion.
Storm 1987). It is interesting to note that negative emotion words Cognition and Emotion 18.3: 289312.
generally outnumber positive emotion words, perhaps explained Rosch, E. 1978. Principles of categorization. In Cognition and
Categorization, ed. E. Rosch and B. B. Lloyd, 2748. Hillsdale,
by the greater cognitive processing required by negative events
in comparison with positive events (Schrauf and Sanchez 2004).
Russell, James A. 1991. Culture and the categorization of emotions.
he Indonesian emotion lexicon has the same overall structure Psychological Bulletin 110.3: 42650.
(Shaver, Murdaya, and Fraley 2001), but in the Chinese lexicon, Schrauf, R. W., J. Sanchez. 2004. he preponderance of negative
a love category does not emerge separate from happiness- emotion words in the emotion lexicon: A cross-generational and
283
Emplotment
cross-linguistic study. Journal of Multilingual and Multicultural disagreements but also entail highly signiicant practical diver-
Development 25.2/3: 26684. gences as well.
Schwanenlugel, P. J., C. Akin, W. Luh. 1992. Context availability and For White, the formation of the story level is roughly
the recall of abstract and concrete words. Memory and Cognition Aristotelian, the shaping of a beginning, middle, and end.
20: 96104.
Emplotment proper follows Northrop Fryes modes (see Frye
Shaver, P., U. Murdaya, R. C. Fraley. 2001. Structure of the Indonesian
1957). Drawing on more recent work in cognition, we might
emotion lexicon. Asian Journal of Social Psychology 4: 20124.
Shaver, P. R., S. Wu, J. C. Schwartz. 1992. Cross-cultural similarities and
preserve Whites (and Fryes) insights, while understanding the
diferences in emotion and its representation: A prototype approach. precise structures and operations of those structures slightly
In Review of Personality and Social Psychology. Vol. 13: Emotion. Ed. diferently.
M. S. Clark, 175212. Newbury Park, CA: Sage.
Shaver, Phillip, Judith Schwartz, Donald Kirson, Cary OConnor. 1987. Emplotment and Cognition
Emotion knowledge: Further exploration of a protoytpe approach. he sort of emplotment discussed by White is part of our ordi-
Journal of Personality and Social Psychology 52: 106186. nary causal thought. Indeed, our everyday thought about every-
Storm, C., T. Storm. 1987. A taxonomic study of the vocabulary of emo- thing, from our personal lives to larger social patterns, is bound
tions. Journal of Personality and Social Psychology 53: 80516. up with emplotment in roughly Whites sense. hus, we might
Wierzbicka, A. 1999. Emotions Across Languages and Across
consider the more professional forms of emplotment alongside
Cultures: Diversity and Universals. Cambridge: Cambridge University
more ordinary forms in order to better understand both.
Press.
Zammuner, V. L. 1998. Concepts of emotion: Emotionness and dimen-
Historiography and everyday causal thought share several
sional ratings of Italian emotion words. Cognition and Emotion salient tendencies and constraints. First, they tend to be con-
12: 24372. cerned with particularity. Although we try to isolate general
principles for any sort of explanation, history and daily life are
unlike paradigmatic natural sciences, for in history and daily life,
EMPLOTMENT
generalities are most often a means of understanding particu-
Emplotment is the organization of events into a narrative. he lars rather than the reverse. In addition, our concerns in history
concept was developed most inluentially by Hayden White in and everyday life are not subject to repetition in controlled cir-
his treatment of historiography. White distinguishes ive levels cumstances where we can manipulate variables. As a result, our
of conceptualization in the writing of history (1973, 5). he irst causal accounts in these cases must range over a vast number
is the chronicle, a simple listing of events in the order of their of possible causal factors. We tend to chose the factors that are
occurrence. he second is the formation of these events into a important by a more or less loose comparison across sequences
basic causal sequence or story. he third, emplotment proper, that we have grouped together as parallel not experimentally
is their further elaboration into a narrative with a point. (he but conceptually. For example, in ordinary life, I may categorize
fourth and ifth levels, mode of argument and mode of ideo- several failed friendships together and infer their common prop-
logical implication, go beyond emplotment and thus the main erties, thus why the most recent friendship failed. his may occur
concerns of this entry.) According to White, diferent historians self-consciously or implicitly. Similarly, a presidential advisor
commonly organize even the same sequence of events into diver- might categorize several failed foreign policy initiatives together
gent histories, relecting diferent strategies of emplotment. he in order to infer what led to the failure of the most recent policy
same point applies beyond writers on history. Everyone emplots or to avoid such failure in a current policy.
events, from political igures shaping public policy to ordinary It is worth pausing over this point for a moment. In ordinary
people in conversational storytelling. cognition, our grouping together of (putatively parallel) event
We might consider the events of September 11, 2001, by way sequences is almost invariably bound up with prototype for-
of illustration. A chronicle would simply list the events of the mation. A prototype results from a weighted averaging across
day. A basic story would set out the causal relations the orga- instances of a category. Weighting is determined by several fac-
nization of the conspirators, their practice, their inal execution tors, prominent among them salience and distinctiveness for the
of their plans, and so on. It should be clear that, even here, there category (cf. Tversky 1977; Barsalou 1983, 212; and Kahneman
are diferent ways in which events may be selected and grouped and Miller 1986, 143, on contrast). For example, our prototype
together and diferent ways in which causal links may be pos- for a man will result from averaging across individual men, but
ited. For example, in the lead-up to the invasion of Iraq, some this is not a pure statistical average. Men to whom we pay more
commentators suggested that various actions of the Iraqi gov- attention (e.g., heroes and villains in movies) will be weighted
ernment were part of the September 11 causal sequence; others more heavily than men of whom we are only peripherally aware.
denied the connection, arguing that this was not even a plausi- Moreover, for individual men, distinctive characteristics (e.g.,
ble part of the basic story. he third level, emplotment, embeds facial hair, as a distinctive diference from both boys and women)
the causal sequence in a more elaborated structure. In the case will weigh more heavily than nondistinctive characteristics.
of the Bush administration, that structure was a war narrative in hus, our prototypical man is more manly than the statisti-
which the events of September 11 constituted an act of war. For cal average. Finally, once established, even in a minimal form,
many others, that structure was a crime narrative in which the our prototypes guide categorization. hey do so by directing our
events were a (massive) criminal violation. As these cases sug- attentional focus to distinctive (thus putatively identifying) char-
gest, diferences at the level of causal interpretation and difer- acteristics of individual men. Given that this is part of the general
ences at the level of emplotment not only manifest intellectual operation of the human mind, it presumably occurs with other
284
Emplotment
sorts of prototype as well, including prototypes for categories of is the point at which our emotion systems are engaged. he end
event sequences thus, narrative prototypes. of the story is the point at which our emotion systems return to
We may broadly distinguish, then, between two types of causal their normal state. his is why Americans tend to view the con-
understanding. he irst sort, found in what might be called the spiracy of the hijackers as the beginning of the story. With limited
general sciences, is experimental, based on the isolation of causal exceptions, they lack emotional interest in what preceded and
features by the controlled manipulation of variables. he sec- motivated the attacks.
ond sort, found in ordinary life and in what might be called the Of course, things do not end with this level of selection, and so
particularistic sciences, is prototypical, based on the formation on. Whenever we isolate aspects of a particular event sequence
of distinctive, (loosely) statistical structures. Of course, there due to their emotional force, we simultaneously activate cognitive
are intermediate cases. Moreover, there are diferent degrees to structures for understanding and responding to that sequence.
which statistical derivations may be made explicit and rigorous. hese structures crucially involve prototypes, which supplement
For example, there are areas of economic history where we might our emotional responses in elaborating interpretations, explana-
achieve relatively high levels of explicitness and rigor. In other tions, expectations, directing attentional focus, and so on. In the
cases, however, it is very diicult to make the statistical pro- case of event sequences, these are crucially narrative prototypes,
cess at all scientiic, for the selection of a comparison set (which including subprototypes bearing on actions and on agents. his,
guides causal inferences) is already so thoroughly imbued with then, leads us to the level of emplotment proper.
the implicit prototypes of the researcher. he narrative prototypes that guide emplotments undoubt-
his division in types of causal understanding is, of course, edly include the broad structures of narrative universals.
connected with the orientation of the particularistic sciences to For example, the hijackers may have emplotted their actions in
the explanation of particulars. But there are many particulars. terms of a sacriicial narrative in which the sufering of the home
Just how does our interest in certain particulars arise? In both society will be relieved by God due to the voluntary death of a
particularistic and general sciences, we attend to the explana- member of that society. In contrast, the U.S. government emplot-
tion of individual objects or events when we care about them. ted these same actions as the foreign aggression component of a
We care about something when it has an emotional impact, heroic plot. Beyond these cross-cultural patterns, emplotments
which is to say, when it engages some emotion system. hus, an also derive from more culturally speciic narrative structures,
understanding of emotion systems is crucial for understanding including structures related to culturally deined practices, such
our explanatory aims in particularistic study. As it turns out, an as those of legal systems (as in the emplotment of the September
understanding of emotion systems also gives us a way of under- 11 attacks as criminal acts).
standing Whites irst and second levels of conceptualization. Both universal and culturally speciic narratives are bound up
Speciically, both a chronicle and a basic story involve three not only with emotions but also with values related to those emo-
fundamental cognitive operations selection, segmentation, tions. hus, it is unsurprising that diferent emplotments tend to
and structuration. (On these processes, see Hogan 2003, 3840.) import diferent social agendas and diferent political attitudes
here are countless aspects of any given sequence of events and into the interpretation of the event sequence. (In Whites sys-
countless construals of those events and their components. tem, this appears in the ifth level of conceptualization, mode
Even a chronicle selects certain aspects while ignoring others, of ideological implication [1973, 5].) It is also unsurprising that
clusters those aspects together into the events that compose they tend to be points of consequential political contestation.
the chronology, and gives those aspects at least some degree of
internal structure. For example, in a chronology of the events of A Note on Emplotment and Grammar
September 11, we might include a statement that the hijackers he study of narrative is a consequential part of linguistic
took over one airplane at such and such a time. hat statement discourse analysis and sociolinguistics. In this way,
selects various aspects of the situation and organizes them into emplotment necessarily has an important place in the language
a brief causal moment. Moreover, in going beyond a chronicle sciences. However, it is worth mentioning that emplotment may
and telling the basic story, we might begin with the irst plane be related to more narrowly grammatical issues as well. One
crash, or we might begin with the conspiracy of the hijackers, or might argue that thematic roles are, irst of all, narrative posi-
we might begin with various aspects of U.S. policy in the Muslim tions that have grammatical consequences. Of course, one might
world (seen by the hijackers as justiication for the September 11 also see thematic roles as orienting our emplotments by way of an
attacks). he questions that arise here concern just why we select initial operation in grammar. Similarly, one might argue that the
certain aspects and construals over others and how we come to diferent causal relations encoded in causative construc-
decide that events begin and end at certain points. tions are a function of our tendency to emplot experience or
he simple answer to these questions is that our initial selec- at least that the linguistic propensity realized in some languages
tion and construal (as manifest in a chronicle) are the product of gives us a clue as to the diversity of causal sequences that broadly
our emotion systems. (For a more technical discussion of these constrain our emplotments. Finally, the grammatical encoding of
issues, and for research supporting this analysis, see Hogan event individuation in some languages (see Kroeger 2004, 2335)
2008 and Chapter 4 of Hogan 2009.) We are emotionally sensi- may indicate the dependency of certain grammatical features on
tive to certain sorts of properties, conditions, alterations, and so a prior (implicit) emplotment, or it may point us toward a further
on. hese draw our attentional focus. Our sense of a beginning area of research that will help us understand event individuation
and an ending (thus, our fashioning of a basic story) are equally and its relation to emplotment. In any case, there are reasons to
guided by our emotional responses. he beginning of the story believe that emplotment is closely related not only to broad issues
285
Encoding
in discourse but to more narrowly grammatical concerns as well. distinctions in sounds from any language, but those 1012 months
he most radical view of this relation would be that emplotment old can only distinguish sounds that are meaningful in their own
is cognitively fundamental to certain aspects of grammar (a point language (Stager and Werker 1997). Reacquisition of the ability
suggested by authors such as Mark Turner [1996]). Alternatively, to encode sounds from other languages can be diicult later in
it may simply be that certain features recur in grammar and nar- life. On the other hand, the stagelike encoding of complex lexi-
rative, due to shared cognitive sources (an account that may be cal and grammatical information over the course of childhood
suggested by certain aspects of frame semantics) or due to the probably has as much to do with the nature of language learn-
efects of grammar on emplotment. ing as with developing cognitive maturity (Snedeker, Geren, and
Shafto 2007). In the same vein, proponents of the controversial
Patrick Colm Hogan
critical period hypothesis suggest that ultimate proiciency
in a second language is a function of earlier age at acquisition,
due in part to maturational abilities and exposure to the language
Barsalou, Lawrence. 1983. Ad hoc categories. Memory and Cognition (DeKeyser and Larson-Hall 2005; see also second language
11.3: 21127. acquisition).
Frye, Northrop. 1957. Anatomy of Criticism: Four Essays. Princeton, Investigations of the encoding of linguistic knowledge have
NJ: Princeton University Press.
relied primarily (though not exclusively) on priming paradigms
Hogan, Patrick Colm. 2003. Cognitive Science, Literature, and the Arts: A
in which some language knowledge (phonological, lexical,
Guide for Humanists. New York: Routledge.
. 2008. Stories, wars, and emotions: he absoluteness of narrative
semantic) stored in long-term memory is activated, and then
beginnings. In Narrative Beginnings, ed. Brian Richardson, 4462. its efect is measured on some task that relies on that implicit
Lincoln: University of Nebraska Press. activation (e.g., associative priming with lexical decision as the
. 2009. Understanding Nationalism: Narrative, Identity, and task; see also spreading activation).
Cognitive Science. Columbus: Ohio State University Press.
Kahneman, Daniel, and Dale Miller. 1986. Norm theory: Comparing Encoding of Verbal Experience
reality to its alternatives. Psychological Review. 93.2: 13653. A great deal of our world knowledge is initially learned verbally,
Kroeger, Paul R. 2004. Analyzing Syntax: A Lexical-Functional Approach. and language scholars have been particularly interested in the
Cambridge: Cambridge University Press. extent to which linguistically encoded information retains its
Turner, Mark. 1996. he Literary Mind. Oxford: Oxford University Press.
linguistic form at retrieval. Experimental work in the labora-
Tversky, Amos. 1977. Features of similarity. Psychological Review
tory often focuses on new information learned as lists of words,
84: 32752.
White, Hayden. 1973. Metahistory: he Historical Imagination in
sentences, or paragraphs. Participants are then tested for their
Nineteenth-Century Europe. Baltimore: Johns Hopkins University memory of both the information (the conceptual information)
Press. and any accompanying linguistic detail in which it was pre-
sented (words, phrases, sentences, etc). bilingualism pro-
vides an ideal test case in this regard because the language of
ENCODING
both encoding and retrieval can be experimentally manipu-
Encoding refers both to the process of laying down informa- lated. An inluential theory in this ield is encoding speciicity,
tion in memory as the result of exposure to certain stimuli and which suggests that successful retrieval is premised on a match
to the organization of that information once it has been laid between information in the retrieval cue and information stored
down. From the viewpoint of language, this encompasses both in the encoded memory trace (Tulving 1983). In this case, the
the encoding of linguistic knowledge and the encoding of verbal language used at the time of encoding the information is puta-
experience (Francis 1999). he former refers to what we think of tively a feature of the mnemonic trace and may be reactivated at
as knowing a language and the latter refers to knowing things the time of retrieval. For instance, in their research on bilingual
in language (e.g., recalling a conversation). In either case, most recall for word lists, J. Altarriba and E. G. Soltano (1996) sug-
of what we know about encoding comes from studying its inter- gest that as bilinguals store concepts across languages, they
action with retrieval. also associate language-tags with the concepts that corre-
spond to the language in which the concepts were presented in
Encoding of Linguistic Knowledge the lists. Important recent work suggests that language-speciic
Linguistic knowledge includes the phonological forms, mor- information is deeply embedded at the level of semantic repre-
phosyntactic patterns, lexico-semantic items, pragmatics, and sentations, and new methods of investigation may be needed to
so on, that are held in long-term memory. It is information that explore how such encoding takes place and how the informa-
we usually produce automatically without attention to process- tion is reactivated at retrieval (Pavlenko 2008).
ing and for which we have no memory of the speciic contexts in At higher levels of complexity (beyond memory for information
which we acquired the individual items or skills. Studies in irst in words or phrases), memory for narratively organized personal
language acquisition focus on how individuals come to learn events, or autobiographical memories, also seems to be linguis-
(encode) all of the linguistic knowledge necessary to be compe- tically tagged. hus, research in this area shows that bilinguals
tent speakers of a particular language (Gass and Selinker 2001; recall events from their personal past in the language in which
Ritchie and Bhatia 1998). Such encoding involves complex inter- these events were encoded (Marian and Neisser 2000; Schrauf
actions between environmental input and physiological matura- 2000, Schrauf and Durazo-Arvizu 2006). hese results may also
tion. For instance, infants 46 months old can encode phonetic be explained by the principle of encoding speciicity because the
286
nonc/Statement (Foucault) Essentialism and Meaning
language in which an event took place (spoken, heard, written, other utterances (i.e., if certain procedures have been adhered
read) constitutes a feature of the mnemonic trace and predis- to) and if it takes place within an institutional setting (i.e., within
poses the individual to recall the event in that same language. a courtroom, by an appointed judge). Whereas Searles and
Austins deinition of speech-acts stressed the performative
Robert W. Schrauf
nature of such utterances the fact that they achieved something
WORKS CITED AND SUGGESTIONS FOR FURTHER READING in the real world Foucaults emphasis is much more on the fact
that statements bring about something because of their position
Altarriba, J., and E. G. Soltano 1996. Repetition blindness and bilingual within an institution and because of their interrelationship with
memory: Token individuation for translation equivalents. Memory
other discursive structures.
and Cognition 24: 70011.
For Foucault, the main reason for conducting an analysis of
DeKeyser, R., and J. Larson-Hall. 2005. What does the critical period
really mean? In Handbook Of Bilingualism: Psycholinguistic
statements is to discover the material and conceptual supports
Approaches, ed. J. F. Kroll and A. M. B. de Groot, 88108. New that allow it to be said and that keep it in place. hese support
York: Oxford University Press. mechanisms are both intrinsic to discourse itself and extra-
Francis, W. S. 1999. Cognitive integration of language and memory discursive, in the sense that they are sociocultural and institu-
in bilinguals: Semantic representations. Psychological Bulletin tional. Foucault is concerned to set statements in their discursive
125: 193222. frameworks; thus, statements do not exist in isolation since there
Gass, S. M., and L. Selinker. 2001. Second Language Acquisition: An is a set of structures that makes those statements make sense and
Introductory Course. Mahwah, NJ: LEA. gives them their force. hus, entry into discourse is seen to be
Marian, V., and U. Neisser. 2000. Language-dependent recall of auto- inextricably linked to questions of authority and legitimacy. Each
biographical memories. Journal of Experimental Psychology (General)
discursive act maps out the possible uses that can be made of the
129: 3618.
statement (although, of course, that is not necessarily what hap-
Pavlenko, A. 2008. Emotion mental lexicon. Bilingualism: Language
and Cognition 11.2: 14764.
pens to it). Each statement leads to others, and in a sense, it has
Ritchie, W. C., and T. K. Bhatia, eds. 1998. Handbook of Child Language to have embedded within it the parameters of the possible ways
Acquisition. San Diego, CA: Academic Press. in which future statements can be made.
Schrauf, R. W. 2000. Bilingual autobiographical memory: Experimental
Sara Mills
studies and clinical cases. Culture and Psychology 6: 387417.
Schrauf, R. W., and R. Durazo-Arvizu. 2006. Bilingual autobiographi-
cal memory and emotion: heory and methods. In Bilingual
Minds: Emotional Experience, Expression, and Representation, ed. Austin, J. 1962. How to Do hings with Words. Oxford: Clarendon Press.
Aneta Pavlenko, 284311. Clevedon, UK: Multilingual Matters. Dreyfus, H., and P. Rabinow. 1982. Michel Foucault: Beyond Structuralism
Snedeker, J., J. Geren, and C. L. Shafto. 2007. Starting over: International and Hermeneutics. Brighton: Harvester.
adoption as a natural experiment in language development. Foucault, M. [1969] 1972 in text. Archaeology of Knowledge. Trans.
Psychological Science 18.1: 7987. A. M. Sheridan-Smith. London: Tavistock.
Stager, C. L., and J. F. Werker. 1997. Infants listen for more phonetic detail Searle, J. 1979. Speech Acts. Cambridge: Cambridge University Press.
in speech perception than in word learning tasks. Nature 388: 3812.
Tulving, E. 1983. Elements of episodic memory. Oxford: Clarendon Press.
ESSENTIALISM AND MEANING
NONC/STATEMENT (FOUCAULT) What Is Essentialism?
he term nonc or statement was modiied and developed by he idea of essences has been important in Western philosophy
Michel Foucault to describe what for him constituted the small- since at least the time of Plato. Discussion of essences lourished
est element within a discursive structure (Foucault [1969] 1972). in classical and medieval philosophy and has been revived in
Foucault had some heated debates with others about the mean- recent decades by philosophers such as Saul Kripke, due primar-
ing of the word statement. In some of these discussions, some ily to work in modal logic, which is to say, the formalization of
critics asserted that the statement was the same as the speech- necessity and possibility. In the humanities, essentialism is often
act, as developed by John Austin and John Searle (see Dreyfus used to refer to any view that is not historicist. In philosophy
and Rabinow 1982, 449) for a fuller discussion). However, state- including the semantics of formal logic the term is used
ments the most fundamental building blocks of discourse do more narrowly for the belief that objects have deinitive features
seem to difer from speech-acts in important ways. Statements and incidental features. he incidental features may change with-
are those utterances or parts of text that have an efect. Statements out the identity of the object changing. However, if a deinitive
are not the same as sentences but are those utterances that can feature changes, then the objects identity changes. For example,
be seen to be grouped around one particular efect. hus, when a in suicient quantities and with the right light, water appears to
judge says, I sentence you to three years imprisonment, there be blue, but if we take a glass of water from a larger (blue) body,
are a number of these efects. he judge is institutionally sanc- it appears colorless. In scooping the glass out of the water, we
tioned and, therefore, the force of her/his pronouncement is to have altered the color of the water, yet we would not say that it
transform the accused into a criminal and to enforce a particular is a diferent thing than it was before. An incidental property of
sentence on that person. hus, I sentence you can be regarded appearance has changed, but the stuf itself remains the same.
both as a statement and as part of a discourse, since such a state- In contrast, suppose we take that glass of water and induce a
ment can only have efect if it is uttered within the context of chemical change so that it continues to appear clear but is no
287
Essentialism and Meaning
longer H2O. Its appearance would not have changed. However, that If someone lives in New Oslo, he or she speaks Norwegian.
we would be inclined to say that the stuf is not the same. In other But this is merely a contingent truth. After all, it could happen
words, its essence would have altered. that a monoglot Swedish speaker moves to New Oslo. he sit-
Essentialism is connected with semantics through natural uation is diferent with water. Some bit of non-H2O could not
kind terms. hese are terms that (putatively) refer to some nat- simply be added to the set of things that constitute water. he
urally delimited set of objects (including substances). Natural idea of possible worlds is a way of capturing this notion. Put sim-
kinds are distinguished from sets of objects that are merely ply, there are possible worlds in which not all residents of New
selected by human choice. In this view, a word such as water Oslo speak Norwegian. However, there are no possible worlds in
would be a natural kind term. which water is not H2O. hus, it is necessary that water is H2O.
Semantic essentialism is a form of meaning externalism hus, H2O is the essence of water. hus, water is a natural kind
in which our natural kind terms are deined (in part) by essences, term and its meaning is (partially) determined by the essence,
thus entities external to the minds of speakers. Crucially, this H2O. (Alternatively, it refers directly to H2O because that is the
holds even in cases where we do not know the essence in ques- essence of water.)
tion. In perhaps the most famous example of such a case, Hilary Insofar as we accept that there is a diference between nec-
Putnam (1975) imagined a Twin Earth that is identical with essary and contingent implications, and insofar as we accept
our earth, right down to our brain states when we use the word that in at least some of these cases the necessary implications
water. However, there is one key diference. he actual chemical have normative bearing on semantic relations (such as the rela-
composition of the stuf referred to as water on Twin Earth is tion between the word water, the things we call water, and the
not H2O, but something else call it XYZ. According to Putnam, essence, H2O), we seem to have committed ourselves to the exis-
water (as we use it) does not refer to the waterlike substance on tence of essences and to some form of semantic essentialism.
Twin Earth and it never did even hundreds of years ago, when Moreover, insofar as we accept the relation of all this to modal
we did not realize that our water is H2O. In other words, the nat- logic and possible worlds theory, we seem to have found a way
ural kind term, water, always referred to H2O and nothing else of determining what the essences are and how we might know
because, as a natural kind term, it was always deined (in part) them. Of course, possible worlds theory does not tell us that the
by the essence of its referents. (I say in part because essential- chemical composition of water is H2O. hat is learned empiri-
ist theories of meaning commonly allow for various semantic cally. What it (purportedly) tells us is that once we know the
components. For example, in setting out the meaning of water, chemical composition, we thereby know the essence, because
Putnam [1975, 269] includes syntactic markers, such as noun, that is what is unchanging across possible worlds.
semantic markers, such as liquid, and stereotypical properties, Nevertheless, on relection, it may be that things are not that
such as being colorless. But none of these determines the exten- clear. Perhaps we have simply noticed that identity is preserved
sion of the term, thus what the term refers to [see reference under a construal (as G. E. M. Anscombe [1963] might have put
and extension].) it). If I scoop out a glass of water and it stops being blue, then it is
At least two sets of issues arise in connection with the relation still the same under the construal water. However, it is not the
between essentialism and meaning. he irst set concerns the same under the construal blue. In other words, we would not
essences. he second set concerns the words for which essences say that the set of blue things now includes something clear. Note
determine the referents. We consider each in turn. that the point holds even if we dont quite know what water is (in
terms of chemical composition) or what being blue is (in terms
Essences, Causes, and Possible Worlds of light relection). Moreover, the point is not conined to terms
Saying that essences are deinitive properties works well enough such as water and blue. It appears to extend across the board.
as a way of introducing the general notion. But it can hardly Somewhat reminiscent of the problems faced by Platonic essen-
stand as an ontology. Yes, we think that the stuf in the glass has tialism in the Parmenides, these points may seem to suggest a
changed its identity after the chemical reaction, but not previ- promiscuous multiplication of essences. his would efectively
ously when it was scooped out of the pool. But what does that undermine any reason for isolating essences in the irst place.
tell us? (Conversely, these are just the sorts of phenomena that a nones-
Here we might subdivide the problem into whether or not sentialist approach to semantics might lead us to expect; see, for
essences exist, just what essences there are, and how we might example, meaning and stipulation.)
access these essences. Modal logic, with its associated possible On the other hand, none of this really counts for much, either
worlds theory, serves as a preliminary way of approaching all way. In all these cases, we are relying on intuition. (Kripke is
three issues. Again, modal logic treats relations of necessity and explicit about the role of intuition in essentialism; see, for exam-
possibility. For example, suppose it is necessary that p entails q. ple, [1972] 1980, 1012, 39, and 42; see also Putnam 1975, 271).
Suppose also that it is necessary that p. It follows that it is nec- he rules for modal entailment are ixed formally. However,
essary that q. hus, suppose it is necessary that if substance w is the truth or falsity of our premises is not ixed either formally or
water, then substance w is H2O. Suppose also that substance w is through empirical study. It is ixed only by our intuitions. But
necessarily water (i.e., w could not possibly be just this substance just what do our intuitions tell us? Do they tell us about meta-
and not be water). It follows that substance w is necessarily H2O. physical possibility and necessity? Or do they tell us something
Here, the question arises as to what diferentiates this necessity about the way our minds operate in construing possibility and
from ordinary truth. Suppose everyone in a particular town in necessity. It seems much clearer that our intuitions bear on the
Minnesota call it New Oslo speaks Norwegian. It is then true latter (whether or not they bear on the former). Indeed, from
288
Essentialism and Meaning
the perspective of cognitive neuroscience and evolutionary water that looks, tastes, and quenches thirst like H2O, even if it
psychology, our intuitions in these matters are unsurpris- has a diferent chemical composition? his brings us to our sec-
ing. At an early age, we begin to attribute essences (or hidden ond topic.
structures, as Putnam would say) to certain objects (see, for
example, Boyer 2001, 10620, and citations). hat attribution is Words and Meanings
simply a form of semantic organization that operates in the usual What, then, prevents us from grouping together all the things
adaptive way (see adaptation). Speciically, it is a simpli- that we can drink, or all the things that we can see through, rather
ied mechanism that has adaptive value because it approximates than all the things that have the same chemical composition
a function (for more on this distinction, see Hogan 2007). he (say, H2O)? Well, in fact, nothing prevents us from doing that.
function, in this case, is causal inference. Causal inference can We refer to things that we can drink as beverages and things that
be a slow, complex process. Attributing essences to kinds (e.g., to we can see through as clear. An essentialist can respond to this
water and to tigers) allows us to draw causal inferences quickly by saying that it is irrelevant. Only natural kind terms are linked
and with a great deal of accuracy. Speciically, it facilitates the with essences; beverage, clear, treat, and so on, simply do
exploitation of opportunities (e.g., for quenching thirst) and the not refer to natural kinds.
avoidance of threats (e.g., of being eaten). Given the preceding causal account of essences, it would
Perhaps, then, we should incorporate causality into our under- seem that a natural kind term is any term used to refer to an
standing of essences; perhaps we should say that the essence of object or substance that has some central causally consequential
an object is whatever property explains its other properties. Of property (or perhaps a small number of such properties). hus,
course, not every object or substance has such a causal nexus. water is a natural kind term because its molecular composition
Natural kinds, however, do. hus, water has a range of proper- has unique causal importance, explaining a wide range of other
ties that may be explained by it being H2O. Its being colorless properties. But objects are clear due to speciiable properties as
and its function in quenching thirst, for example, are explained well. Although the molecular composition of water and glass are
by its chemical composition. Conversely, its being colorless diferent, they share properties that allow the passage of light. So,
and its function in quenching thirst do not explain its chemical by this criterion, it would seem that clear should count as a natu-
composition. ral kind term. he same point could be made even about such
his causal criterion turns us away from speculation on pos- extreme cases as treat (roughly, something that someone likes a
sible worlds toward actual empirical science. Our conclusions lot but experiences only rarely), so long as being a treat is open to
might still be framed in terms of modal logic and possible worlds. causal explanation.
However, it is not clear that this will add anything to our under- As the importance of causal relations may suggest, these
standing. Speciically, in order to make the connection between issues bear on particulars as well as classes. (After all, real causal
empirical science and possible worlds, we have to rely once sequences are themselves particular.) Indeed, from the start, we
more on intuitions or we have to hold real-world causality con- have implied that in essentialist theory, particulars have essences,
stant across all possible worlds, which merely makes the modal as when we said that a bit of water is the same before and after
logic a translation of our empirical causal analysis. For example, we scoop it out. But this returns us to our earlier question about
consider again Putnams case of Twin Earth XYZ. If H2O and XYZ considering objects under a certain construal. Is there any issue
have no causally distinct consequences, even in chemical tests, of this bit of stuf really being or not being the same individual?
then how do we decide if water on our Earth means/refers to Or is the only issue whether or not it is the same under the con-
something diferent from what water means/refers to on Twin strual water?
Earth? We have only intuitions based on what is in efect a form hese questions are related to the problem of just how we
of Cartesian doubt. Ex hypothesi, we couldnt know or even come ix the relation between a word and a referent. In the context of
to suspect that there is a diference based on evidence. On the standard essentialist semantics, this is to say that it is related to
other hand, if there are causally diferentiating consequences, Kripkes idea of rigid designation. In this view, certain sorts of
then distinguishing between the two is merely making a causal terms names and natural kind terms rigidly designate their
and empirical division for which modal logic and possible worlds referents. A term rigidly designates its referent if it designates that
seem superluous. referent in all possible worlds (Kripke [1972] 1980, 48). hus water
his raises a further question. If we are simply seeking caus- designates H2O in all possible worlds. he same point holds for
ally crucial properties, just what is added to this by the term names. Al Gore refers to the same person in all possible worlds.
essence? What reason do we have to assert that a causally cru- In contrast, deinite descriptions are not rigid designators. he
cial property even a uniquely causally crucial property deter- actual winner of the 2000 presidential election refers to George
mines the identity of a substance? Isnt this something we have W. Bush in some possible worlds. In Kripkes view, almost any
merely stipulated reasonably, perhaps, but without any neces- property of an individual can be altered. Al Gore might really
sity beyond facilitating the achievement of certain practical tasks have lost the 2000 presidential election is a perfectly plausible
(such as doing certain sorts of things with water)? Again, a wide counterfactual, unlike Al Gore might not have been Al Gore.
range of properties of a given substance derive from the puta- But what does it mean to say that Al Gore is the same individual
tively deinitive property. But shouldnt we still be free to pick in all possible worlds?
out one of those other properties as deinitive, depending on our Kripke explains this by explaining the ixing of reference.
interests or speciic tasks? he point has semantic consequences For Kripke, reference is ixed by a causal sequence that is, a
as well as ontological ones. For example, cant we call anything sequence of transmission leading back to an initial linking of
289
Essentialism and Meaning Ethics and Language
a term with an object. Water names H2O because its use leads WORKS CITED AND SUGGESTIONS FOR FURTHER READING
back to a link with a certain substance having a certain essence. Anscombe, G. E. M. 1963. Intention. 2d ed. Ithaca, NY: Cornell University
Al Gore names Al Gore because its use leads back to a cer- Press.
tain individual, tracing a chain of transmission in reverse. (he Boyer, Pascal. 2001. Religion Explained: he Evolutionary Origins of
chain of transmission, presumably went from Gores parents to Religious hought. New York: Basic Books.
their friends [Bob, Trudy this is baby Al], and so on.) his is Hogan, Patrick Colm. [1996] 2008. On Interpretation: Meaning
called the causal theory of reference. (his intentional or seman- and Inference in Law, Psychoanalysis, and Literature. 2d ed.
tic causality is, of course, diferent from the physical causality Athens: University of Georgia Press.
of putatively essential properties, such as chemical composition. . 2007. Laughing brains: On the cognitive mechanisms and repro-
In order to keep the two distinct, I refer to the latter as physical ductive functions of mirth. Semiotica 165.1/4: 391408.
Kripke, Saul. [1972] 1980. Naming and Necessity. Cambridge: Harvard
causality in the remainder of this entry.)
University Press.
As with natural kinds, however, there has to be some limit on
Lewis, David. 1986. Philosophical Papers. Vol. 2. Oxford: Oxford University
just what we can vary about Al Gore in order to guarantee that Press.
he is the same person across possible worlds. It is not clear that Putnam, Hilary. 1975. Mind, Language, and Reality. Cambridge: Cambridge
there is any property that has the sort of physical causal force University Press.
that chemical composition has for water. Relying on intuitions Salmon, Nathan. 1981. Reference and Essence. Princeton, NJ: Princeton
about possible worlds, Kripke concludes that the earliest physi- University Press. An important, relatively early critical analysis of ref-
cal causal factors are the crucial ones here. It seems to me, he erential essentialism.
writes, that anything coming from a diferent origin would not Sosa, Ernest, and Enrique Villanueva, eds. 2006. Philosophy of lan-
be this object ([1972] 1980, 113). But here again we run into the guage. Philosophical Issues 16 (Special Issue). Includes recent essays
treating the extensive literature on these issues.
issue of just what intuition tells us. Put diferently, do we learn
anything about identity here, or do we only learn something
about the general importance of physical causality for the way ETHICS AND LANGUAGE
in which we think about the world, and thus the way we draw
his topic quickly threatens to become an unmanageably broad
intuitive inferences about identity? In other words, does this tell
because every ethical theory that regards value judgments as
us something epistemological and ontological, or simply some-
subject to rational evaluation for example, utilitarianism,
thing psychological and evolutionary?
Habermasian discourse ethics, Rawlsianism, virtue ethics,
Scanlons contractarianism, and neo-Kantianism contains
Conclusion
an account of the meaning of moral terms. But an account of
If they are valid, the preceding arguments may suggest that,
all of those theories would require a whole volume and would
ultimately, there is no real issue of essences (or essential iden-
take us away from issues speciically about language. Instead,
tity) in semantics. (For further discussion of these issues, see
I focus on an issue that, impinging as it does on the ield of
Hogan [1996] 2008, Preface, Chapter 2, and Chapter 3, partic-
lexical semantics on the one hand and the truth valuedness
ularly ixxi and 6370.) However, there are important issues of
of propositions on the other, is certainly linguistic and has
physical causal analysis (issues to which the active develop-
been central in both the philosophy of language and metaethics
ment of essentialist theories has helped to draw our attention).
for more than a century: the issue of cognitivism versus noncog-
More exactly, there is no issue of metaphysics and the founda-
nitivism. I describe how one form of noncognitivism, logical
tions of epistemology (requiring guidance by possible worlds
positivism, became inluential, especially in the social sci-
theorization). Rather, there is only 1) an issue of empirical
ences, and how the logical positivists arguments came to be dis-
science regarding the physical causal properties of objects (or
puted. But irst, of course, we need to deine our terms.
substances) and 2) the related, also empirical, issue of how our
A cognitivist (with respect to ethical discourse) holds that at
brains organize the world in terms of physical causality, iden-
least some ethical statements (e.g., George did a good thing
tity relations, and so on. hese forms of empirical study, then,
when he saved that child) are true. A noncognitivist holds that
may impact our descriptive account of the way in which mean-
such statements are not truth-apt; no ethical statement is either
ing operates. hey should also have consequences, of a more
true or false. (he logical positivists used to say that such state-
limited sort, for our normative deinitions of terms, particularly
ments are cognitively meaningless.) he following is an excep-
in scientiic contexts where precise physical causal analysis is
tionally aggressive statement of the noncognitivist position: All
paramount.
statements belonging to Metaphysics, regulative Ethics, and
On the other hand, not everyone agrees with these argu-
(metaphysical) Epistemology have this defect, are in fact unveri-
ments far from it, in fact. Essentialism is an important and
iable and, therefore, unscientiic. In the Viennese Circle, we are
highly inluential position, and advocates of essentialist seman-
accustomed to describe such statements as nonsense (Carnap
tics have responses to the preceding claims. For example, some
1934, 26).
writers would insist that we should not take causality as a primi-
Although most cognitivists regard ethical statements as true or
tive notion. Rather, we need to explain causality in terms of possi-
false sans phrase, there are quasi-realist positions (Blackburn
ble worlds (see Lewis 1986, 157269), leading us back to modality
1984), according to which it is linguistically appropriate to predi-
and, presumably, essences.
cate true of an ethical assertion, but the word true doesnt have
Patrick Colm Hogan the same function as it does when we say of a scientiic claim that
290
Ethics and Language
it is true. (True doesnt ascribe realist truth when applied to alone, and which Quine generalized so that it becomes evident
an ethical statement.) he position of Bernard Williams (1985), that not only other scientiic principles, but also the logic and
according to which scientiic statements aspire to absolute mathematics we use in our explanatory and predictive reason-
truth while ethical statements can be true in a particular coming are implicated (ibid., 255), White concluded that we may
munitys conceptual scheme but not absolutely true, has a say that just as Duhems view, when pressed to the extreme,
close relation to this quasi-realism. Evidently, such positions makes it diicult to maintain a radical separation between the
are noncognitivist in spirit, even though writers who adopt such analytic and the synthetic and the method of establishing logical
positions acknowledge that the word true can be used in ethical as opposed to empirical truth, so the view we have advocated will
discourse. break down the remaining dualism between logic-cum-empiri-
cal science and ethics (ibid., 256; see also Walsh 1987).
Logical Positivism
By calling ethical assertions nonsense, Rudolf Carnap meant Expressivism and Thick Ethical Concepts
not only that they lack truth value but that they are outside Carnaps claim that ethical sentences are nonsense is simply
the sphere of rational argument altogether. he real world not believable if nonsense is supposed to have the meaning it
inluence of this doctrine was enormous. For example, Lionel normally has. And Carnap continues in a way that makes it even
Robbins (1932), one of the most inluential economists of the more unbelievable. Conceding that there is some sense in which
1930s, enthusiastically endorsed it, as did Milton Friedman and sentences of metaphysics and ethics (and poetry!) are meaning-
Paul Samuelson. And the idea of value-free science obviously ful, he writes: We do not intend to assert the impossibility of
inluenced other social sciences as well associating any conceptions or images with these logically invalid
Even a critic of logical positivism must grant that it had one statements. Conceptions can be associated with any arbitrarily
enormous virtue, and that was its capacity for self-criticism. If compounded series of words; and metaphysical statements are
logical positivists seemed to the economists just mentioned to be richly evocative of associations and feelings both in authors and
logicians of science who had discovered how to demarcate the readers (1934, 26). Obviously, the lines of Jabberwocky are
cognitively meaningful from nonsense, the positivists them- also richly evocative of associations and feelings as well, but
selves were dissatisied with their formulations of the supposed All slithy were the borogroves and John is a cruel parent are
demarcation principle and constantly revised it (Hempel 1963; linguistically very diferent indeed!
see also Putnam 2002, 727). A year later, Carnap is a bit more sophisticated: [A] value
Besides criticisms faced from within the movement itself, statement is nothing else than a command in misleading gram-
the attempts to formulate a criterion of cognitive meaningful- matical form (1935, 25). But there are many diferences between
ness encountered criticisms from W. V. Quine, a lifelong friend imperatives and ethical statements. Carnap has failed to dis-
of Carnaps who shared his admiration for science and symbolic tinguish assertions of very diferent kinds.
logic and his noncognitivism with respect to ethics. More sophisticated attempts by logical positivists and their
Basically, all the logical positivist formulations presupposed 1) allies to explain the sense in which ethical sentences are mean-
that all meaningful language except for pure mathematics and ingful were soon made. he most important for a long time were
logic could be reduced to observation terms (it was supposed to Alfred Jules Ayer (1936) and Charles Stevenson (1944). Ayer
be clear which these are) and 2) that mathematics and logic are held that the function of ethical sentences is to express emo-
analytic or tautologous. Quine ([1951] 1961) famously demol- tions (hence, the term emotivism for this version of noncognitiv-
ished these two dogmas of empiricism, as he called them, and ism). Stevenson identiied the function of ethical sentences with
urged to the satisfaction of almost all philosophers of science expressing and inluencing attitudes. He further claimed that
and philosophers of mathematics that neither the program of the disagreements that occur in science, history, biography are
reducing all meaningful language to the positivists observation disagreements in belief, whereas it is disagreements in atti-
vocabulary nor the idea that mathematics consists of tautolo- tude that chiely distinguish ethical issues from those of sci-
gies (or, alternatively, truths by convention) is defensible. he ence (1944, 13). he task of explaining just what an attitude is
pillars on which the positivist criterion of demarcation rested fell could be left to psychology, Stevenson thought.
in 1950. he family of noncognitivist positions that regards ethical
Just as it was a friend of Carnap who rebutted many of Carnaps assertions as having the function of expressing attitudes is today
claims, so it was a friend of Quine, Morton White, who pointed known as expressivism; the most sophisticated contemporary
out that with the positivists criterion of cognitive signiicance statement of this position comes from Allan Gibbard (1990).
demolished, the whole basis for the positivists claim that ethical Another family of noncognitivist positions (foreshadowed by
sentences are nonsensical was also gone. First of all, the notion Carnaps description of ethical statements as commands in
of an observation is extremely unclear: Why isnt I saw X steal misleading grammatical form) holds that ethical statements
Ys wallet? an observation sentence? (Incidentally it would have a basically imperative function. he most famous statement
seem that stealing is a fairly clear notion by comparison to being of prescriptivism is by R. M. Hare (1952; see also Reichenbach
an observable predicate [White 1956, 109].) Secondly, after 1951).
pointing out that once we accept the holistic view of conirma- It will be noted that these positions concern the function of
tion originally proposed by Pierre Duhem, according to which ethical sentences as wholes. But ethical sentences have parts;
scientiic explanation and prediction puts to the test a whole in particular, they contain ethical predicates. And cognitivists
body of beliefs, rather than the one which is ostensibly under test in ethics, in addition to attacking the logical positivist roots of
291
Ethics and Language Ethnolinguistic Identity
emotivism and prescriptivism, stress the fact that ethical words McDowell, John. [1981] 1998. Non-cognitivism and rule-following. In
have descriptive as well as evaluative functions, thereby attack- Mind, Value and Reality, 198218. Cambridge: Harvard University Press.
ing emotivism and prescriptivism as inadequate accounts of the Originally published in Wittgenstein: To Follow a Rule, ed. Stephen H.
lexical semantics of ethical sentences. he idea that certain con- Holtzman and Christopher M. Leich, 14172. London: Routledge.
Murdoch, Iris. 1967. he Sovereignty of Good Over Other Concepts.
cepts used to describe events, people, and actions in ethical dis-
course have simultaneously evaluative and descriptive functions
Putnam, Hilary. 1981. Reason, Truth and History. Cambridge: Cambridge
attracted wide philosophical attention after the appearance of University Press.
Williams (1985), who referred to such concepts as thick ethical . 2002. he Collapse of the Fact/Value Dichotomy and Other Essays.
concepts. Williams himself says he irst encountered the notion Cambridge: Harvard University Press.
in a seminar given by Phillippa Foot and Iris Murdoch in the Quine, W. V. O. [1951] 1961. Two dogmas of empiricism. In From a
1940s; Murdochs he Sovereignty of Good Over Other Concepts Logical Point of View (2d ed.), 2046. Cambridge: Harvard University
(1967) is partly about such concepts, although she doesnt use Press.
this terminology (see also Putnam 2002). hese authors argue Reichenbach, Hans. 1951. he Rise of Scientiic Philosophy.
that to master the use of cruel, pert, deceit, propaganda, Berkeley: University of California Press.
brave, reasonable, and other thick words, one has to be Robbins, Lionel. 1932. On the Nature and Signiicance of Economic
Science. London: Macmillan.
able to identify, at least in imagination, with an ethical point of
Stevenson, Charles. 1944. Ethics and Language. New Haven, CT: Yale
view. Although Stanley Cavell does not use the term thick ethical
University Press.
concept, he does ague that the use of such words also requires Walsh, Vivian. 1987. Philosophy and economics. In he New Palgrave: A
the acquisition of a number of practices, such as apologizing, Dictionary of Economics. Vol. 3. Ed. J. Eatwell, M. Millgate, and P.
explaining why one did something when there is an ethical chal- Newman, 8619. London: Macmillan.
lenge, and ofering excuses (1979, Part III). In short, the picture of White, Morton. 1956. Towards Reunion in Philosophy. Cambridge: Harvard
these terms as merely expressing attitudes is naive. University Press.
he noncognitivist response has been to claim that the mean- Williams, Bernard. 1985. Ethics and the Limits of Philosophy.
ing of a thick ethical term like cruel can be factored into two com- Cambridge: Harvard University Press.
ponents: causes deep sufering (this would be the descriptive Zif, Paul. 1960. Semantic Analysis. Ithaca, NY: Cornell University Press.
component) and an attitude of moral disapproval (this would be
the evaluative component) (Hare 1981, 72). Hilary Putnam (1981,
ETHNOLINGUISTIC IDENTITY
2035) and John McDowell ([1981] 1998, 2012) argue that there
is no reason to believe that such a disentangling manoeuvre According to Epicurus (Letter to Herodotus), the diferent lan-
(as McDowell calls it) is in general possible. (his issue obviously guages of the world arose historically from the diferences in feel-
impinges directly on the concerns of linguists, as well as moral ings and sensory perception among peoples: [M]ens natures
philosophers.) Last but not least, the inal chapter of Paul Zifs according to their diferent nationalities [ethn] had their own
Semantic Analysis (1960) contains an interesting argument that peculiar feelings and received their peculiar impressions, and
expressivist and prescriptivist analyses of the paradigm thin so each in their own way emitted air formed into shape by each
(or purely evaluative) term good are unacceptable on purely of these feelings and impressions, according to the diferences
linguistic grounds. made in the diferent nations by the places of their abode as well
(Bailey 1926, 756). he idea of a strict linkage between a peo-
Hilary Putnam
ple and its language is also found in the Book of Genesis (10:5),
WORKS CITED AND SUGGESTIONS FOR FURTHER READING when, after the Flood, Noahs descendants spread out over the
earth, every one after his tongue, after their families, in their
Ayer, Alfred Jules. 1936. Language, Truth and Logic. London: Victor nations (King James Version).
Gollancz.
he Bible is the most direct source of the modern (post-Re-
Blackburn, Simon. 1984. Spreading the Word. Oxford, Oxford University
naissance) conception of the nation as a people linked by birth,
Press.
. 1993. Essays in Quasi-Realism. Oxford, Oxford University Press.
language, and culture and belonging to a particular place. his
Carnap, Rudolf. 1934. he Unity of Science. London: Kegan Paul, Trench, had not been the general European way of thinking prior to the
Trubner, & Co. Renaissance, when religious belonging provided a irst division
. 1935. he Logical Syntax of Language. London: Kegan Paul, among peoples and dynastic rule a second. Language meant
Trench, Trubner & Co. Latin, pan-European, and largely insulated from the vernacular
Cavell, Stanley. 1979. he Claim of Reason. Oxford: Clarendon Press. dialects spoken by most people going about their daily lives.
Gibbard, Allan. 1990. Wise Choices, Apt Feelings. Cambridge: Harvard hese vernaculars were not thought of as language or as having
University Press. Oxford: Oxford University Press. any importance beyond the practical needs of communica-
Hare, R. M. 1952. he Language of Morals. Oxford: Clarendon Press.
tion, whereas Latin was the sacred vehicle of divine rites and
. 1981. Moral hinking: Its Levels, Methods and Point: Oxford: Oxford
divine knowledge.
University Press.
As the Reformation increased access to the Bible, the sense
Hempel, C. G. 1963. Implications of Carnaps work for the philosophy
of science. In he Philosophy of Rudolf Carnap, ed. P. A. Schilpp, 685 of national belonging and the nation-language nexus spread
710. La Salle, IL: Open Court. London: Cambridge University Press. (see nationalism and language). Concern arose for ver-
Horgan, Terry, and Mark Timmons. 2006. Metaethics after Moore. naculars to be raised to the status of the language, making
Oxford: Oxford University Press. them eloquent, able to fulill some of the functions previously
292
Event Structure and Grammar
reserved for Latin. his would come to be perceived as a duty to atelic events. Event structure is also distinct from viewpoint or
the nation. he biblical-cum-modern conception of nation and grammatical aspect. he event in (a) is telic, and in the speak-
language remains powerful today, despite having been weak- ers viewpoint the action is ended. In grammatical aspect, this
ened by various attempts to overthrow it. Among these, marx- is a perfective aspect. In (c), like in (a), the event is telic as it will
ism, with its internationalist aims, was the most potent. Research end in a change of state and a boundary reached in the melting
into ethnolinguistic identity is at the heart of a broader program process. However, the speaker views the process as incomplete.
of inquiry into language and identity and forms a key aspect his is an imperfective aspect, while in event structure this is a
of the understanding of nationalism. According to social identity telic, bounded process. It is interestingly that (d) is seen as both
theory, national and ethnic identities are grounded in the knowl- an imperfective aspect and as an atelic event. he action is not
edge that individuals have of membership of a social in-group. completed in the speakers viewpoint presentation and, given
Anyone we do not perceive as a member gets classiied into an the nature of the object or theme, unlike the object of melting in
out-group which can come to represent not just the Other, but (a) and (c), the object of pushing will not change state, and thus
the hreat, the Enemy. (See also stereotypes.) is an unbounded, atelic event.
Taken to extremes, ethnolinguistic identity always becomes
oppressive, but kept within bounds, it is a positive force, help- Typology of Events
ing to give people a sense of who they are, anchoring their lives, Event types are often distinguished based on Z. Vendlers (1957)
and helping them avoid feelings of alienation. Since language classic typology of lexical aspect or Aktionsarten, a German term
and nation are conceptually so closely bound together, it is not for kinds of action, which groups verbs into subclasses based
surprising that the politics of language choice rarely depends on on their temporal features. hese subclasses or event types are
purely functional criteria, such as the language that will be most processes (e.g., activities such as walk and run), accomplish-
widely understood. he symbolic and emotional dimensions of ments (events that culminate by the use of temporal adverbials,
ethnolinguistic identity are powerful enough that language e.g., build or cook in an hour), and achievements (instantaneous
policies that ignore them are likely to prove dysfunctional in events that inish in a short time period, e.g., win and ind). Event
the long run. itself is part of a larger notion called situations, divided into
two categories: events and states (e.g., know and love) (Mani,
John E. Joseph
Pustejovsky, and Gaizauskas 2005). Hence, the term situation
aspect is often used in preference to the term lexical aspect.
WORK CITED
Bailey, C., ed. and trans. 1926. Epicurus: he Extant Remains. Main Theoretical Approaches
Oxford: Clarendon. Event structure has become an important part of grammar stud-
ies, especially in the debate about the exact nature of the relation-
ship between syntax and semantics. Two of many questions
EVENT STRUCTURE AND GRAMMAR
are often posed: At what level of the grammar should we repre-
he concept of event structure is prominent in many disciplines, sent event structure, and how should we indeed represent this
such as cognitive science, computer science, linguistics, and phi- notion? Researchers difer on answers to these questions. he
losophy. Within grammar studies, event structure concerns the following include the main formal theoretical approaches:
level of linguistic representation of a basic unit or organization of
(i) Lexical and decompositional approaches: Under lexi-
thought corresponding to individual acts or occurrences in the
cal approaches, event structure, in particular telicity, is
world. Speakers attempt to conceptualize and express this unit
an inherent property of lexical items and is represented
or organization the event structure by the use of natural lan-
in the lexicon (e.g., Vendler 1957; Levin 1999). A subset of
guage elements, such as words, phrases, and sentences.
lexical approaches includes decompositional approaches
in which event structure is computed on the basis of a set
Event Structure, Tense, and Aspect
of semantic primitives, which are then used to character-
An understanding of event structure requires a diferentia-
ize the meaning of every word in the language (e.g., Schank
tion between it and such related terms as tense and aspect.
1975; Jackendof 1991). For instance, in R. Jackendofs
Consider the following sentences that capture various situations:
decompositional approach, an accomplishment event like
(a) he cook melted the butter. X closes Y is represented on the basis of primitives such as
(b) he farmer pushed the wheelbarrow. CAUSE and BECOME: X closes Y is decomposed as X CAUSE
Y TO BECOME NOT OPEN and is represented as CAUSE
(c) he cook was melting the butter.
(X, BECOME (NOT (OPEN (Y)))).
(d) he farmer was pushing the wheelbarrow.
(ii) Compositional approaches: Event structure is represented
Sentences (a) and (b) both encode events that took place in the in the lexical structure where telicity is a lexical property and
past, and are thus encoded by the past tense forms of the verbs, can be computed from the lexical entry on the basis of accom-
melted and pushed. But there is a crucial diference between the panying material in the verb phrase (e.g. Folli 2001; Pustejovsky
two in terms of the nature of the event. he melting has an end 1991; Goldberg 2006). For instance, in J. Pustejoveskys event
point but the pushing could continue forever. Events with an composition approach, sometimes referred to as a gener-
end point are termed telic, while those without an end point are ative approach, an accomplishment event like X closes Y
293
Event Structure and Grammar Evidentiality
is represented as a preparatory stage of ACT (X, Y) & NOT Filip, Hana. 2000. he quantization puzzle. In Events as Grammatical
(CLOSED (Y)) and a result state of CLOSED (Y). Objects: he Converging Perspectives of Lexical Semantics and Syntax,
ed. Carol Tenny and James Pustejovsky, 3993. Stanford, CA: CSLI
(iii) Semantic approaches: hese approaches represented in
Publications.
works such as Krifka (1998) and Filip (2000) are quite distinct Folli, R. 2001. Constructing telicity in English and Italian. Ph.D. diss.,
from the lexical semantic approaches like Jackendofs works. Oxford University.
hese logical semanticists rely more on truth-conditional Goldberg, Adele. 2006. Constructions at Work: he Nature of Generalization
resources of words and sentences to refer to the semantics of in Language. Oxford: Oxford University Press.
events. Jackendof, R. 1991. Parts and boundaries. Cognition 41: 945.
(iv) Syntactic approaches: hese approaches represented by Krifka, Manfred. 1998. he origins of telicity. In Events and Grammar,
ed. Susan Rothstein, 197235. Dordrecht: Kluwer.
works such as Ritter and Rosen (1998, 2000), Travis (2000),
Levin, B. 1999. Objecthood: An event structure perspective.
Butt and Ramchand (2001), and Borer (2005) take the posi-
In Proceedings of CLS 35: Part 1: he Main Session, 22347.
tion that the event is nonlexical and argue that the event type Chicago: Chicago Linguistic Society.
is read of of the clausal functional projections. Mani, Inderjeet, James Pustejovsky, and Robert Gaizauskas. 2005. he
hese lexical, decompositional, compositional, semantic, and Language of Time: A Reader. Oxford: Oxford University Press.
Pustejovsky, J. 1991. he syntax of event structure. Cognition 41: 4781.
syntactic approaches may overlap and are thus not to be seen
Ritter,E.,and S.T.Rosen.1998.Delimitingeventsinsyntax.InheProjection
as clearly delineated alternatives. he lexical approaches, while
of Arguments: Lexical and Compositional Factors, ed. M. Butt and
intuitive, impose a lot of burden on the lexicon. Decompositional W. Geuder, 13564. Stanford, CA: CSLI Publications.
approaches are also intuitive, but a problem is that there is usu- . 2000. Event structure and ergativity. In Events as Grammatical
ally no general agreement on what primitives to set. he compo- Objects, ed. C. Tenny and J. Pustejovsky, 187238. Stanford, CA.: CSLI
sitional, semantic, and syntactic approaches may also have their Publications.
weaknesses but these are among the most promising. Schank, R. C. 1975. Conceptual Information Processing. Amsterdam:
North-Holland.
Future Trends Tenny, C., and J. Pustejovsky, eds. 2000. Events as Grammatical
Beyond the current state, two main trends may be noted. First, Objects: he Converging Perspectives of Lexical Semantics and Syntax.
event structure is currently studied with reference to words and Stanford, CA: CSLI Publications.
sentences, mostly in isolation. In the future, we need to study it Townsend, D., M. Seegmiller, R. Folli, H. Harley, and T. Bever. 2003.
in context, such as in speech and texts. D. Townsend and col- Processing Events in Sentences and Texts. Upper Montclair, NJ: Montclair
State University Press.
leagues (2003) lead this trend. Second, current research mostly
Travis, L. 2000. Event structure in syntax. In Events as Grammatical
studies how diferent types of objects and other functions inlu-
Objects, ed. C. Tenny and J. Psutejovsky. Stanford, CA: CSLI
ence telicity. But we also ought to look at how event structure in
Publications.
complex verbal constructions, such as serial verbs, is computed. Vendler, Z. 1957. Verbs and times. Philosophical Review 66.2: 14360.
lexical-functional grammar analyses like Bodomo (1993,
1997) and Alsina, Bresnan, and Sells (1997) lead this trend.
Tenny and Pustejovsky (2000), Mani, Pustejovsky, and EVIDENTIALITY
Gaizauskas (2005), and Dolling, Heyde-Zybatow, and Schafer his is a grammatical category that has source of information as
(2007) are further recent book-length readings that put most of its primary meaning whether the narrator actually saw what is
these issues in perspective. being described, made inferences about it based on some evi-
Adams Bodomo dence, or was told about it, and so on. Tariana, an Arawak lan-
guage from Brazil, has ive evidentials marked on the verb. If I
WORKS CITED AND SUGGESTIONS FOR FURTHER READING saw Jos play football, I will say Jos is playing-naka, using the
visual evidential. If I heard the noise of the play (but didnt see it),
Alsina, A. Joan Bresnan, and Peter Sells. 1997. Complex Predicates.
I will say Jos is playing-mahka, using the nonvisual. If all I see
Stanford, CA: CSLI.
Bodomo, A. 1993. Complex predicates and event structure: An inte- is that Joss football boots are gone and so is the ball, I will say
grated analysis of serial verb constructions in the Mabia languages of Jos is playing-nihka, using the inferential. If it is Sunday and
West Africa. Working Papers in Linguistics 20, Trondheim, Norway. Jos is not home, the thing to say is Jos is playing-sika since
. 1997. A conceptual mapping theory for serial verbs. In On-line my statement is based on the assumption and general knowl-
Proceedings of LFG97, ed. M. Butt and T. King, CSLI, Stanford edge that Jos usually plays football on Sundays. And if the infor-
University. Available online at: http://www-csli.stanford.edu/publi- mation was reported to me by someone else, I will say Jos is
cations/LFG2/bodomo-lfg97.html. playing-pidaka, using the reported marker. Omitting an eviden-
Borer, H. 2005. Structuring Sense. Vol. 2. he Normal Course of Events. tial results in an ungrammatical and highly unnatural sentence.
About a quarter of the worlds languages have some gram-
Butt, Miriam, and Gillian Ramchand. 2001. Complex aspectual struc-
matical marking of information source. he systems vary in their
ture in Hindi/Urdu. Oxford Working Papers in Linguistics, Philosophy
and Phonetics 6: 130. M. Liakata, B. Jensen, and D. Maillat edited this
complexity. Some distinguish just two terms. An eyewitness ver-
volume. sus non-eyewitness distinction is found in Turkic and Iranian
Dolling, Johannes, Tatyana Heyde-Zybatow, and Martin Schafer. 2007. languages. Other languages mark only the nonirsthand infor-
Event Structures in Linguistic Form and Interpretation. Berlin: Walter mation, for example, Abkhaz, a northwestern Caucasian lan-
de Gruyter. guage. Numerous languages express only reported, or hearsay,
294
Evidentiality Evolutionary Psychology
information, for example, Estonian. Quechua languages have a frequent source for reported and quotative evidentials, and
three evidentiality speciications: direct evidence, conjectural, the verbs feel, think, hear can give rise to a nonvisual evidential.
and reported. Closed-word classes deictics (see deixis) and locatives may
Systems with more than four terms have just two sensory give rise to evidentials, both in small and in large systems.
evidentials and a number of evidentials based on inference and Evidentials vary in their semantic extensions, depending on
assumption of diferent kinds; these include Nambiquara lan- the system. Reported information often has overtones of prob-
guages, from Brazil, and Foe and Fasu, of the Kutubuan family ability or unreliability, while visual evidentials may develop
spoken in the Southern Highlands of Papua New Guinea. meanings of certainty. hey can be extended to denote the
he terms veriicational and validational are sometimes used direct participation, control, and volitionality of the speaker.
in place of evidential. French linguists employ the term mdiatif morphemes marking tense, aspect, mood, modality, and evi-
(Guentchva 1996). A summary of work on recognizing this dentiality may occur in the same slot in the structure of a highly
category, and naming it, is in Jacobsen (1986) and Aikhenvald synthetic language.
(2004). Evidentiality is a property of a signiicant number of linguistic
Evidentiality does not bear any straightforward relationship areas, including the Balkans, the Baltic area, India, and a variety
to truth, the validity of a statement, or the speakers respon- of locations in Amazonia. Evidentials may make their way into
sibility. he truth value of an evidential may be diferent from contact languages, as they have into Andean Spanish. he texts
that of the verb in its clause. Evidentials can be manipulated to genre may determine the choice of an evidential. Traditional
tell a lie: One can give a correct information source and wrong stories are typically cast in reported evidential. Evidentials can
information, as in saying He is dead-reported when you were be manipulated in discourse as a stylistic device. Switching
told that he is alive, or correct information and wrong informa- from a reported to a direct (or visual) evidential creates the efect
tion source, as in saying He is alive-visual when, in fact, you of the speakers participation and conidence. Switching to a
were told that he is alive but did not see this. he ways in which nonirsthand evidential often implies a backgrounded aside.
semantic extensions of evidentials overlap with modalities Evidentiality is interlinked with conventionalized attitudes to
and such meanings as probability or possibility depend on the information and precision in stating its source.
system and on the semantics of each individual evidential
Alexandra Aikhenvald
term. In many languages (e.g., Quechua, Shipibo-Konibo, or
Tariana, all from South America), markers of hypothetical and
irrealis modality can occur in conjunction with evidentials on
one verb or in one clause. his further corroborates their status Aikhenvald, Alexandra Y. 2004. Evidentiality. Oxford: Oxford University
as distinct categories. Press.
Nonvisual and reported evidentials used with the irst person Aikhenvald, Alexandra Y., and R. M. W. Dixon, eds. 2003. Studies in
Evidentiality. Amsterdam: John Benjamins.
often refer to uncontrolled spontaneous action or have overtones
Barnes, J. 1984. Evidentials in the Tuyuca verb. International Journal of
of surprise, known as mirative.
American Linguistics 50: 25571.
Every language has some lexical way of referring to informa- Guentchva, Z., ed. 1996. Lnonciation mdiatise. Louvain-
tion source, for example, English reportedly or allegedly. Such Paris: ditions Peeters.
lexical expressions may become grammaticalized as evidential Jacobsen, William H., Jr. 1986. he heterogeneity of evidentials in
markers. Nonevidential categories may acquire a secondary Makah. In Evidentiality: he Linguistic Coding of Epistemology, ed.
meaning relating to information source. Conditionals and other Wallace L. Chafe and Johanna Nichols, 328. Norwood, NJ: Ablex. See
nondeclarative moods may acquire overtones of uncertain infor- other papers therein.
mation obtained from some other source for which the speaker
does not take any responsibility; the best-known example is the
EVOLUTIONARY PSYCHOLOGY
French conditional. Past tense and perfect aspect acquire
nuances of nonirsthand information in many Iranian and his term is used in several diferent, related senses. Among
Turkic languages, and so do resultative nominalizations and behavioral, social, and cognitive scientists, it properly refers to
passives. he choice of a complementizer, or a type of comple- a new scientiic paradigm or framework, together with the dis-
ment clause, may serve to express meanings related to the way cipline that has grown up around this framework, and the body
in which one knows a particular fact. In English, diferent com- of knowledge produced by the researchers working within that
plement clauses distinguish an auditory and a hearsay meaning framework. Some scholars outside the ield, as well as many
of the verb hear: Saying I heard Brazil beating France implies journalists and lay people, use it more loosely to refer to any ind-
actual listening, whereas I heard that Brazil beat France implies ing, speculation, or discussion that links evolution and behavior,
a verbal report of the result. hese evidential-like extensions are whether well informed or not. Evolutionary psychology as both
known as evidentiality strategies. Historically, they may give rise a research framework and a discipline is organized around the
to grammatical evidentials. proposition that the design features of the mechanisms compris-
he maximal number of evidentials is distinguished in state- ing a species psychology relect the character of the adaptive
ments. he only evidential possible in commands is the reported, problems they evolved to solve. his proposition was uncontro-
to express command on behalf of someone else: eat-reported! versial when applied by biologists to other species (e.g., Williams
means eat following someones command! Evidentials often 1966). However, it generated signiicant debate and opposition
come from grammaticalized verbs. he verb of saying is once it began to be applied to humans, who because of culture,
295
Evolutionary Psychology
intelligence, language and complexly variable social systems psychologists believe that they can more reliably, rapidly, and
appear notably diferent from other species (Sahlins 1977). efectively derive and test hypotheses about the functional orga-
he ield shares some tenets with early Chomskyan propos- nization of mental mechanisms than would be possible other-
als that the human mind contains numerous mental organs wise. hey argue that many major wrong turns in the history of
specialized for carrying out diferent cognitive tasks, such as a the behavioral sciences for example, many important aspects
language acquisition device (Chomsky 1965). he anti- of the Freudian, Skinnerian, or Piagetian paradigms would not
functionalist strain in Chomskys thinking led him to largely set have been made if their core propositions had been scrutinized
aside natural selection for communicative functions in his dis- for consistency with the kinds of outcomes that natural selection
cussions of language (Chomsky 1972). In contrast, evolutionary could plausibly have produced. he practice of using models of
psychologists such as the psycholinguist Steven Pinker (1994), ancestral-selection pressures as a guide to discovering previously
argue that the existence of mental organs can only be explained unknown psychological mechanisms renders them untroubled
as the consequence of natural selection. his is because selection by critics accusations that evolutionary analysis inevitably con-
is the only process known to science that builds complex func- sists of concocting post hoc just-so stories. To use general prin-
tional systems into the designs of organisms (Williams 1966). By ciples to derive predictions, and then to use these predictions
this standard, the intricate functional interdependence of the to discover something previously unknown, demonstrates that
various cognitive mechanisms underlying language provides such explanations are not concocted post hoc.
very strong evidence for the organizing role of natural selection he primary research goals of evolutionary psychology are
in constructing such mechanisms (Pinker and Bloom 1992). a) the discovery and progressive mapping of each of the evolved
Evolutionary psychology began to emerge in the 1970s and mechanisms of the human brain (or the brains of other species
1980s when a small number of researchers tried to synthesize of interest) and b) the exploration of the systematic behavioral
several distinct research orientations in a mutually consistent regularities and population-level phenomena that these evolved
way (Tooby and Cosmides 1992). he most important of these mechanisms generate in diferent social and cultural environ-
orientations were cognitive science, with its commitment to ments. So, for example, evolutionary psychologists claim to have
information-processing descriptions of psychological mecha- discovered and mapped the information-processing structure
nisms; modern primatology, huntergatherer studies, and of an evolved program in the human psychological architecture
paleoanthropology, which together ofered the prospect of char- whose function is to detect the individuals who are close genetic
acterizing the conditions in which humans evolved; evolutionary relatives, and then to generate greater sexual aversion and
biology (including behavioral ecology, sociobiology, ethology, greater altruism toward these individuals compared to others
and evolutionary game theory); and neuroscience, with its (Lieberman, Tooby, and Cosmides 2007). his evolved program
prospect of discovering the physical implementation of cogni- was predicted to be a part of our species-typical psychological
tive mechanisms. Evolutionary psychologists argued that cog- design, and is believed to explain some of the patterns involving
nitive mechanisms were, ipso facto, biological adaptations, family sentiments found across cultures (such as disgust at the
a proposition that inevitably connected cognitive science to prospect of incest with ones sibling).
evolutionary biology. If cognitive mechanisms are adaptations, Similarly, all human societies (and no nonhuman societies)
they then must exhibit an evolved organization, have an evolu- have complex languages and use them as the primary means of
tionary history, and have been naturally engineered to carry out communication. Evolutionary psychologists view languages as
evolved functions. Most importantly, the identiication of cogni- the population-level expression of a suite of evolved species-typ-
tive mechanisms with adaptations allowed the entire technical ical programs tailored by natural selection to facilitate commu-
apparatus developed within biology concerning adaptations to nication, especially of propositions (Pinker 1994). Although the
be imported and validly applied to cognitive science. evolutionary origins of language are obscure, evolution-
Evolutionary psychologists start from the premise that ary psychologists consider it inevitable that the present design
the brain, like our other organs, is the product of evolution. of the cognitive mechanisms underlying language competence
Speciically, the brain is viewed as an information-processing were naturally selected to function in a linguistic environment
organ that evolved over evolutionary time in order to regulate that is normal for our species. In consequence, a) they should be
behavior in an adaptively successful way. In a world illed with selected to assume the presence of a linguistic environment that
the disordering force of entropy, biologists and physicists rec- conforms to human language universals, and b) they should be
ognize that natural selection is the only known natural physi- designed to exploit the presence of these regularities to accom-
cal process that can push the designs of organisms uphill into plish the functions of acquisition, comprehension, and produc-
functionally organized systems. It follows that whatever function (as they appear to; Musso et al. 2003). Natural selection
tional organization there is to be found in the design of the brain thus provides a causal explanation for Chomskys assertion that
relects the history of selection that acted ancestrally on the strategies employed by the language acquisition device relect
species. Evolutionary psychologists use the cause-and-efect abstract uniformities across human languages (see universal
relationships between ancestral selection pressures and the grammar).
resulting functional architectures of the brains mechanisms as One central element that distinguishes evolutionary psy-
one powerful new tool to guide scientiic discovery. On this view, chology from other approaches is its focus on integrating what
the structure of each psychological mechanism should relect is known about evolution into the research process, rather than
the actions of the selection pressures that built it. Consequently, ignoring this knowledge. Applying information about ances-
by considering ancestral adaptive problems, evolutionary tral conditions and selection pressures allows evolutionary
296
psychologists to derive hypotheses about the design of human based on information it is exposed to. Because the evolved func-
information-processing mechanisms from the large preexist- tion of a neural (or psychological) mechanism is inherently
ing body of theories already developed and empirically tested computational (i.e., as a program mapping informational inputs
within modern evolutionary biology. For example, evolution- to outputs), the only form of description that can accurately
ary biologists know that for organisms like humans, mating with characterize how its organization solves its adaptive problem
close relatives causes genetic defects to express themselves at far is an information-processing description. Physical descrip-
higher rates in the incestuously produced children. his has led tions of brain subsystems cannot, by their nature, fully capture
evolutionary psychologists a) to the general prediction that nat- the information-processing interrelationships that embody the
ural selection had built a program in humans designed to iden- function of an evolved program (mechanism, adaptation, etc.).
tify close genetic relatives; b) to detailed predictions about the So, for example, however interesting it is to identify the brain
cues that the program would use to identify genetic relatives; and regions implicated in various aspects of language processing, it
c) to detailed predictions about how this kin detection program is still important to develop a parallel account in terms of com-
would be coupled to increased sexual aversion to individuals putational steps (data structures, operations, etc.). Similarly,
it identiied as genetic relatives (as well as increased altruism, simply observing that humans behaviorally tend to avoid incest
as predicted by kin selection theory). he analysis of ancestral inside the nuclear family is very diferent from having mapped
selection pressures and huntergatherer conditions made it pos- the information-processing steps in the evolved programs that
sible to design studies that could test (and did conirm) these take prespeciied cues to kinship as input, compute from them
propositions. hese studies, in turn, mapped the information- magnitudes that capture estimated genetic relatedness, and then
processing architecture of these functionally specialized pro- pass these magnitudes into the sexual-choice motivational sub-
grams (Lieberman, Tooby, and Cosmides 2007). In contrast, the system, where they generate sexual disgust at mating with those
disregard by sociocultural anthropologists (and Freudians) of the it identiies as genetic relatives.
selection pressures that select strongly against incest prevented A third diference in perspective between evolutionary psy-
them from discovering the existence of these evolved mecha- chologists and most other behavioral scientists is in how numer-
nisms. Once a mechanism is mapped, its population-level social ous and functionally specialized they expect the psychological
and cultural expressions can also be analyzed such as moral mechanisms of a species to be. For most of the last century, the
attitudes about incest in the case of kin detection and human lin- majority view among learning theorists, cognitive scientists, and
guistic variation in the case of language. neuroscientists has been that the psychological mechanisms
Evolutionary psychology originally emerged among anthro- that operate on experience to produce knowledge are likely to
pologists, cognitive scientists, biologists, and psychologists, be small in number, and to be primarily content independent
although it has subsequently difused into many other disci- and general purpose (Pinker 2002; Tooby and Cosmides 1992).
plines. Evolutionary psychology is not a subield of psychology, Content independence means that a cognitive procedure (such
and it is not devoted to the study of a speciic class of phenom- as association formation in connectionism) operates in the
ena. Rather, it is an approach to the behavioral, social, cognitive, same way regardless of the content it is processing. Hence, on
and neural sciences that can be applied to any of the topics they this view, the same cognitive procedures are expected to oper-
deal with. Originally reacting against the mutually contradictory ate on all contents uniformly, whether language, ighting, eating,
claims about the mind and human nature advanced in diferent sex, family interactions, or intergroup conlict. his blank slate or
disciplines, evolutionary psychologists constructed what they environmentalist view can be expressed by comparing the oper-
argue is a logically integrated scientiic framework that attempts ation of learning mechanisms or cognitive mechanisms to the
to reconcile into a single body of knowledge the results drawn operation of a tape recorder that processes all sounds uniformly,
from all relevant ields. Its advocates view it as an interdisci- regardless of their meaning: he content that ends up on the tape
plinary nucleus around which a single uniied theoretical and relects only the content present in the environment, and noth-
empirical behavioral science is being crystallized. Of course, not ing in the tape-recording machinery itself introduces content of
everyone in behavioral science agrees, with disagreements rang- its own that was not present in the environment.
ing from disputes over speciic analyses to broader rejection of From a selectionist perspective, however, such a blank-slate
the program, often in favor of culturalist and social construction- viewpoint seems extremely implausible, as well as inconsistent
ist views. with what is known about the cognitive architectures of nonhu-
A second feature that distinguishes evolutionary psychology mans (Gallistel 1990). Mutations for specialized design features
is the importance it places on achieving information-processing that exploit the rich recurrent structure of particular problem
descriptions of the designs of evolved mechanisms, rather than domains should spread by natural selection whenever they cost-
stopping at behavioral or neuroscience descriptions. Along with efectively improve the organisms propensity to solve important
most cognitive scientists, evolutionary psychologists believe that adaptive problems in a itness-promoting way. hat is, if there is
the brain, like any other computational system, can usefully be a particular set of cues that solves the problem of kin detection,
mapped both in physical terms (which, for the brain, means in then the mind could evolve a specialization that is designed to
neurophysiological and neuroanatomical terms) and also com- take only those cues as input. For a problem-solving strategy to
plementarily in information-processing terms. Evolutionary be applied generally across contents, it cannot employ problem-
psychologists go on to stress that the brain and its subsystems solving shortcuts that work only on particular problem subsets,
evolved as an organ (or set of organs) of computation: he brain such as grammar acquisition, depth perception, kin detection,
evolved in order to regulate behavior and physiology adaptively or mate selection. Hence, evolutionary psychologists consider it
297
likely that the mind solves the diverse computational problems Evolutionary psychology has grown rapidly in numbers and
posed by stereopsis, color vision, echolocation, face recogni- acceptance over the last three decades, and it is now presented
tion, object mechanics, navigation, and reasoning about social in many sources alongside Freudianism, behaviorism, cogni-
exchange by using at least some principles and operations that tive science, and neuroscience as one of the basic approaches to
are particular to each respective domain. Evolutionary psycholo- psychology. In that time, evolutionary psychologists have used
gists argue that evolved specializations that are activated only by evolutionarily derived predictions to discover scores of previ-
certain content domains or adaptive problems seem virtually ously unknown mechanisms and design features in the human
inevitable, rather than implausible or exceptional outcomes of psychological architecture (Buss 2005). Nevertheless, it remains
the evolutionary process. his is because selection inherently signiicantly more controversial than other young ields, such as
favors eiciency and puts no weight per se on uniformity or sim- cognitive neuroscience, and is still a minority viewpoint whose
plicity (Tooby and Cosmides 1992). speciics are vigorously disputed. Indeed, many researchers
Moreover, unlike a tape recorder, the designs of such evolved who are reluctant to associate themselves with the controversies
psychological mechanisms might be expected to regularly intro- surrounding evolutionary psychology have nonetheless quietly
duce particular contents, motivations, interpretations, and adopted many of its core principles, so that claims of evolved
conceptual primitives into the human mind that are not simply functional specializations and evolutionary origins are far more
derived from the environment. From an engineering perspec- common and unabashed in the behavioral sciences than they
tive, it is easy to see how such reliably developing contents could were even a decade ago. For example, the modularist tradition
enhance adaptive performance. For example, the environmental in cognitive development adopts what is largely an evolution-
regularity of venomous snakes posed an evolutionarily long-en- ary psychological stance: Various specialized competences the
during adaptive problem. his regularity appears to have selected theory of mind module, intuitive physics, and intuitive biology
for an evolved computational device implemented in the brains are viewed as evolved, reliably developing, domain speciic, and
of African primates (including humans). his adaptation con- designed to relect the special task demands posed by the adap-
tains a psychophysical speciication of snakes linked to a system tive problems special to each domain (Hirschfeld and Gelman
that motivates snake avoidance. Additionally, this avoidance is 1994).
up-regulated to the extent that the individual is exposed to con- Some controversies over evolutionary psychology are gen-
speciics who display fear toward snakes (hman and Mineka erated by misunderstandings, while others concern unsettled
2001). his depends on mental content about snakes being built theoretical and empirical issues (e.g., how can neural plasticity
into the mechanism. he human mind is suspected to contain be reconciled with the existence of evolved specializations in the
neurocomputational versions of what philosophers would once brain?). However, heated resistance is perhaps attributable to
have called innate ideas, such as snake, spider, mother, predator, the sensitivity of applying evolutionary theories broadly across
food, word, verb, agency, object, and patient (Tooby, Cosmides, human experience. For example, cognitive science originated
and Barrett 2005). By augmenting the cognitive architecture in in philosophy and linguistics, and as a result tends to focus on
such a fashion, natural selection could supercharge perceiving, relective issues, such as knowledge acquisition and speech com-
learning, reasoning, and decision making in evolutionarily con- prehension, which have only limited intrinsic personal or social
sequential domains. meaning. In contrast, evolutionary psychologists ambitions
At a minimum, evolutionary psychologists expect that in addi- extend to characterizing the mechanisms underlying all human
tion to whatever general-purpose cognitive machinery humans action. hese include social interactions such as aggression, sex-
have, we should also be expected to have a wide array of domain- ual attraction, exploitation, and cooperation. Evolutionary biol-
speciic mechanisms, including specialized learning mecha- ogy provides rich theories about these domains, but analysis of
nisms. So, for example, although the snake phobia system, the the causes of these phenomena inevitably triggers strongly felt
kin detection mechanism, and the language acquisition system personal and ideological reactions.
are all learning mechanisms, they are each specialized only for Language is commonly viewed by evolutionary psycholo-
their particular type of content (snakes linked to fear intensity, gists as the expression of a set of reliably developing cognitive
kinship cues linked to incest aversion and altruistic motivation, mechanisms that evolved to convey propositional information
and language inputs linked to linguistic competence). For this through a serial channel (Pinker 1994). he high degree of func-
reason, evolutionary psychologists do not regard learning as con- tional elaboration in language suggests that it has been shaped
stituting an alternative explanation for the claim that a particu- by selection over long expanses of evolutionary time. Although
lar kind of behavioral output was shaped by evolution. Evidence it seems likely that many mechanisms involved in language are
that something is learned is not in the least inconsistent with the general in that they are used in other cognitive tasks, it is dii-
claim that much of the knowledge produced was supplied by cult from an evolutionary psychological perspective to see how
specialized learning mechanisms permeated with evolved con- such an important activity would not have strongly selected for
tent. Critics of evolutionary psychology view its multiplication the emergence of proprietary cognitive specializations designed
of hypothesized cognitive mechanisms (e.g., specializations for to solve languages constituent subtasks with special eiciency.
language acquisition, kin detection, mate selection, and so on) Several lines of evidence argue that at least some (if indeed not
to be unparsimonious. Evolutionary psychologists respond that most) of the cognitive mechanisms underlying language are
although parsimony may have been a useful principle in physics, adaptations designed by natural selection for language. he
evolutionarily engineered systems are not designed to be simple competing hypothesis is that language is a by-product of gen-
but, rather, to be adaptively efective. eral intelligence, symbolic capacity, the capacity for culture,
298
Evolutionary Psychology Exemplar
neo-associationistic mechanisms, or other general-purpose Generation of Culture, ed. J. Barkow, L. Cosmides, and J. Tooby, 19136.
alternatives (Pinker 1994). First, computationally intricate lin- New York: Oxford University Press.
guistic capacities develop precocially far earlier than compa- Tooby, J., L. Cosmides, and H. C. Barrett. 2005. Resolving the debate
rable cognitive achievements in other domains. Second, genetic on innate ideas: Learnability constraints and the evolved interpen-
etration of motivational and conceptual functions. In he Innate
and developmental conditions can doubly dissociate language
Mind: Structure and Content, ed. P. Carruthers, S. Laurence, and S.
and general intelligence (i.e., one can speak well with low intel-
Stich, 30537. New York: Oxford University Press.
ligence and be unable to speak but have otherwise unimpaired Tooby, John, and I. DeVore. 1987. he reconstruction of hominid
intelligence). hird, underneath linguistic variability are design behavioral evolution through strategic modeling. In he Evolution of
features like linear order, constituency (see constituent Primate Behavior: Primate Models, ed. Warren Kinsey, 183237. New
structure), predicate-argument structure, case markers, York: SUNY Press.
morphophonemic rules, and phonological rules that are a) uni- Williams, George C. 1966. Adaptation and Natural Selection: A Critique
versal and b) well designed to communicate propositional infor- of Some Current Evolutionary hought. Princeton, NJ: Princeton
mation, such as who did what to whom, but poorly designed for University Press.
many other cognitive tasks, such as statistical induction, imag-
ery, face recognition, and so on (see phonology, universals EXEMPLAR
of; morphology, universals of; syntax, universals
of; semantics, universals of). his term occurs importantly in research and theorization in cat-
Finally, some evolutionary psychologists propose that egory identiication, recognition, categorization, and learning. It
language was a critical ingredient allowing humans to enter is used interchangeably with the terms instance or item across
their peculiar adaptive mode, the cognitive niche. On this view, various strands of research, including psychology, religion, and
the cognitive niche is a way of life in which massive amounts of history.
contingent information are generated and used for the regula- Within the context of category learning, for instance, the term
tion of improvised behavior that is successfully tailored to local exemplar refers to a speciic instance, such as a speciic cat to
conditions (Tooby and DeVore 1987; Pinker 1994). Essential to which a parent points when teaching a child the concept and
increasing the supply of useful propositional information was name of cat. Alternatively, during remediation of language skills
dramatically lowering the cost of its acquisition from others. in children with severe disabilities, researchers have utilized var-
Language appears admirably designed to accomplish this ious exemplars of graphical symbols to improve communication
task. (Schlosser 2003). In studies examining category relearning in
individuals who have sufered brain damage, training in naming
Daniel Sznycer, John Tooby, and Leda Cosmides of a subset of exemplars results in improved naming of untrained
exemplars within the category (Kiran 2007).
WORKS CITED AND SUGGESTIONS FOR FURTHER READING Within the topic of categorization of semantic concepts,
Buss, D. M., ed. 2005. he Handbook of Evolutionary Psychology. the terms speciic usage comes in the context of exemplar
Hoboken, NJ: Wiley. theory. Briely, this theory suggests that a category is repre-
Chomsky, N. 1965. Aspects of the heory of Syntax. Cambridge, MA: MIT sented by a collection of members (exemplars) that have been
Press. previously encountered, experienced, and stored as unique
. 1972. Language and Mind. New York: Harcourt, Brace, and and individual memory traces. A new object/item is judged as a
World. member of a given category provided that it is suiciently similar
Gallistel, C. R. 1990. he Organization of Learning. Cambridge, MA: MIT to the stored exemplars (Komatsu 1992). his speciic interpre-
Press.
tation of exemplar is at odds with an alternate view of catego-
Hirschfeld, Lawrence A., and Susan A. Gelman, eds. 1994. Mapping
rization, namely, the prototype theory, which suggests that a
the Mind: Domain Speciicity in Cognition and Culture. New York:
category is represented in terms of a single summary representa-
Lieberman, D., J. Tooby, and L. Cosmides. 2007. he architecture of tion (i.e., a prototype).
human kin detection. Nature 445.7129: 72731. Not all theorists agree that exemplar and prototype models
Musso, M., A. Moro, V. Glauche, M. Rijntjes, J. Reichenbach, C. Bchel, are competitors; there is yet another class of models according
and C. Weiller. 2003. Brocas area and the language instinct. Nature to which categorization decisions are made using exemplars,
Neuroscience 6: 77481. although the efect of using exemplars necessitates the creation
hman, A., and S. Mineka. 2001. Fears, phobias, and preparedness. of abstractions that can be later applied to novel exemplars (Ross
Psychological Review 108: 483522. and Makin 1999). Similarly, some connectionist networks
Pinker, Steven. 1994. he Language Instinct. New York: Morrow. assume that a category is represented by summary information
. 2002. he Blank Slate. New York: Viking.
across the entire network and, depending upon the input pro-
Pinker, Steven, and Paul Bloom. 1992. Natural language and natural
vided, speciic connection strengths in the network have greater
selection. In he Adapted Mind: Evolutionary Psychology and the
Generation of Culture, ed. J. Barkow, L. Cosmides, and J. Tooby, 45193.
inluence on the overall activation (Knapp and Anderson 1984).
New York: Oxford University Press. Finally, the interpretation of the term exemplar can also be
Sahlins, Marshall. 1977. he Use and Abuse of Biology. Ann Arbor: he inluenced by the level of category structure. As Edward Smith
University of Michigan Press. and Douglas Medin (1999) argue, the term can refer to a spe-
Tooby, John, and L. Cosmides. 1992. he psychological foundations ciic instance of the concept (e.g., your favorite blue jeans
of culture. In he Adapted Mind: Evolutionary Psychology and the in the category clothing) or to a subset of the concept (blue
299
Exemplar Theory
jeans). Further, whereas experimental investigations of exem- subsequently inluencing phonetic and phonological out-
plars typically refer to them as a basic level concepts (e.g., put (Pierrehumbert 2001), lexical and morphological output
apple), exemplars can also refer to an individual entity such as (Goldinger 1997 and Bybee 2002), and childrens manipula-
Macintosh apple, which is a subordinate concept. tions of syntactic structures (Tomasello 2003; see also syntax,
acquisition of).
Swathi Kiran
Exemplars are not prototypes. hey are individual instances of
linguistic usage retained in memory. Given the empirical neces-
sity for incorporating memory for exemplars into models of lan-
Knapp, A., and J. Anderson. 1984. heory of categorization based guage behavior, the crucial research question in exemplar-based
on distributed memory strorage. Journal of Experimental approaches becomes whether they alone can account for the
Psychology: Learning, Memory and Cognition 10: 61637. spectrum of linguistic behaviors or whether there remains inde-
Kiran, Swathi. 2007. Semantic complexity in the treatment of naming
pendent empirical justiication for the rule-based components.
deicits. American Journal of Speech Language Pathology 16: 112
Exemplar-based models of language are founded on the sim-
Komatsu, Lloyd. 1992. Recent views of conceptual structure.
Psychological Bulletin 112: 50026.
ple notion that in language use, speakers will compare a current
Ross, Brian, and Valarie Makin. 1999. Prototype versus exemplar mod- linguistic expression and its context (linguistic and nonlinguistic)
els in cognition. In he Nature of Cognition, ed. R. Sternberg, 20542. with their personal collections of memories for similar expres-
Cambridge, MA: MIT Press. sions and then choose at least one of the tokens in memory an
Schlosser, Ralf. 2003. he Eicacy of Augmentative and Alternative exemplar as the basis for deciding how to interpret or other-
Communication: Toward Evidence-Based Practice. Amsterdam and wise operate on that expression. Usually, the token(s) selected
Boston: Elsevier. will be similar to the input currently being considered and its
Smith, Edward, and Douglas Medin. 1999. he exemplar view. In context. Such models imply that the brain stores vast invento-
Concepts: Core Readings, ed. E Margolis and S. Laurence, 20721.
ries of memories for individual episodes of linguistic experience
and that it employs some procedure for comparing the features
of the new input or current context to the features of those
remembered exemplars, and then has some basis for choosing
EXEMPLAR THEORY
one of those exemplars as the model for an analogical response
An important goal of linguistic theory has been to develop explicit (or interpretation).
approaches to describing, modeling, and explaining linguistic Given the empirical evidence that memories for individual
behavior. Certainly, most familiar to linguists are the rule-based exemplars of previous linguistic behavior inluence current
models that derive abstract rules from linguistic exemplars, then linguistic behavior, we restrict our discussion in this entry to
use those rules to predict linguistic behavior. Such rule systems explicitly deined exemplar-based approaches that use actual
are readily shown to be empirically inadequate, both diachron- exemplars (typically gleaned from linguistic corpora) to predict
ically, as in the shift from digged to dug as the past tense of dig, linguistic behavior. he approaches discussed here are all com-
and synchronically (see synchrony and diachrony), as in putationally based and have actually been tested against real
the overgeneralization (see overregularizations) of glew linguistic behavior. he algorithms are also publicly available to
as the past tense of glow in place of glowed. hus, rule-based researchers.
models of language behavior must also incorporate some sort of In the three approaches that follow, the exemplars are
component that can account for analogical interactions among retained and directly used to predict linguistic behavior. A data
linguistic items. set of relevant exemplars is constructed from actual linguistic
Generally speaking, there are two broad categories of analog- corpora; then an algorithm is applied that compares the new
ical models under investigation in current linguistic research. input to the exemplars in the data set and selects certain of those
One group consists of approaches that use linguistic exemplars exemplars while lessening or even zeroing out the chances of
to derive an analogical system but which then do not consult other exemplars in the data set being used. Typically, the exem-
those individual exemplars of linguistic experience further in plars in the data sets are composed of outcomes associated with
predicting linguistic behavior. Best known among this cate- various linguistic variables or features. A prediction is then made
gory of analogical models are the connectionist approaches for the outcome deined by an input set of variables (the given
to language. Connectionist models typically pool their training context). Normally, exemplars that are in some sense closer to
input into schematic, prototype-like representations of a cat- the given context have a higher chance of being selected, but
egory that do not retain individualizing information about the sometimes the algorithm may select more distant exemplars. In
exemplars used to train the models. Such representations, how- other words, these approaches sometimes allow exemplars that
ever, make the models empirically inadequate in two respects. are not nearest neighbors that is, most similar to be used.
First, connectionist models incorrectly predict behaviors such
as categorization and response times in terms of similarity to The Generalized Context Model (GCM)
the prototype encoded in the network, rather than in terms of he generalized context model (Nosofsky 1992) was developed
similarity to individual exemplars. Second, there is abundant, primarily as a model of concept learning, choice behavior, and
and growing, evidence that memories for individual linguis- categorization. he GCM has been tested most extensively against
tic experiences do inluence subsequent linguistic behaviors. nonlinguistic behavior, but has also been tested on morpholog-
Research has derived clear evidence of speciic exemplars ical processes, such as predicting the plural forms of German
300
Exemplar Theory
nouns (Nakisa and Hahn 1996). he GCM determines the con- the best results for a particular task and a particular data set, and
ditional probability of assigning a given linguistic form say, the although the diferences among them are usually not great, there
base form of a noun to a particular form class, for example, a appears to be no principled basis for choosing one measure over
particular plural form. It does so by comparing the features of the another.
test form with the weighted sum of those features in all the exem- One important contribution of the MBL studies to exemplar-
plars of one response category, divided by the weighted sum of based modeling theory is that reducing the size of the data set by
those features across the exemplars of all the possible response omitting very low frequency exemplars, redundant exemplars,
categories. he model also factors in a response bias value for the and very rare but exceptional exemplars actually reduces the
diferent categories. hus, it arrives at a conditional probability level of correct predictability. hus, MBL researchers now rec-
for choosing any one response over the alternatives. ognize the need to construct large, complete data sets in order to
In the application of the GCM to nonlinguistic data, the maximize overall predictability.
accuracy of the models predictions depends crucially upon
the weightings assigned to the diferent stimulus features and Analogical Modeling (AM)
the response biases for the alternative categories. Typically, Analogical modeling (Skousen 1989) is not a nearest neighbor
both are determined ahead of time by constructing a confusion approach. While it includes nearest neighbors in its predictions,
matrix for the exemplars to be used. he resulting weightings it also regularly uses non-nearest neighbors to predict behavior.
(said to account for the efects of selective attention during train- In certain cases, the nearest neighbor model simply makes the
ing) and response biases then apply only to the given data set wrong prediction. (For an explicit example of where AM cor-
of exemplars. he feature weightings determined ahead of time rectly rejects the nearest neighbors in predicting behavior, see
for a given data set are equivalent to the information gain values chapter 2 in Skousen, Lonsdale, and Parkinson 2002.)
described for the memory based learning model that follows and AM is an explicit model of analogy. Non-neighbors can be
are subject, therefore, to the same theoretical criticisms in that used, but only under a well-deined condition of homogeneity.
they must be calculated ahead of time for a given data set and do AM uses a simple decision principle to determine homogene-
not generalize to a new data set. In applying the GCM to natural ity: namely, never allow the analysis to increase uncertainty,
language data, Ramin Nakisa and Ulrike Hahn (1996), of course, which means that no analogical analysis will ever allow any
were not able to obtain feature weightings for German nouns in unnecessary loss of information.
native speakers of the language, and the model therefore did not Unlike connectionist models, no training stage occurs in
perform as well as a competing connectionist model. AM, except in the trivial sense that one must collect exemplars
in order to make predictions. here is no setting of parameters
Memory Based Learning (MBL) nor any prior determination of variable signiicance (see prin-
Memory based learning (Daelemans and van den Bosch 2005) ciples and parameters theory). he signiicance of any
is a nearest neighbor model developed speciically for predict- combination of variables is always determined in terms of the
ing language behavior. Pure nearest neighbor approaches count given context for which we seek a predicted outcome.
each variable, or feature, as equally important for comparing an he resulting probability of using a particular exemplar
input item to the stored exemplars and identifying one or more depends upon three factors: 1) proximity: the closer the exem-
of the nearest neighbors, that is, most similar exemplars, as plar to the given context, the greater its chances of being selected
the basis for predicting an analogical response. However, as is as the analogical model; 2) gang efect: when a group of exem-
widely recognized, simple nearest neighbor models are empiri- plars in the same space behave alike, the chances of one of those
cally inadequate for predicting actual language behavior. Real exemplars being selected is multiplied; and 3) heterogeneity: the
people often give responses that clearly are not traceable to the chances of an exemplar being used is zero whenever there is any
most similar exemplar already known. intervening exemplar closer to the given context that behaves
Daelemans and his colleagues have addressed this empiri- diferently (that is, has a diferent outcome).
cal shortcoming by determining from the database ahead of Analogical modeling can be reinterpreted in terms of rules, as
time the overall signiicance of each individual variable to be follows: 1) Every possible true rule exists, and 2) the probability of
used in predicting outcomes. In this way, the distance of vari- using a true rule is proportional to its frequency squared. A true
ous neighbors from a particular input context can be adjusted rule is a rule whose context is homogeneous in behavior. Despite
according to the importance of each variable to a particular task. this equivalence, AM is not like regular rule approaches. Since all
he researchers have developed a number of similarity or dis- of the true rules are said to exist, there will be overlapping rules,
tance metrics for determining the signiicance of the individual redundant rules, and rules based on as little as one exemplar.
variables. Among the more important of these are informa- hese equivalent true rules, when considered from the perspec-
tion gain (IG), gain ratio (GR), the chi-square statistic (2), and tive of AM, are created on the ly; they are not stored some-
shared variance (SV). Depending on the particular data set and where, waiting to be used. In fact, until an outcome is selected,
its behavior, one gets diferent rates of correct prediction, but the all the true rules are constructed in a kind of superpositioning
predictions are almost always better than providing no weighting and are processed simultaneously and by the same reversible
of the variables at all. Indeed, without such adjustments in the procedures. his approach allows AM to be implemented as a
weighting of features, the nearest neighbor approaches cannot system of quantum computation.
predict actual language behavior accurately. Unfortunately, one Analogical modeling allows for imperfect memory. In fact, in
cannot know in advance which measure of similarity will provide order to model the variability of language properly, it is necessary
301
Extinction of Languages
to assume that access to exemplars is not always available and, in recognized oicially, restricted to local community and home
general, can be considered a random phenomenon. functions, and spoken by very small groups of people (see diglos-
One important result from AM is that one cannot assume in sia and language policy). Languages are most obviously at
advance which variables are signiicant and thus ignore the oth- risk when they are no longer transmitted naturally to children in
ers. Often, the potential value of a variable remains latent until the home by parents or other caretakers. UNESCO suggests that
the model is required to predict the outcome for an appropri- languages being learned by fewer than 30 percent of the younger
ate given context. his kind of result can occur when gangs of generation may be at risk, yet there is very little information about
non-neighbors are called upon to predict the behavior of a given the number of languages no longer being transmitted.
context. Most projections of the scale of the problem rely on size as a
proxy for degree of endangerment, despite lack of agreement on
Royal Skousen and Steve Chandler
how many speakers are thought necessary for a language to be
viable. A large language could be endangered if the external pres-
sures on it were great (e.g., Quechua with some millions of speak-
Bybee, Joan. 2002. Phonological evidence for exemplar storage of multi- ers) whereas a very small language could be perfectly safe so long
word sequences. Studies in Second Language Acquisition 24: 21521. as the community was functional and the environment stable (e.g.,
Daelemans, Walter, and Antal van den Bosch. 2005. Memory-Based Icelandic with fewer than 300,000). However, small languages can
Language Processing. Cambridge: Cambridge University Press.
disappear much more rapidly than large ones, and forces such as
Goldinger, Stephen D. 1997. Words and voices: Perception and produc-
the spread of farming, colonization, industrialization, and global-
tion in an episodic lexicon. In Talker Variability in Speech Processing,
ed. K. Johnson and J. W. Mullennix, 3365. San Diego, CA: Academic
ization have propelled a few languages all Eurasian in origin to
Press. spread over the last few centuries (see modern world-system,
Nakisa, Ramin, and Ulrike Hahn. 1996. Where defaults dont help: he language and the; colonalism and language).
case of the German plural system. In he Proceedings of the 18th Manx, for instance, was spoken on the Isle of Man for about 1,500
Annual Conference of the Cognitive Science Society, 17782. Hillsdale, years. Ned Maddrell, the last-known speaker, died in 1974. Not long
NJ: Lawrence Erlbaum. before his birth in 1877, nearly a third of the island (around 12,000
Nosofsky, Robert M. 1992. Exemplar-based approach to relating cat- people) still spoke Manx. Today, all the remaining Celtic languages
egorization, identiication, and recognition. In Multidimensional such as Breton, Scots Gaelic, Irish, Gaelic, and Welsh, and so on,
Models of Perception and Cognition, ed. F. G. Ashby, 36393. Hillsdale, are threatened to various degrees by the spread of English and/or
French. Marie Smith Jones (d. 2008) was the last person who spoke
Pierrehumbert, Janet B. 2001. Exemplar dynamics: Word frequency, leni-
Eyak, one of Alaskas twenty or so native languages. Only two,
tion and contrast. In Frequency Efects and Emergent Grammar, ed.
J. Bybee and P. Hopper, 119. Amsterdam: John Benjamins.
Siberian Yupik (spoken in two villages on St. Lawrence Island)
Skousen, Royal. 1989. Analogical Modeling of Language. Dordrecht: and Central Yupik (spoken in seventeen villages in southwestern
Kluwer. Alaska) are being transmitted to children as the irst language of the
Skousen, Royal, Deryle Lonsdale, and Dilworth S. Parkinson. 2002. home. No children are learning any of the nearly hundred native
Analogical Modeling: An Exemplar-Based Approach to Language. languages in what is now the state of California.
Amsterdam: John Benjamins. Based on estimates from the Ethnologue database compiled
Tomasello, Michael. 2003. Constructing a Language: A Usage-Based by the Summer Institute of Linguistics (Gordon 2005), Table 1
heory of Language Acquisition. Cambridge: Harvard University Press. displays the percentage of languages in diferent continents with
fewer than some number of speakers. he median number of
speakers for the languages of the world is only 5,000 to 6,000, and
EXTINCTION OF LANGUAGES
nearly 85 percent of languages have fewer than 100,000. Languages
An increasing number of books, scholarly articles, and media in Australia, the Paciic, and the Americas are mainly very small;
reports have predicted that 5090 percent of the worlds approxi- over 20 percent have fewer than 150 speakers, and nearly all have
mately 6,900 some languages may be at risk of extinction within fewer than 100,000, which is Krausss (1992) threshold for viability.
the next hundred years (see, for example, Krauss 1992; Nettle and Africa, Asia, and Europe, however, have a fair number of medium
Romaine 2000; Crystal 2000; Abley 2003). his alarming igure does sized languages with 100,000 to 1 million speakers, in addition to
not include dialects because no one knows exactly how many some giant languages. Such languages are probably safer in the
languages and dialects there are, and there are no clear criteria for short term at least. Even if the viability threshold is set at the lower
distinguishing between language and dialect (see Wolfram and level of 10,000 speakers, 60 percent of all languages may already
Schilling-Estes 1998 for discussion of dialect endangerment). be endangered. he situation is slightly better in Africa (33%), Asia
Estimates of the number of languages in danger of extinction (53%), and Europe (30%), but much worse in North and South
vary depending on the criteria used to assess risk. UNESCOs America (78% and 77%) and Australia and the Paciic (93%).
World Atlas of the Worlds Languages in Danger of Disappearing he issue of language extinction cannot be separated from
(2001) estimates that 50 percent of languages may be in various people, their identities, their cultural heritage, and their rights.
degrees of endangerment while Michael Krauss (1992) believes Maintaining cultural and linguistic diversity is a matter of social
that up to 90 percent may be threatened. More research is needed justice because distinctiveness in culture and language has
in order to understand the role of various factors, such as size (i.e., formed the basis for deining human identities (see ethno-
number of speakers), status, function, and so on, in supporting linguistic identity; identity, language and). Because
or not supporting languages. Most languages are unwritten, not language plays a crucial role in the acquisition, accumulation,
302
Family Resemblance
resemblance is the most prominent. However, it is crucial to be

Table 1. Percentages of languages according to continent of origin
clear about the nature and purpose of such terms in PI; they are
having fewer than indicated number of speakers
ofered by Wittgenstein as merely purpose-relative, perspicuous
Continent/ < 150 < 1000 < 10,000 < 100,000 <1 (ibid., 122), ways of presenting certain aspects of our language
Region Million use, proposed neither as elements of a theory of language nor as
methodological devices to be applied mechanically elsewhere.
Africa 1.7 7.5 32.6 72.5 94.2
Understanding the role and purpose of such terms is dependent
Asia 5.5 21.4 52.8 81.0 93.8 on understanding Wittgensteins vision of philosophy.
Europe 1.9 9.9 30.2 46.9 71.6 Wittgensteins vision of philosophy was that it is a therapeutic
North 22.6 41.6 77.8 96.3 100
activity, the purpose of which is the dissolution of philosophi-
America cal problems through the achievement of clarity. Such clarity is
not to be achieved by the production of philosophical theses, for
Central 6.1 12.1 36.4 89.4 100
philosophical problems and the consequent desire to produce
America
theories arise through our misunderstanding of the logic of our
South 27.8 51.8 76.5 89.1 94.1 language (1922, 3). herapeutic dialogue brings to the fore unac-
America knowledged or unconscious commitments to certain pictures
Australia/ 22.9 60.4 93.8 99.5 100 of the way things must be; once conscious, once acknowledged,
Paciic such commitments lose their thought-constraining grip.
World 11.5 30.1 59.4 83.8 95.2 One thought-constraining commitment that Wittgenstein
considered pervasive in philosophy was the commitment to
Source: From Nettle and Romaine (2000, 40). language as essentially representative: that words name objects,
the objects being their meaning ([1953] 1958, 1); as he wrote
maintenance, and transmission of human knowledge, the pros- on the irst page of the Blue Book (BB): we are up against one
pect of extinction raises critical issues about the survival of this of the great sources of philosophical bewilderment: a substan-
knowledge. Loss of linguistic diversity also threatens the scien- tive makes us look for a thing that corresponds to it (1958, 1; but
tiic study of language by diminishing the range of structures also see [1953] 1958, 4045 and 8990). It is not that such
for constructing hypotheses about universals. a view is explicitly espoused by philosophers but, rather, that it
often operates as an unconscious or unacknowledged picture
Suzanne Romaine of the way language must operate. Much of the opening of PI is
an attempt to bring to consciousness this hitherto unconscious
WORKS CITED AND SUGGESTIONS FOR FURTHER READING commitment, so that we might recognize it as nonobligatory; in
Abley, Mark. 2003. Spoken Here: Travels among hreatened Languages. recognizing it as such, the thought-constraining grip of this pic-
Toronto: Random House of Canada. ture will be loosened, and the philosophical problems and myths
Crystal, David. 2000. Language Death. Cambridge: Cambridge University to which it might lead, as well as the consequent desire to pro-
Press. pound theories, will dissipate.
Gordon, Raymond G., Jr., ed. 2005. Ethnologue: Languages of the World. One strategy Wittgenstein employs is to suggest that we might
15th ed. Dallas: SIL International. Available online at: http://www. look at the use of a word when one wants to know its meaning.
ethnologue.com/. his is designed to wean one away from the desire to look for (or
Krauss, Michael. 1992. he worlds languages in crisis. Language
theorize into existence) the object for which one assumes a sub-
68: 410.
stantive must stand. As a prophylactic, he suggests that we might
Nettle, Daniel, and Suzanne Romaine. 2000. Vanishing Voices: he
Extinction of the Worlds Languages. Oxford: Oxford University Press.
see language use and the practices in which it is embedded as a
UNESCO. 2001. World Atlas of the Worlds Languages in Danger of game ([1953] 1958, 7). Identiication of the language-game being
Disappearing. Paris: UNESCO. played such as the language-game of describing, that of order-
Wolfram, Walt, and Natalie Schilling-Estes. 1998. Endangered dialects: A ing, that of predicting, that of naming, and so on is suggested
neglected situation in the endangerment canon. Southwest Journal of as a procedure for identifying the use and thus the meaning of an
Linguistics 14: 11731. utterance. Having suggested this strategy, Wittgenstein responds
to an anticipated objection (ibid., 65) that in talking of language
in terms of diferent language-games, he leaves himself open to
F the charge of avoiding the question: What is the essence of a
language-game and thus of language as a whole? His response
to this anticipated objection is to suggest that there is nothing, of
necessity, common to all the activities that he is calling language-
FAMILY RESEMBLANCE
games, as there is nothing, of necessity, common to all uses of the
Ludwig Wittgensteins Philosophical Investigations ([1953] 1958, word game. Rather, when we look and see, diferent uses of
hereafter PI) is one of the most inluential and well-known the word game do not have anything common to all, but rather
texts of twentieth-century philosophy. A number of terms a whole series of overlapping similarities and relationships (ibid.,
have been introduced into the philosophical lexicon by this 66). he notion of a family resemblance is a perspicuous way of
work. Alongside language-game, the related term family characterizing such similarities (1958 [1953], 67).
303
Feature Analysis
he suggestion of family resemblance as a way of understand- Jakobson and Halle (1956), and Noam Chomsky and Halle
ing the similarities between diferent language-games or diferent (1968). (It should be noted, though, that before the 1960s most
uses of words helps bring to consciousness our unacknowledged phonologists maintained that features and phonemes were not
enthrallment to a picture of language-as-necessarily-having- necessarily psychological entities; cf. Sapir 1929.) he distinctive
an-essence, that is, a picture of something being essential to feature theory developed in these works, which maintains that
all instances of language use and to all uses of a word (such as phonemes are composed of bundles of abstract features, such
game). Family resemblance, as with all the terms introduced as [round], [nasal], and [high], stands in opposition to attempts
by Wittgenstein in PI, serves, and should stay subservient to, by connectionists and most phoneticians to deny the exis-
the therapeutic task (pace authors such as R. Bambrough and E. tence of features and other higher-order symbolic categories
Rosch and C. B. Mervis). Methodological readings and employ- in human linguistic cognition (cf. Shattuck-Hufnagel and Klatt
ments of the term, as in Bambrough (1960) and in Rosch and 1979; Soli and Arabie 1979; Lisker 1985; and much work in artic-
Mervis (1975) (see prototype), therefore, fundamentally mis- ulatory phonology [Browman and Goldstein 1989]).
understand the language-game in which family resemblance has
its home in Wittgensteins work. Evidence for Features
Auditory illusions such as the phonemic restoration efect
Phil Hutchinson
(Warren and Obusek 1971) crucially involve reference to higher-
level phonological representations. Evidence for these abstract
phonological representations being composed of features comes
Baker, Gordon. 2004. Wittgensteins Method: Neglected Aspects. from a wide variety of sources (see Vaux 2008 for a review of the
Oxford: Blackwell. Hugely important text on Wittgensteins vision of literature). Most frequently cited by linguists (cf. Tatham 1999;
philosophy. Poeppel, Idsardi, and van Wassenhove 2007) is the patterning of
Bambrough, R. 1960. Universals and family resemblances. Proceedings
phonemes in natural classes with regard to synchronic alter-
of the Aristotelian Society 61: 20722.
nations and phonotactics, diachronic sound changes, and phe-
Rosch, E., and C. B. Mervis. 1975. Family resemblances: Studies in the
internal studies of categories. Cognitive Psychology 7: 573605.
nomena of language acquisition and loss (see synchrony and
Wittgenstein, Ludwig. 1922. Tractatus Logico-Philosophicus. London: diachrony).
Routledge. Class behavior of this sort, the reasoning goes, is eiciently
. [1953] 1958. Philosophical Investigations. Oxford: Blackwell. captured by assuming that the linguistic processes in question
. 1958. he Blue and Brown Books: Preliminary Studies for the operate on features rather than phonemes. For example, the
Philosophical Investigations. Oxford: Blackwell pin-pen merger of /I/ and // before nasal consonants in some
varieties of English (Labov, Ash, and Boberg 2006) makes refer-
ence to the distinctive feature [+nasal] rather than the individual
FEATURE ANALYSIS
nasal phonemes of English, {m n }. Were the latter the case, we
Feature analysis extends to the cognitive domain the interest of would incorrectly predict the existence of similar neutralization
early Greek philosophers such as Democritus and Plato, in iden- rules before arbitrary collections of segments such as {m s h}.
tifying the fundamental building blocks (atoms) of the physical By requiring that phonological generalizations refer to feature
world (Greenberg 1967). Cognitive feature analysis starts from sets, on the other hand, we bring the inventory of possible pho-
the observation that animals appear to organize their perceptual nological rules signiicantly closer to what is actually attested.
worlds in terms of inite sets of discrete abstract elements rather (But see Flemming 2005 for a critique of this reasoning. For the
than simply storing and manipulating unanalyzed streams of pin-pen merger in particular, a phonetician might respond that
continuous gradient stimuli. Evidence for the existence of dis- the restriction to nasals can be explained phonetically, without
crete representational features governed by abstract combi- recourse to phonological features, by the fact that nasalization
natorial principles (as opposed to concrete [e.g., phonetic] renders formant structure less prominent and hence more con-
categories emerging from lower-level gradient processes) comes fusable. Some phonologists, such as Nick Clements, would reply
primarily from areas related to phonology but also from other that phonetics underdetermines the attested range of phonolog-
domains of human cognition, such as vision (Pylyshyn, Blaser, ical patterns.)
and Holcombe 2000); computation of object similarity (Tversky Acquisition studies also support the idea that phonologi-
1977), induction (Sloman 1993), typicality, asymmetry, diver- cal generalizations target feature-based natural classes rather
sity (Heit 1997), speech perception (Stevens 2002; Poeppel, than arbitrary lists of segments. Jenny Safran and Erik hiessen
Idsardi, and van Wassenhove 2007), sign language phonolo- (2003), for instance, show that infants can extract feature-based
gies, morphology, semantics, alphabet processing, and generalizations better than other generalizations, and Anne
vision and object perception (see Morgan 2003 and Vaux 2008 Pycha and colleagues (2003) demonstrate that adult learners
for references). acquire rules that manipulate a single feature faster and more
First explored formally by A. M. Bell (1867) for alphabets, accurately than rules that manipulate two features. D. Swingley
Alfred Kroeber (1909) for kinship systems, and A. G. Bell (1911) and R. Aslin (2002) and Christopher Fennell and Janet Werker
for speech, feature theory was most famously developed for (2003) show that humans are already sensitive to phonological
phonology by Roman Jakobson, Serge Karcevsky, and Nikolaj feature distinctions in the representation of familiar words by the
Trubetzkoy (1928 and [1939] 1958), with signiicant extensions age of 14 months (see also speech perception in infants
by Jakobson, Gunnar Fant, and Morris Halle ([1952] 1963), and phonology, acquisition of).
304
Feature Analysis
Similar feature-based perceptual distinctions have been hough the lions share of phonologists currently appear to
found to take place in the auditory cortex of adults (Phillips, prefer privative features, there are reasons to believe that fea-
Pellathy, and Marantz 2000) and in lexical access (Marslen- tures can be ternary (q.v. Kim 2002). One needs both [+] and []
Wilson and Warren 1994) and masked phoneme identiication speciications, for instance, to account eiciently for exchange
(Miller and Nicely 1955). he fact that humans analyze speech rules such as height inversion in Brussels Flemish (Zonneveld
signals in terms of distinctive features may be connected to the 1976; Fitzpatrick, Nevins, and Vaux 2004). A third, un(der)speci-
quantal nature of auditory responses to sound, such as responses ied value appears necessary to derive i) ternary patterns such
to acoustic discontinuities and closely spaced spectral promi- as Turkish voicing alternations (Inkelas, Orgun, and Zoll 1997),
nences (Chistovich and Lublinskaya 1979; Delgutte and Kiang ii) phonetic interpolation efects (cf. Keating 1988; Cohn 1990;
1984; Stevens 1972, 1989, 2002; Clements and Ridouane 2006). Anderson 1999), iii) permanent underspeciication (cf. Odden
he same features have been implicated in speech produc- 2005 on tone in consonants), and iv) phonetic vacillation in
tion as well, notably in studies of speech errors (Fromkin 1973; un(der)speciied segments (Vaux and Samuels 2005; Hale and
Goldrick 2004). Kissock 2007).
In addition to the natural class patterns discussed, phonolog-
ical evidence for distinctive feature theory comes from consider- Issues in Feature Theory
ations of economy: Languages appear to organize their feature In terms of the substance of the features themselves, a number of
systems so as to minimize the number of features employed to outstanding issues remain:
distinguish among both consonants and vowels (Archangeli
and Pulleyblank 1994; Clements 2008; Poeppel, Idsardi, and van Are the features pure abstractions (as in classical structur-
Wassenhove 2007). alist phonology and Hale and Reiss 2000; cf. Meillet 1903
on the content of phonological reconstructions) or phoneti-
cally based (Jakobson, Fant, and Halle [1952] 1963 et seq.)?
Content of Feature Theories
David Odden (1991, 364) observes on this point that a theory
Feature theories tend not to strive for maximal economy, though,
in which phonological features [i.e., pure abstractions] are
preferring to balance their feature inventories in accordance with
arbitrarily mapped onto phonetic features is more powerful
the following principles (from Tatham 1999):
than one in which phonological and phonetic features are the
he inventory should be able to characterize all contrasting same, since the former property includes the latter.
segments in human languages; If the content of features is phonetic, is it acoustic (Jakobson,
It should be able to capture natural classes in a clear Fant, and Halle [1952] 1963; Flemming 1995), articulatory
fashion; (Chomsky and Halle 1968; Vaux 2008), or both interchange-
It should be transparent with regard to phonetic correlates. ably (Stevens 2003)? An articulatory basis for features makes
sense in light of increasingly robust evidence (pace Ohala
As a rule, Jakobson prioritized simplicity and generality and
1996) that humans cognitively model relevant actions and
therefore had fewer features (e.g., 1215 in Jakobson, Fant and
events in terms of the physical activities necessary to execute
Halle [1952] 1963); Trubetzkoy ([1939] 1958) and Chomsky and
them (motor theory; Ribot 1890; Taylor 1962; Liberman et
Halle (1968) proposed much larger inventories, being more
al. 1963; Tettamanti et al. 2005). he claims of motor theory
interested in capturing phonetic detail and phonological gener-
have recently been bolstered by imaging studies of the activ-
alizations, respectively.
ity of mirror neurons in the premotor area of the monkey
Phonologists also difer as to whether features are:
brain, which are activated by both execution and observa-
binary or equipollent (+/-; Jakobson, Fant, and Halle [1952] tion of manual and oral actions by both irst and third person
1963; Chomsky and Halle 1968); agents (Fogassi and Ferrari 2004; see also mirror systems,
privative/unary (present vs. absent, as with Trubetzkoys imitation, and language).
[1939] 1958 analysis of bilateral oral/nasal vowel contrasts; Are features the smallest units of discrete linguistic represen-
cf. also dependency phonology [Anderson and Ewen 1987], tations, or are they in turn composed of smaller elements,
modiied contrastive speciication [Avery and Rice 1989], either muscular (Halle 1983a) or acoustic (Kingston 2003)?
and, with gestures instead of features, articulatory phonology Are features universal (Chomsky and Halle 1968; Stevens
[Browman and Goldstein 1989]); 1972; Kuhl 2000), language-speciic (Pulleyblank 2001;
Pierrehumbert 2003), or drawn from a universal inventory
ternary (+/-/absent, as in theories that use archiphonemes
but only on the basis of observed phonological contrasts (the
[Trubetzkoy [1939] 1958] or underspeciication [Dresher,
Toronto School [e.g., Dresher, Piggott, and Rice 1994])?
Piggott, and Rice 1994]);
Are features organized in hierarchical trees (Clements 1985;
multivalued (= the gradual oppositions of Trubetzkoy [1939] Halle, Vaux, and Wolfe 2000), classes (Padgett 2002), or not at
1958; cf. also Ladefoged 1971; Lindau 1975; Williamson all (Reiss 2003a)?
1977); Do vowels employ some features that consonants lack
variable (i.e., allowing diferent numbers of values for a given (Trubetzkoy [1939] 1958; Clements 1991)?
feature in diferent contexts; Trubetzkoy [1939] 1958); or Do features encode markedness relations? Is it the case,
a combination of privative features and binary features (Sagey in other words, that + values of features are marked, and
1986; Steriade 1995). values are unmarked? his seems to be the position taken
305
Feature Analysis
by Chomsky and Halle (1968). he Toronto School employs Fogassi, Leonardo, and Pier Francesco Ferrari. 2004. Mirror neurons,
essentially the same system, but with privative rather than gestures and language evolution. Interaction Studies 5.3: 34563.
binary features; as a result, the more features a segment con- Fromkin, Victoria. 1973. Speech Errors as Linguistic Evidence. he
tains, the greater its degree of markedness. (See Steriade 1995, Hague: Mouton.
Goldrick, Matthew. 2004. Phonological features and phonotactic con-
Reiss 2003b, and Clements 2008 for further discussion.)
straints in speech production. Journal of Memory and Language
51: 586603.
Bert Vaux
Greenberg, Joseph. 1967. he irst (and perhaps only) non-linguistic dis-
tinctive feature analysis. Word 23.1: 21420.
Hale, Mark, and Madelyn Kissock. 2007. he phonetics-phonology
Anderson, John, and Colin Ewen. 1987. Principles of Dependency interface and the acquisition of perseverant underspeciication. In
Phonology. Cambridge: Cambridge University Press. he Oxford Handbook of Linguistic Interfaces, ed. Gillian Ramchand
Anderson, Stephen. 1999. he nature of phonetic representations. Talk and Charles Reiss, 81102. Oxford: Oxford University Press.
given at Keio University. Available online at: http://bloch.ling.yale. Hale, Mark, and Charles Reiss. 2000. Phonology as cognition. In
edu/Public/Phonetic_Reps.pdf. Phonological Knowledge: Conceptual and Empirical Foundations,
Archangeli, Diana, and Douglas Pulleyblank. 1994. Grounded Phonology. ed. N. Burton-Roberts, Philip Carr, and Gerard Docherty, 16184.
Cambridge, MA: MIT Press. Oxford: Oxford University Press.
Avery, Peter, and Keren Rice. 1989. Segment structure and coronal Hall, T. Alan, ed. 2001. Distinctive Feature heory. Berlin: Mouton de
underspeciication. Phonology 6: 179200. Gruyter.
Baltaxe, Christiane. 1978. Foundations of Distinctive Feature heory. Halle, Morris. 1983a. On distinctive features and their articulatory
Baltimore: University Park Press. implementation. Natural Language & Linguistic heory 1: 91105.
Bell, Alexander Graham. 1911. he Mechanics of Speech. New York and . 1983b. On the origins of the distinctive features. International
London: Funk and Wagnalls. Journal of Slavic Linguistics and Poetics 27: 7786.
Bell, Alexander Melville. 1867. Visible Speech: he Science of Universal Halle, Morris, Bert Vaux, and Andrew Wolfe. 2000. On feature spread-
Alphabetics. London: Simkin, Marshall. ing and the representation of place of articulation. Linguistic Inquiry
Browman, Catherine, and Louis Goldstein. 1989. Articulatory gestures 31: 387444.
as phonological units. Phonology 6: 20151. Heit, Evan. 1997. Features of similarity and category-based induction.
Chistovich, L., and V. Lublinskaya. 1979. he center of grav- In Proceedings of the Interdisciplinary Workshop on Categorization and
ity efect in vowel spectra and critical distance between the for- Similarity, University of Edinburgh, 11521.
mants: Psychoacoustical study of the perception of vowel-like stimuli. Inkelas, Sharon, Orhan Orgun, and Cheryl Zoll. 1997. Exceptions and
Hearing Research 1: 18595. static phonological patterns: Cophonologies vs. prespeciication.
Chomsky, Noam, and Morris Halle. 1968. he Sound Pattern of English. In Derivations and Constraints in Phonology, ed. Iggy Roca, 393418.
New York: Harper and Row. See especially Chapter 7. Oxford: Oxford University Press.
Clements, G. N. 1985. he geometry of phonological features. Phonology Jakobson, Roman, Gunnar Fant, and Morris Halle. [1952] 1963.
Yearbook 2: 22352. Preliminaries to Speech Analysis. Cambridge, MA: MIT Press.
. 1991. Vowel height assimilation in Bantu languages. Working Jakobson, Roman, and Morris Halle. 1956. Fundamentals of Language.
Papers of the Cornell Phonetics Laboratory 5: 77123. Ithaca, NY: Cornell he Hague: Mouton.
University. Jakobson, Roman, Serge Karcevsky, and Nikolaj Trubetzkoy. [1928] 1971.
. 2008. he role of features in phonological inventories. Quelles sont les mthodes les mieux appropris un expos com-
In Contemporary Views on Architecture and Representations in plet et pratique dune langue quelconque. In Selected Writings (2d
Phonological heory, ed. Eric Raimy and Charles Cairns. Cambridge, expanded ed.), ed. Jakobson, 36. he Hague: Mouton.
MA: MIT Press. Keating, Patricia. 1988. Underspeciication in phonetics. Phonology
Clements, G. N., and Rachid Ridouane. 2006. Quantal phonetics and dis- 5: 27592.
tinctive features: A review. Proceedings of ISCA Tutorial and Research Kim, Yuni. 2002. Phonological features: Privative or equipollent? B.A.
Workshop on Experimental Linguistics, 2830 August 2006, ed. Antonis thesis, Harvard University.
Botinis, 1724. Athens: University of Athens Press. Kingston, John. 2003. Learning foreign vowels. Language and Speech
Cohn, Abigail. 1990. Phonetic and Phonological Rules of Nasalization. 46: 295349.
Ph.D. diss., University of Califonia at Los Angeles. Kroeber, Alfred. 1909. Classiicatory systems of relationship. Journal
Delgutte, B., and N. Kiang. 1984. Speech coding in the auditory nerve: IV. of the Royal Anthropological Institute of Great Britain and Ireland
Sounds with consonant-like dynamic characteristics. Journal of the 39: 7784.
Acoustical Society of America 75: 897907. Kuhl, Patricia. 2000. Language, mind, and brain: Experience alters per-
Dresher, B. E., G. L. Piggott, and K. Rice. 1994. Contrast in phonol- ception. In he New Cognitive Neurosciences (2d ed.), ed. Michael
ogy: Overview. Toronto Working Papers in Linguistics 13.1: iii-xvii. Gazzaniga, 99115. Cambridge, MA: MIT Press.
Fennell, Christopher, and Janet Werker. 2003. Early word learners abil- Labov, William, Sharon Ash, and Charles Boberg. 2006. he Atlas of North
ity to access phonetic detail in well-known words. Language and American English. Berlin: Mouton de Gruyter.
Speech 46.2/3: 24564. Ladefoged, Peter. 1971. Preliminaries to Linguistic Phonetics.
Fitzpatrick, Justin, Andrew Nevins, and Bert Vaux. 2004. Exchange rules Chicago: University of Chicago Press.
and feature-value variables. Presented at he 3rd North American Liberman, Alvin, F. Cooper, K. Harris, and P. MacNeilage. 1963. A
Phonology Conference, Concordia University, Montral, Qubec. motor theory of speech perception. In Proceedings of the Symposium
Flemming, Edward. 1995. Auditory representations in phonology. on Speech Communication Seminar, Royal Institute of Technology,
Ph.D. diss., University of California at Los Angeles. Stockholm. Paper D3, Vol. 2.
. 2005. Deriving natural classes in phonology. Lingua Lindau, Mona. 1975. [Features] for vowels. UCLA Working Papers in
115: 287309. Phonetics 30. Los Angeles: Department of Linguistics, UCLA.
306
Feature Analysis Felicity Conditions
Lisker, Leigh. 1985. he pursuit of invariance in speech signals. Journal Steriade, Donca. 1995. Underspeciication and markedness. In
of the Acoustical Society of America 77: 11991202. Handbook of Phonological heory, ed. John Goldsmith, 11474.
Marslen-Wilson, William, and Paul Warren. 1994. Levels of perceptual Oxford: Blackwell.
representation and process in lexical access: Words, phonemes, and Stevens, Kenneth. 1972. he quantal nature of speech: Evidence from
features. Psychological Review 101: 65375. articulatory-acoustic data. In Human Communication: A Uniied
Meillet, Antoine. 1903. Introduction ltude comparative des langues View, ed. P. Denes and E. David, 5166. New York: McGraw-Hill.
indo-europennes. Paris: Hachette. . 1989. On the quantal nature of speech. Journal of Phonetics
Miller, George, and Patricia Nicely. 1955. An analysis of perceptual con- 17: 346.
fusions among some English consonants. Journal of the Acoustical . 2002. Toward a model for lexical access based on acoustic land-
Society of America 27.2: 33852. marks and distinctive features. Journal of the Acoustical Society of
Morgan, Michael. 2003. Feature analysis. In he Handbook of Brain heory America 111.4: 187291.
and Neural Networks, ed. M. Arbib, 4447. Cambridge, MA: MIT Press. . 2003. Acoustic and perceptual evidence for universal phono-
Odden, David. 1991. Review of generative and non-linear phonology, by logical features. In Proceedings of the XVth International Congress of
Jacques Durand. Language 67.2: 3637. Phonetic Sciences, Barcelona, 338.
. 2005. Introducing Phonology. Cambridge: Cambridge University Swingley, D., and R. Aslin. 2002. Lexical neighborhoods and the word-
Press. Chapter 6, Feature theory, is particularly relevant. form representations of 14-month-olds. Psychological Science
Ohala, John. 1996. Speech perception is hearing sounds, not tongues. 13: 4804.
Journal of the Acoustic Society of America 99: 171825. Tatham, Mark. 1999. Distinctive feature theory. Available online at
Padgett, Jaye. 2002. Feature classes in phonology. Language http://www.essex.ac.uk/speech/teaching-01/documents/df-theory.
78.1: 81110. html.
Phillips, Colin, Tom Pellathy, and Alec Marantz. 2000. Phonological Taylor, James. 1962. he Behavioral heory of Perception. New Haven,
feature representations in auditory cortex. Manuscript, University CT: Yale University Press.
of Delaware. Available online at http://www.ling.udel.edu/colin/ Tettamanti, M., G. Buccino, M. Saccuman, V. Gallese, M. Danna, P.
research/papers/feature_mmf.pdf. Scifo, F. Fazio, G. Rizzolatti, S. Cappa, and D. Perani. 2005. Listening
Pierrehumbert, Janet. 2003. Phonetic diversity, statistical learning, and to action-related sentences activates fronto-parietal motor circuits.
acquisition of phonology. Language and Speech 46: 11554. Journal of Cognitive Neuroscience 17.2: 27381.
Poeppel, David, William Idsardi, and V. van Wassenhove. 2007. Speech Trubetzkoy, Nikolay. [1939] 1958. Grundzge der Phonologie.
perception at the interface of neurobiology and linguistics. In Gttingen: Vandenhoeck and Ruprecht.
Proceedings of the Royal Society of London B 363: 107186. Tversky, Amos. 1977. Features of similarity. Psychological Review
Pulleyblank, Douglas. 2001. Deining features and constraints in 84: 32752.
terms of complex systems: Is UG too complex? Paper presented at Vaux, Bert. 2008. he role of features in a symbolic theory of phonol-
the Workshop on Early Phonological Acquisition, Carry-le-Rouet, ogy. In Contemporary Views on Architecture and Representations in
Marseilles, France. Phonological heory, ed. Charles Cairns and Eric Raimy. Cambridge,
Pycha, Anne, Pawel Nowak, Eurie Shin, and Ryan Shosted. 2003. MA: MIT Press.
Phonological rule-learning and its implications for a theory of vowel Vaux, Bert, and Bridget Samuels. 2005. Laryngeal markedness and aspi-
harmony. In Proceedings of WCCFL 22, ed. M. Tsujimura and G. ration. Phonology 22: 395436.
Garding, 10114. Somerville, MA: Cascadilla Press. Warren, Richard, and C. Obusek. 1971. Speech perception and phone-
Pylyshyn, Zenon, E. Blaser, and A. Holcombe. 2000. Tracking an object mic restorations. Perception and Psychophysics 9: 35862.
through feature space. Nature 408: 1969. Williamson, Kay. 1977. Multivalued features for consonants. Language
Reiss, Charles. 2003a. Quantiication in structural descriptions: Attested 53.4: 84371.
and unattested patterns. Linguistic Review 20: 30538. Zonneveld, Wim. 1976. A phonological exchange rule in Flemish
. 2003b. Accepting markedlessness: How non-phonological Brussels. Linguistic Analysis 2: 10914.
symbolic computation shapes trends in attested phonological sys-
tems. In Proceedings of the 29th Annual Meeting of the Berkeley
Linguistics Society, Parasession on Phonetics Sources of Phonological FELICITY CONDITIONS
Patterns: Synchronic and Diachronic Explanations, 56981.
he distinction between felicity conditions and truth con-
Ribot, hodule. 1890. Psychologie de lattention. Paris: Alcan.
Safran, Jenny, and Erik hiessen. 2003. Pattern induction by infant lan-
ditions was introduced by J. L. Austin as the basis for dis-
guage learners. Developmental Psychology 39: 48494. tinguishing between two types of utterances, performative
Sagey, Elizabeth. 1986. he representation of features and relations and constative, respectively. As such, it is central to the view
in non-linear phonology. Ph.D. diss., Massachusetts Institute of within pragmatics that utterances perform actions, an idea
Technology. that gave rise to the theory of speech-acts. Constatives such
Sapir, Edward. 1929. he status of linguistics as a science. Language as Snow is white have truth conditions, that is, they assign truth
5.4: 20714. or falsity to the proposition expressed (Snow is white is true if
Shattuck-Hufnagel, S., and Dennis Klatt. 1979. he limited use of dis- and only if snow is white). Performatives, on the other hand,
tinctive features and markedness in speech production: Evidence from have felicity conditions (i.e., pragmatic conditions that must
speech error data. Journal of Verbal Learning and Verbal Behavior
be met if the utterance is to achieve its intended goal) and will
18: 4155.
only function felicitously or happily if these conditions of use
Sloman, S. 1993. Feature-based induction. Cognitive Psychology
are met.
25: 23180.
Soli, S., and P. Arabie. 1979. Auditory versus phonetic accounts of So, for an utterance such as I hereby pronounce you husband
observed confusions between consonant phonemes. Journal of the and wife to be felicitous, the following conditions must apply
Acoustical Society of America 66.1: 4659. (Austin 1962, 15):
307
Felicity Conditions Field (Bourdieu)
(1) a) here must exist an accepted conventional procedure to get H to do something, it follows that this must be a future act
having a certain conventional efect (in our example, chang- (propositional content), that S really wants H to carry out this
ing the wording to I reckon youre man and wife would make future act (sincerity condition), and that H would not have done
it infelicitous); and b) the particular persons and circum- so spontaneously (preparatory condition). In addition to these
stances in a given case must be appropriate for the invocation conditions, there are general felicity conditions applying to any
of the particular procedure invoked. speech-act: Both S and H must be able to speak and understand
(2) he procedure should be executed by all participants cor- the language, they must have no pathological conditions, and
rectly and completely. Stopping halfway through a marriage the act should not occur in a parasitical context, such as a joke
ceremony, for example, would result in its being null and void. or a play.
On the basis of these criteria, Searle (1975) distinguishes ive
(3) When the participants, as is often the case, are required
classes of acts: assertives or representatives (which describe
to have speciic thoughts or feelings or when any subsequent
a state of afairs), directives (which attempt to get H to do
conduct is speciied, the participants involved should actu-
something), commissives (which commit S to some future
ally (intend to) have these requisite thoughts and feelings and
course of action), expressives (which express Ss psychological
conduct themselves appropriately.
state), and declarations (which efect institutional changes). As
Austin later abandoned the distinction between constatives and S. C. Levinson points out, however, these classes are not really
performatives in favor of a model in which all utterances have built in any systematic way on felicity conditions (1983, 240).
felicity conditions. he question remains how hearers determine the speakers
Acts that fail to meet criteria 1 and 2 are labeled misires, and intended illocutionary force of the utterance in the absence of
those that fail to meet criterion 3 abuses. his is an important any linguistic devices that mark it (such as performative verbs),
distinction: Misires simply result in the intended act not being as is the case in indirect speech-acts.
realized because of some deiciency in the procedure (e.g., the
Ronald Geluykens
speaker not having the requisite authority), whereas abuses
involve a conscious decision on the part of either the speaker (S)
or the hearer (H) to ignore or manipulate the action (e.g., when S
makes a promise that he/she does not intend to keep). Austin, J. L. 1962. How to Do hings with Words. Oxford: Oxford University
It should be noted that for Austin, the so-called uptake of the Press.
action is an integral part of its being felicitous. So, for instance, Levinson, S. C. 1983. Pragmatics. Cambridge: Cambridge University
Press.
when S ofers H something, as in Have a cookie, not only must S
Searle, J. R. 1969. Speech Acts: An Essay in the Philosophy of Language.
have the requisite intention of wanting to do something for H (as
stipulated in condition 3a), but H must actually act accordingly, . 1975. A taxonomy of illocutionary acts. In Language, Mind,
that is, perform the appropriate uptake and respond to the ofer and Knowledge, ed. K. Gunderson, 34469. Minneapolis: University of
by either accepting or refusing. his notion is closely related to Minnesota Press.
the perlocution associated with the action, which is thus an
integral part of performing a speech-act happily, an aspect often
FIELD (BOURDIEU)
neglected by later speech-act theorists.
J. R. Searle (1969) used various classes of felicity conditions Field is one of Pierre Bourdieus two fundamental concepts
as the main criterion for distinguishing between diferent types the other being habitus and is addressed in several contexts
of illocutionary acts. First of all, there are restrictions on the in his work, with precise deinitions given. For example:
propositional content expressed through the speech-act, which,
A ield may be deined as a network, or a coniguration, of objec-
depending on the type of illocutionary force involved,
tive relations between positions. hese positions are objectively
must be of a certain type, such as, for instance, a future act by S
deined, in their existence and in the determinations they impose
(promises) or a past act by H (complaints). Secondly, there are
upon their occupants, agents or institutions, by their present and
preparatory conditions associated with speech-act types, that
potential situation (situs) in the structure of the distribution of
is, speciic social preconditions; for a command to be felicitous,
species of power (or capital) whose possession commands access
for instance, it is a precondition that S is in a position of author-
to the speciic proits that are at stake in the ield, as well as by
ity over H. Otherwise, the command will be inefective. hirdly,
their objective relation to other positions (domination, subordi-
Searles sincerity condition refers to the fact that S is supposed
nation, homology, etc.). (Bourdieu 1992, 97)
to be sincere when performing a speech-act in order for it to be
felicitous, which crucially depends on the speaker having the In fact, the notion of ield appears throughout Bourdieus
appropriate beliefs or feelings (as Austin had stated already). So, work. For example, his early studies were very much inspired
when uttering a promise, S must sincerely intend to carry out by personal experiences in the ield, in this case, Algeria and
the future act. he fourth and most crucial criterion Searle calls his home environment in the Barn, southwest France (see
the essential condition, which summarizes the appropriate illo- Bourdieu [1958] 1961 and 1962). Both of these works included
cutionary force of the utterance. So, for a promise, the speaker studies of language (see Grenfell 2006). However, the concept of
assumes an obligation to commit to the carrying out of the action ield as an analytic tool developed slowly in the course of further
expressed in the promise. It is the essential condition that deter- studies on education (see Bourdieu [1970] 1977 and [1964] 1979)
mines all the others: If something is a request, that is, an attempt and culture (see Bourdieu [1965] 1990 and [1966] 1990) in the
308
Field (Bourdieu)
1960s. Increasingly, ield became Bourdieus main theoretical although all ields are in some ways interlocking, they all possess
tool in analyzing a wide range of social phenomena. a degree of independence or autonomy. hey are also bounded
Bourdieus early work was developed in opposition to two with strict rules of entry. Moreover, they are ruled by speciic
salient intellectual traditions, both of which were highly inlu- values, logics, and behaviors (mostly implicit but also explicit),
ential during his formative years (the 1950s): existentialism and which determine the legitimate ways of thinking and doing
structuralism. Existentialism is best represented by the things within the particular ield. Such legitimation is a nec-
work of the French philosopher Jean-Paul Sartre, with its phi- essary part of the functioning of the ield according to its own
losophy of personal liberation through the subjective choices logic and purpose, as well as representing the interests of those
we make in deining our lives. Structuralism is best represented who hold position within it. Such social features also deine the
by the work of the anthropologist Claude Lvi-Strauss and his orthodoxy of the ield the doxa against which other nonor-
study of the objective rules that can be found across cultures thodox forms oppose themselves heterodoxa. Similarly, ields
and which govern human behavior taboos, myths, and so on. have a speciic orthodox language, which can be expressed at
here is a philosophy of language at the base of both of these any level: phonetic and syntactic, as well as gesture and
traditions, albeit from an objective and subjective point of view. expression.
Bourdieu referred to the divide between objectivism and sub- Fields are dynamic and in a constant state of lux; whole
jectivism in the social sciences as the most fundamental, and sections of society can be understood historically as the move
the most ruinous ([1980] 1990, 25). His entire theory of prac- from one ield structure and logic to another, for example, in
tice can be seen as an attempt to bridge this divide. He deined the way the ruling class in France moved from a situation of
his approach as a science of the dialectical relations between basing their power on inherited money and industrial wealth
objective structures and the subjective dispositions within to one where intellectual and academic qualiications became
which these structures are actualised and which tend to repro- the main form of social legitimation. For Bourdieu, the cur-
duce them [1972] 1977, 3), and the relationship between ield rency of ields was capital social, economic, and cultural.
and habitus as one of ontological complicity (1982a, 47). he Diferent conigurations are held by individuals and groups
same complicity can be seen in the relation between langue and in the ield and can be used to buy social positioning. Fields
parole. are therefore also the sites of struggle and conlict a process
he notion of structure afords just such a reconciliation. Both by which they evolve according to the salient sociostructural
habitus and ield are structured. In other words, social spaces forces of society at large.
need to be understood as diferentiated and thus structural Fields can be studied internally or externally. In both cases,
in essence. Similarly, individual cognition arises from, gener- position is all important. Bourdieu lists this method of ield
ates, and is generated by mental structures that are also essen- analysis (1992: 1047) as
tially structured because of their systems of diferentiation. In
(1) Analyze the position of the ield vis--vis the ield of
a seminal paper in 1966 Intellectual Field and Creative Project
power.
([1966] 1971) Bourdieu builds on the discovery of the historian
E. Panofsky that there was a link between Gothic art, for exam- (2) Map out the objective structural relations between the
ple in the design of cathedral architecture, and the mental hab- positions occupied by those in the ield.
its of those involved. In other words, each was symptomatic of (3) Analyze the habitus of those involved.
the other. Bourdieu used this principle to argue that there was
In this way, both the logic of practice of the ield and its pro-
a structural homology between subjective thought and objec-
cedures can be rendered visible, making what is normally mis-
tive surroundings, the latter most noticeable in forms of social
recognized open to public scrutiny. Studying language in this
organization. Such homologies exist because they are both gen-
way allows for a focus on linguistic features, while at the same
erated by and generate the logic of practice of the ield, itself
time relating them to ield position and the position of the ields
deined in terms of its substantive raison dtre. For Bourdieu,
within the overall network of ields (see Encrev 1983; Fehlen
therefore, social and mental structures were both structured
2004; Grenfell 2004b; and Vann 2004). Field positions and posi-
and structuring concrete and dynamic. Similarly, language is
tioning can depend on economic power but also on symbolic
both structured and structuring.
power acquired from taste, style, and language capital that
Much of Bourdieus own work subsequently brought this
buys social prestige. Ultimately, ield language articulates and
understanding and methodology to a series of ield stud-
expresses the power order found within it. In social ields, this is
ies: for example, the academic ield ([1984] 1988); the artistic
the language of the state and dominant social classes.
ield ([1987] 1993); the religious ield (1982b); the judicial ield
([1986] 1987); the bureaucratic ield ([1989] 1996); the scientiic Michael Grenfell
ield (1975); the cultural ield (1993); the literary ield ([1992]
1996); the economic ield ([2002] 2005); and the political ield WORKS CITED AND SUGGESTIONS FOR FURTHER READING
(1981). Even the academic discipline of applied linguistics can
Bourdieu, Pierre. [1958] 1961. Sociologie de lAlgrie. New rev. ed.
be regarded as a ield (see Grenfell 2004a). Many of these are
Paris: Que Sais-je.
shown to be deined in terms of overarching power structures
. 1962. he Algerians. Trans. A. C. M. Ross. Boston: Beacon.
in society: class, the state, economic interests. However, there , with Jean-Claude Passeron. [1964] 1979. he Inheritors: French
are also ields within ields microcosms which exhibit the Students and their Relation to Culture. Trans. R. Nice. Chicago: University
same features as macro ields but in local contexts. For example, of Chicago Press.
309
Field (Bourdieu) Film and Language
, with Luc Boltanski, Robert Castel, Jean Claude Chamboredon, Grenfell, Michael, and David James. 1998. Bourdieu and Education: Acts
and Dominique Schnapper. [1965] 1990. Photography: A Middle-Brow of Practical heory. London: Falmer.
Art. Trans. S. Whiteside. Oxford: Polity. Swartz, David. 1997. Fields of struggle for power. In Culture and
. 1966. Champ intellectual et project crateur. Les Temps Power: he Sociology of Pierre Bourdieu, 11742. Chicago: University of
Modernes 246: 865906. Chicago Press.
. [1966] 1971. Intellectual ield and creative project. In Vann, Robert. 2004. An empirical perspective on prac-
Knowledge and Control: New Directions for the Sociology of Education, tice: Operationalising Bourdieus notions of linguistic habitus. In
ed. M. F. D. Young, 16188. London: Collier Macmillan. Pierre Bourdieu: Language, Culture and Education, ed. M. Grenfell and
, with Alain Darbel and Dominique Schnapper. [1966] 1990. he M. Kelly, 7384. Bern: Peter Lang.
Love of Art: European Art Museums and heir Public. Trans. C. Beattie
and N. Merriman. Oxford: Polity.
. with Jean-Claude Passeron. [1970] 1977. Reproduction in FILM AND LANGUAGE
Education, Society and Culture. Trans. R. Nice. London: Sage.
. [1972] 1977. Outline of a heory of Practice. Trans. R. Nice. he thought that ilm is structured like a language is pervasive in
Cambridge: Cambridge University Press. ilm studies. It even has echoes in ordinary talk, such as when we
. 1975. he speciicity of the scientiic ield and the social conditions speak of the grammar of ilm, of ilm as text, and of reading a
of the progress of reason. Social Science Information 14.6: 1947. ilm. An early formulation is due to the Soviet director and theorist
. [1980] 1990. he Logic of Practice. Trans. R. Nice. Oxford: Polity. Vsevelod Pudovkin (1958), who argued that each shot is akin to a
. 1981. La representation politique: Elments pour une tho- word, that the sequence of shots is like a sentence, and that the
rie du champ politique. Actes de la receherche en sciences sociaels editing relations constitute a syntax. A more sophisticated ver-
37: 324.
sion of the claim was developed by Sergei Eisenstein (1949), who
. 1982a. Leon sur une leon. Paris: Les Editions de Minuit.
held ilms to be like hieroglyphic languages and ideograms.
, with Monique de Saint Martin. 1982b. La sainte famille.
Lpiscopat franais dans le champ de pouvoir. Actes de la recherch
he idea of ilm as a language became dominant in ilm stud-
en sciences sociales 44/45: 253. ies through the inluence of semiotics in the 1960s, particu-
. [1984] 1988. Homo Academicus. Trans. P. Collier. Oxford: Polity. larly through the work of Christian Metz, who provided the most
. [1986] 1987. he force of law: Toward a sociology of the judicial nuanced defense of the view. Metz rejects Pudovkins analogy of
ield. Hastings Journal of Law 38: 20948. the shot with the word since a shot gives information, as does a
. [1987] 1993. Manet and the institutionalisation of anomie. In he sentence but unlike a word. Indeed, he holds that the word, which
Field of Cultural Production, ed. and introd. Randall Johnson, 23853. is the unit of language, is missing; the sentence, which is the unit of
Oxford: Polity. speech, is supreme (Metz 1974, 69). he main structural parallel
. [1989] 1996. he State Nobility: Elite Schools in the Field of Power. to language lies in the editing relations, which constitute a gram-
Trans. L. C. Clough. Oxford: Polity.
mar of ilm. For instance, the alternate syntagma (parallel editing)
, with Loc Wacquant. 1992. An Invitation to Relexive Sociology.
by which two scenes are intercut with each other in an A-B-A-B
Trans. L. Wacquant. Oxford: Polity.
. [1992] 1996. he Rules of Art. Trans. s. Emanuel Oxford: Polity.
sequence is said by Metz to be a code denoting simultaneity.
. 1993. he Field of Cultural Production. Oxford: Polity. Although the view of ilm as a language is pervasive within
. [2000] 2005. he Social Structures of the Economy. Oxford: Polity. ilm studies, some ilm scholars are critics of the view, notably
Dreyfus, Hubert, and Paul Rabinow. 1993. Can there be a science Stephen Prince (1993), who points out that the empirical evi-
of existential structure and social meaning? In Bourdieu: Critical dence is at odds with the claim that ilm images have a merely
Perspectives, ed. C. Calhoun, E. LiPuma, and M. Postone, 3544. conventional relation to their denotata. Moreover, analytic
Oxford: Polity. philosophers of ilm have almost universally rejected the claim
Encrev, Pierre. 1983. Le sens en pratique. Construction de la rfrence that ilm is structured like a language (for instance, Currie 1995,
et structure social de lintraction dans le couple question/rponse. 11337; Harman 1977).
Actes de la Recherche en Sciences Sociales 46: 330.
he reasons for this rejection are straightforward. A language
Fehlen, Fernand. 2004 Pre-eminent role of linguistic capital in
comprises a vocabulary and a syntax. he vocabulary must be
the reproduction of the social space in Luxembourg. In Pierre
Bourdieu: Language, Culture and Education, ed. M. Grenfell and M.
inite so that it can be learned, and the lexical units that com-
Kelly, 6172. Bern: Peter Lang. prise it refer to objects and properties by virtue of conventions.
Grenfell, Michael. 1993. he linguistic market of Orlans. In he syntax consists of a inite set of rules for combining lexi-
France: Nation and Regions, ed. M. Kelly and R. Bock, 7299. cal units together. From this inite basis, a potentially ininite
Southampton: ASM & CF. number of meaningful sentences can be generated by recursive
. 2004a. Language: Construction of an object of research. In procedures (see recursion, iteration, and metarepre-
Pierre Bourdieu: Language, Culture and Education, ed. M. Grenfell and sentation), and the meaning of these sentences derives sys-
M. Kelly, 2740. Bern: Peter Lang. tematically from the way that vocabulary is combined by syntax
. 2004b. Bourdieu in the classroom. In Culture and Learning: Access
(see compositionality).
and Opportunity in the Curriculum, ed. M. Olssen, 4972. Wesport,
In contrast, ilms are composed of photographs, which have
CT: Greenwood.
a causal, and therefore nonconventional, relation to their deno-
. 2006. Bourdieu in the ield: From the Barn to Algeria a timely
response. French Cultural Studies 17: 22339. tata. Vocabulary is inite, but there is no limit to the number
Grenfell, Michael, and Cheryl Hardy. 2003. Field manoeuvres: Bourdieu of photographs that can be taken. And there is nothing like a
and the young British artists. Space and Culture 6.1: 1934. minimal lexical unit in a photograph: Each part of a photograph
. 2007. Art Rules. London: Berg. of a cat denotes a part of the cat, down to the limits of visual
310
Film and Language Filters
indiscriminability, but if the cat is called Felix, this is a mini- FILTERS

mal unit, since its parts, such as lix, do not denote part of the
Filters are output constraints of the form *xy that rule out illicit
cat. Films are composed of pictures, but pictures do not have a
sequences generated by a computational system. Filters can be
merely conventional relation to the world. From acquaintance
added to any computational system, allowing hardware to
with at most a small number of pictures, one can go on to inter-
be kept maximally general. Filters play diferent roles in difer-
pret correctly any other picture in the same style, a feature that
ent theories, ranging from statements ensuring satisfaction of a
Flint Schier (1986) called natural generativity, grounded on the
universal property (the case ilter [Chomsky 1981; Rouveret and
fact that we use our object-recognitional capacities to recognize
Vergnaud 1980), which requires an (overtly realized) noun phrase
pictures of those objects. In contrast, acquaintance with a small
(NP) argument to be associated with a case coniguration), to the
number of words in a language does not allow one correctly to
ranked violable output constraints of optimality theory.
interpret other words in that language: Language lacks natural
Many named ilters, covering diverse phenomena, date from
generativity.
the 1970s and 1980s. Although they appear to be relections of
Nor should we hold that the shot is analogous to the sentence.
the underlying computational system and architecture, theoreti-
Indeed, Metzs version of the claim is, taken strictly, incoherent
cal understanding must await future research.
since he denies that there is anything corresponding to words
he doubly-illed C ilter (Chomsky and Lasnik 1977) excludes
in cinema; yet a sentence is by deinition composed of words.
the co-occurrence of a pronounced wh-phrase and complemen-
Moreover, there is nothing in a photograph corresponding to
tizer in Spec (speciier/subject position), CP (complementizer
subject-predicate structure. A photograph of a black cat lacks
phrase) and C (complementizer) (I wonder who (*that/*whether)
parts that pick out separately the cat and blackness, unlike the
she saw, *the [man [who that] you saw ). his ilter relates to the
sentence the cat is black.
more general question of how the distribution of overt and covert
he claim that ilms have a syntax is also untenable: if ilms have
material over hierarchical structures is determined.
nothing corresponding to words, they cannot have syntax since
he that-t ilter (Perlmutter 1971) prohibits an extraction site
syntax is what couples words together. Film structure does involve
next to the C (that, for): who do you think (*that) t left). hat-t
conventions, such as the alternate syntagma. But not all commu-
relects a very general, and theoretically not understood, prob-
nication conventions are structured linguistically: Pointing is a
lem with subject extraction.
convention, but pointing, though it is a referential device, is not
he person case constraint (Perlmutter 1971) expresses the
linguistic. And, in general, communication need not be linguistic,
interaction of case forms and person marking: It bans irst and
as is shown by animal communication. Nor does the alternate
second person accusative clitics or agreement markers in the
syntagma denote simultaneity in the way that a linguistic phrase
presence of a (third person) dative clitic or agreement marker.
such as at the same time does. he alternate syntagma joins
(French: le(ACC)-lui (DAT) but *me(ACC)-lui(DAT) *me.ACC- te.DAT ).
shots together; the linguistic phrase joins sentences or clauses
It is widely assumed to follow from universal person and case
together. However, as just noted, shots are not like sentences (or
hierarchies, either structurally hardwired or encoded as soft
clauses), and Metzs view that shots are like sentences that are not
constraints.
composed of words is incoherent. So, though it is a convention,
Some ilters deal with restrictions on recursion. he doubl-ing
the alternate syntagma is not a linguistic convention.
ilter (Ross 1977) restricts (complement) recursion of -ing forms
Philosophers of ilm have given good reasons to reject the ilm
in English. A verb like continue can combine with an -ing comple-
as language hypothesis. Film is a form of communication that, in
ment (it continued raining), unless continue is the complement
its visual dimension, communicates as pictures do: In terms of
of an -ing selecting head (*its continuing raining). he head-inal
Peirces semiotics, ilms are composed of icons, not symbols (see
ilter (Williams 1982) restricts recursion on the nonrecursive side
icon, index, and symbol).
of the head. (a man [afraid of dogs], *an [afraid of dogs] man.)
Berys Gaut here is no generally accepted theoretical understanding of how
restrictions on recursion emerge from the derivation (but see
WORKS CITED AND SUGGESTIONS FOR FURTHER READING Koopman 2002).
Currie, Gregory. 1995. Image and Mind: Film, Philosophy, and Cognitive Hilda Koopman
Science. Cambridge: Cambridge University Press.
Eisenstein, Sergei. 1949. he cinematographic principle and the ideo-
gram. In Film Form, trans. Jay Leyda, 3844. San Diego, CA: Harcourt
Brace Jovanovich. Bresnan, Joan, Shipra Dingare, and Christopher D. Manning. 2001. Soft
Harman, Gilbert. 1977. Semiotics and the cinema: Metz and Wollen. constraints mirror hard constraints: Voice and person in English and
Quarterly Review of Film Studies 2: 1524. Lummi. Proceedings of the LFG 01 Conference. Stanford, CA: CSLI
Metz, Christian. 1974. Film Language: A Semiotics of the Cinema. Trans. Publications.
Michael Taylor. New York: Oxford University Press. Chomsky, Noam. 1981. Lectures on Government and Binding. Dordrecht,
Prince, Stephen. 1993. he discourse of pictures: Iconicity and ilm stud- the Netherlands: Foris.
ies. Film Quarterly 47.1: 1628. . 1995. he Minimalist program. Cambridge, MA: MIT Press.
Pudovkin, V. I. 1958. Film Acting and Film Technique. Trans. Ivor Chomsky, Noam, and Howard Lasnik. 1977. Filters and control.
Montagu. London: Vision. Linguistic Inquiry 8: 425504.
Schier, Flint. 1986. Deeper into Pictures: An Essay on Pictorial Koopman, Hilda. 2002. Derivations and complexity ilters. In Dimensions of
Representation. Cambridge: Cambridge University Press. Movement, ed. A. Alexiadiou et al., 15189. Amsterdam: John Benjamins.
311
Focus Foregrounding
Perlmutter, David. 1971. Deep and Surface Constraints in Syntax. New use cleftlike sentences or, more generally, marked constituent
York: Rinehart and Winston. order in corrections or answers (often in addition to intonation);
Ross, John. 1972. Doubl-ing. Linguistic Inquiry 3: 6186. also common are special morphemes, as well as prosodic phrase
Rouveret, Alain., and Jean-Roger Vergnaud. 1980. Specifying reference boundaries, to mark the edge of a focus.
to the subject: French causatives and conditions on representations,
Linguistic Inquiry 11: 97202. Daniel Bring
Williams, Edwin. 1982. Another argument that passive is transforma-
tional. Linguistic Inquiry 13:1603. WORKS CITED AND SUGGESTIONS FOR FURTHER READING
Beaver, David, and Brady Clark. 2008. Sense and Sensitivity: How Focus
FOCUS Determines Meaning. Malden, MA: Blackwell.
Bring, Daniel. 2007. Intonation, semantics and information structure.
Focus refers to a constituent within a sentence that is highlighted
In he Oxford Handbook of Linguistic Interfaces, ed. Gillian Ramchand
or emphasized by grammatical means. English sentences with
and Charles Reiss, 44573. Oxford: Oxford University Press.
marked accent patterns provide prototypical examples, like (1) Kadmon, Nirit. 2001. Formal Pragmatics: Semantics, Pragmatics,
(focus on the direct object): Presupposition, and Focus. Malden, MA: Blackwell.
Rooth, Mats. 1996. Focus. In he Handbook of Contemporary Semantic
(1) hey ordered COfeeF at the bar.
heory, ed. Shalom Lappin, 27197. Blackwell.
Focus realization typically has a high pitch accent, a local Schwarzschild, Roger. 1999. GIVENness, avoidF and other constraints
maximum of the voices fundamental frequency, on the syllable on the placement of accent. Natural Language Semantics 7: 14177.
co (indicated by capitals). As important as, or perhaps even more
important than, the pitch accent on the focus itself is the lack of
FOREGROUNDING
pitch accents following it, for example, on bar.
he privative syntactic feature F is a common means of focus Foregrounding is the patterned deviation from anticipated lan-
representation in the syntax. he F-markers in the tree (its focus guage, which is an important characteristic of literariness. In
structure) mediate between semantic interpretation and pro- general usage, the term sometimes describes the many conven-
sodic realization. tional means that language provides for certain information to
he pragmatic functions of focus seem wide and varied, and be made prominent in discourse, such as by variation of inten-
are often characterized in vague terms such as speakers high- sity in speech or by given/new organization in syntax. In liter-
lighting, most important information, evoking alternatives, and ary analysis, however, foregrounding refers speciically to the
so on. More formal approaches toward focus interpretation start efects achieved by textual devices that interrupt the automatic
from certain rather solid facts about the discourse distribution processes of linguistic understanding. Foregrounding shifts a
of focus: In an answer to a constituent question, the phrase cor- readers or listeners attention away from linguistic meaning to
responding to the question phrase is focused: (1) can answer the linguistic form so as to impart the sensation of things as they are
question What did they order at the bar? but not, for example, perceived and not as they are known, in the Russian formalist
Where did they order cofee? Who ordered cofee? or What Viktor Shklovskys words ([1917] 1988, 20).
happened? Similarly, in corrections, the item that diers from At all levels of linguistic organization, perceptual processes
the corrected sentence is focused: (1) can be a correction to entail anticipation. Foregrounding devices defeat the process-
hey ordered beer at bar but not, for example, hey spilled ing beneits of anticipation either by creating unconventional
cofee at the bar. patterns or by deviating from established patterns. Sonic-
To predict such facts, the notion of alternatives proves useful. level devices, such as alliteration, rhyme, assonance, (see
Focused cofee in (1) has, among others, beer, milk, steak, and so rhyme and assonance), and meter, are typically adduced as
on as alternatives; nonfocused elements (the background) dont examples of foregrounding because the repetition of individual
introduce alternatives. By pointwise combination, the whole phones or regular metrical structures falls outside of languages
sentence gets assigned an alternative set statements of the form conventionalized means of sense making. Once these uncon-
they ordered X at the bar, where X ranges over the alternatives ventional patterns are detected, some processing resources
to cofee. Roughly, the alternative set of an answer must corre- are diverted to construing their signiicance. Devices operating
spond to other conceivable answers to the question; the alterna- at more complex levels of structure must work diferently, as
tive set of a correction must include the meaning of the utterance ellipsis, metaphor, and irony, for example, regularly occur
corrected. in normal speech. Foregrounding with these devices is said to
Alternative semantics, as it is called, also accounts well for be achieved by an unusual degree of diiculty, which requires
cases in which focus inluences truth conditional content, the reader consciously to attend to construal. Deviation from
in association with particles such as only: (2) excludes ordering expected patterns can also be achieved at any level of linguistic
juice, beer, and so on at the bar (i.e., other members of the alter- organization. Creative punctuation, neologism, oxymoron, con-
native set), but not ordering cofee elsewhere, or ordering sau- versational infelicity, and unusual narrative structuring are just a
sages at the grill: few of the many devices that have been recognized as deautoma-
tizing linguistic perception.
(2) hey only ordered COfeeF at the bar.
If foregrounding involves the creation of new patterns or
If one accepts these uses of focus as deinitional, one can deviation from expected patterns, one must ask what gives rise
explore focus realization beyond pitch accents. Many languages to the expectations against which foregrounding is recognized.
312
Forensic Linguistics
In most cases, expectations arise from experience with every- lack of similarity to earlier correspondence by the deceased, this
day language, such as those evidenced by garden-path phe- analysis may be used to support a murder charge, in conjunc-
nomena in syntactic processing (see psycholinguistics), tion with nonlinguistic evidence. Most forensic linguists believe
but expectations can also arise from genre-speciic knowledge. that their evidence should not be used for positive identiication
Emily Dickinsons verse, for example, often metrically evokes the (to identify a particular person as the perpetrator of the language
religious hymns that would have been familiar to her commu- crime), but either for negative identiication (to exclude one or
nity. he masculine end-rhyme patterns of these hymns estab- more persons) or to provide supporting evidence for either a
lish an expectation against which the slant rhymes of her verse positive or negative identiication.
stand out. Another forensic application of discourse analysis is in the
Foregrounding devices are not unique to literary texts but can examination of covertly recorded conversations between under-
be found in traditionally nonliterary genres, such as advertising cover agents and suspects, as in drug cases. Topic analysis can
copy or political speech. In literary texts, however, they usually trace the extent to which a suspect initiates talk of criminal plans,
participate in larger patterns of meaning or coherence, such as for example, or is merely an interlocutor in a conversation where
those established by thematization or iconicity (see literari- such planning is initiated by the agent.
ness for a discussion of the latter two terms). Empirical research Forensic linguists also analyze comprehension diiculties
has established some support for the contribution of foreground- in police interviews, mostly of suspects who speak English as a
ing devices to the literariness of a text. In studies, segments of second language or second dialect. his is of particular rel-
short stories with a higher degree of foregrounding were read at evance to questions involving suspects understanding of their
a slower rate and were rated as more striking and more emotion- rights, including the right to silence and to a lawyer. Until the
ally evocative than segments with low foregrounding (Miall and widespread introduction of recording of police interviews with
Kuiken 1994). suspects, discourse analysis had also been used to examine
claims of fabricated confessions to police. hese included the
Claiborne Rice
notorious UK cases of the Guildord Four and Birmingham Six, in
which forensic discourse analysis was part of the complex legal
WORKS CITED AND SUGGESTIONS FOR FURTHER READING cases that resulted in overturned convictions of people found
Miall, David S., and Don Kuiken. 1994. Foregrounding, Defamiliarization, to be falsely accused of involvement in bombings by the Irish
and Afect: Response to Literary Stories. Poetics Today 22: 389407. Republican Army.
Shklovsky, Viktor. [1917] 1988. Art as technique. Trans. Lee T. Lemon he analysis of complex legal language often involves exami-
and Marion J. Reis. In Modern Criticism and heory, ed. David Lodge, nation of morphology, syntax, and semantics in arguments
1630. London: Longman. over legal interpretation (including the comprehensibility
Van Peer, Willie. 1986. Stylistics and Psychology. London: Croom Helm.
of instructions to the jury) or trademarks. A famous U.S. example
of the latter concerned the use of the Mc preix to connote budget
quality in the name McSleep Inns for a new hotel chain. While
FORENSIC LINGUISTICS
forensic linguists analyzed its generic use as a productive mor-
Forensic linguistics refers to the use of linguistic expert evidence pheme, the McDonalds corporation/company argued that they
in legal proceedings, and more broadly, to linguistic research in alone owned the use of this preix with this meaning.
legal contexts. Most forensic linguistic work published in English Forensic linguists who testify in court often face the contradic-
pertains to the common law adversarial legal system of the United tion between good scholarship, which entails openness to com-
Kingdom, the United States, Canada, and Australia. Any area of peting theories and explanations, and the adversarial nature of
linguistics can have a forensic application. In order to do foren- the legal system, which can make such openness problematic.
sic linguistics, a person must qualify as a linguist, specializing in Research on courtroom hearings shows how rules of evi-
a particular area, such as phonetics or sociolinguistics. dence control and construe the contributions of witnesses in a
here are a few graduate programs in forensic linguistics, but highly regulated speech event. he syntactic form of questions,
most practitioners are linguists with a doctorate in their special- combined with metapragmatic rules that prevent witnesses from
ization who apply this expertise to legal questions and contexts. asking questions or introducing their own topics, results in a
Forensic phonetics is used mostly in disputes over transcrip- highly asymmetrical interaction, where most of a witnesss story
tion of incriminating recorded speech, and over the identiica- is told by lawyers. lexical semantics can uncover strategic
tion of individuals who have committed a language crime such ways in which word choice constructs opposing views of reality.
as a threat, bribe, or hoax by means of a recorded voice mes- critical discourse analysis of cross-examination in rape
sage. acoustic phonetics is often combined with articu- cases reveals the central role of linguistic choices in construct-
latory phonetics. Analysis of anonymous voice recordings ing defendants as passive participants without responsibility or
can be compared with recordings of suspects. Similarly, linguis- agency, for example through the use of nominalizations such as
tic discourse analysis of a written text may help to identify the fondling of the breasts, and agentless sentences such as
an author, for example, of a so-called suicide letter in a case there wasnt any major sexual activity.
where police have grounds for suspecting murder, rather than A particular concern of sociolinguists has been the partici-
suicide. For example, if discourse analysis inds striking similari- pation of vulnerable witnesses, especially children, second lan-
ties between a suicide letter and previous (uncontested) corre- guage, and second dialect speakers. he language addressed
spondence of the spouse of the deceased, together with a striking to child victim-witnesses in abuse cases is often complex and
313
Formal Semantics
confusing, and compromises their ability to tell their story. Second worlds. his sets formal semantics apart from approaches that
language speakers rely on interpreters, whose role is often mis- view semantics as relating a sentence just to a representation
understood by legal professionals and witnesses. Microanalysis on another linguistic level (logical form) or a representation
demonstrates linguistic challenges involved in courtroom inter- in an innate language of thought. he formal semanticist
preting, inding that it is much easier for interpreters to accu- could accept such representations as an aspect of semantics but
rately interpret propositional meaning than pragmatic meaning. would insist on asking what the model-theoretic semantic inter-
Second dialect speakers are disadvantaged by widespread igno- pretation of the given representation-language is (Lewis 1970).
rance about subtle dialectal diferences. Where there are also Formal semantics is centrally concerned with composition-
important cultural diferences in communicative style, as with ality at the syntax-semantics interface, how the meanings of
Australian Aboriginal English speakers, the possibilities for mis- larger constituents are built up from the meanings of their parts
communication are disturbing. In Australia, forensic linguistics on the basis of their syntactic structure, and with the relation
has contributed to positive developments in delivery of justice between compositional sentence meaning and meaning in
generally for Aboriginal people, and speciically in legal cases of discourse.
some individuals.
Forensic linguistics is also concerned with the language of The History of Formal Semantics
lawyer-client interviews and alternative legal processes, such as Formal semantics developed out of the work of Richard
mediation. For example, conversation analysis investigates Montague (193071) (see montague grammar), with impor-
the extent to which mediators talk is neutral, as well as the ways tant contributions by other philosophers, logicians, and linguists.
in which gender afects talk between lawyers and their clients. Montague built on Alfred Tarskis recursively deined model-
Another recent focus is on the transformation of oral narratives theoretic semantic interpretation for logical formulas based on
into written legal documents (see intertextuality). While their recursively deined syntax. Donald Davidson, whose own
most forensic linguistics is concerned with criminal or civil law, approach to formal semantics is not model theoretic, was also
linguists concerns have recently been extended to immigration inluential in urging a truth conditional semantics for
law, notably immigration oicials use of untrained analysts who natural language, arguing from learnability that a initely
examine the speech patterns of asylum seekers to determine speciiable compositional semantics for natural languages must
their national origin. be possible (1967). Montague grammar evolved into formal
semantics through the work of philosophers and linguists, devel-
Diana Eades
oping a variety of approaches (Partee 1996; Partee with Hendriks
1997; Portner and Partee 2002).
SUGGESTIONS FOR FURTHER READING Gottlob Frege, whose ideas were part of the foundation of
Eades, Diana. 2010. Sociolinguistics and the Legal Process. Bristol: Tarskis and Montagues work, took an antipsychologistic view
Multilingual Matters. Comprehensive examination of sociolinguistic of meanings, and so did many other logicians and philosophers
research, examining how language works and does not work in the (including Bertrand Russell and Rudolf Carnap). But formal
legal process. semanticists who are linguists are very much concerned with
Gibbons, John. 2003. Forensic Linguistics: An Introduction to Language human semantic competence. he history of formal semantics has
in the Justice System. Oxford: Blackwell. Accessible introduction to the been colored by tension between the Fregean antipsychologistic
ield.
tradition and the Chomskyan tradition of focusing on linguistic
Tiersma, Peter. 1999. Legal Language. Chicago: University of Chicago
competence, a tension only partially resolved. One step toward
Press. Authoritative textbook on written and spoken language in the
a resolution has come from recognizing a distinction between
legal process.
meaning, which may exist outside the head (E-language),
and knowledge of meaning, or semantic competence, which is
very much inside the head (but should probably not be called
FORMAL SEMANTICS
I-language, a term better reserved for a semantic representation
Formal semantics is an approach to semantics, the study of language in approaches that include one.)
meaning, with roots in logic, the philosophy of language, and What is semantic competence? he answers to this ques-
linguistics, and since the 1980s a core area of linguistic theory. tion will naturally be diferent with respect to diferent seman-
Characteristics of formal semantics treated in this entry include tic frameworks. For formal semanticists, it is common to take
the following: Formal semanticists treat meaning as mind inde- the fundamental characterization of semantic competence to
pendent (though abstract), contrasting with the view of meanings involve the knowledge of truth conditions: Given a sentence in
as concepts in the head (see i-language and e-language a context, and given idealized omniscience about the facts con-
and meaning externalism and internalism ); formal cerning some possible situation, a competent speaker can judge
semanticists distinguish semantics from knowledge of semantics whether the sentence is true or false in that situation. From that
(Lewis 1975, Cresswell 1978), which has consequences for the basic competence, allowing idealizations about computational
notion of semantic competence. A central part of the meaning capacity, it follows that a competent native speaker can also
of a sentence on this approach is its truth conditions, and make judgments about entailment relations between sentences.
most, though not all, formal semantics is model theoretic, relat- So semantic competence is widely considered to consist in
ing linguistic expressions to model-theoretically constructed knowledge of truth conditions and entailment relations of sen-
semantic values cast in terms of truth, reference, and possible tences of the language.
314
Formal Semantics
The Current State of the Field via the notion of mental models (Johnson-Laird 1983). But
At the heart of formal semantics are the principle that truth approaches using mentally represented formulas (logical forms,
conditions form a core aspect of meaning and the methodolog- conceptual representations) and computations on such formu-
ically central principle of compositionality. Diferences among las, as advocated by Ray Jackendof for many years, are preferred
approaches can often be traced to three crucial theory-depen- by many Chomskyan linguists.
dent terms in the principle of compositionality: he meaning of Kamps (1981) discourse representation theory (DRT) uses
a whole is a function of the meanings of its parts. the discourse representation structure (DRS) as a noneliminable
intermediate level of representation, with claimed psychologi-
MEANINGS. David Lewis provided a famous strategy for thinking cal reality: Kamp hypothesized that his DRS could be a common
about what meanings are: In order to say what a meaning is, we medium playing a role in language and as objects of proposi-
may irst ask what a meaning does, and then ind something that tional attitudes. Kamp argued against full compositionality;
does that (1970, 22). here are diferent proposals about what he was challenged by Jeroen Groenendijk and Martin Stokhof
to count as meanings (or as the linguistically relevant aspects of (1991), who argued that a fully compositional dynamic seman-
meanings) within formal semantics. Montague formalized inten- tics could accomplish what Kamp could do with DRT. Reinhard
sions of sentences as functions from possible worlds and variable Muskens (1993) proposed a reconciliation with his composi-
assignments to truth values (see intension and extension; tional discourse representation theory.
proposition.) R. Stalnaker (1976), David Kaplan (1979), Hans
Kamp (1981), and Irene Heim (1982) emphasized the importance PARTS (SYNTAX). he relation between the preceding issues and
of context-dependent expressions; Kaplan introduced the char- syntax shows up clearly in debates about direct compositional-
acter of an expression, a function from contexts to intensions. ity: Some linguists argue that a direct compositional model-
Kamp and Heim introduced a more dynamic semantics, treating theoretic semantics can apply to nonabstract surface structures
meaning as a function from contexts to contexts. Jon Barwise and (Barker and Jacobson 2007), without abstract syntactic represen-
John Perry (1983) and Angelika Kratzer (1989) argued for replac- tations, movement rules, or a level of logical form. Advocates
ing possible worlds by (possible) situations, which for Kratzer are of direct compositionality use an enriched arsenal of semantic
parts of possible worlds, enabling more ine-grained analysis of combining rules, including not only function-argument applica-
meanings (see Kratzer 2007). tion but also function composition and a number of type-shift-
ing operators. here may or may not be an inevitable trade-of
IS A FUNCTION OF. How are meanings put together? How does between optimizing syntax and optimizing semantics; it is a sign
the compositional mapping from syntax to semantics work? he of progress that many linguists work on syntax and semantics
question of the sorts of functions that are used to put meanings with equal concern for both.
of parts together is inextricably linked to the questions of what
meanings are and what count as syntactic parts. Frege (1892)
An Example
took the basic semantic combining operation to be function-ar-
We illustrate the methods of formal semantics without formal
gument application: Some meanings are construed as functions
details by considering one aspect of the analysis of restrictive rel-
that apply to other meanings (see compositionality). With a
ative clauses like who fed Fido in (1a) and (1b).
syntax such as categorial grammar providing the relevant
partwhole structure, Freges function-argument principle could (1) a. I saw a boy who fed Fido.
be enough; with other kinds of syntax, other operations may be b. I saw every boy who fed Fido.
needed as well.
In the 1960s, there were debates about whether the relative
Formal semanticists also difer on whether a level of semantic
clause combines with the noun boy, as in structure (2), or with
representation is hypothesized to mediate between syntax
the phrases a boy, every boy, as in structure (3).
and model-theoretic interpretation. Montagues own work exem-
pliied both direct model-theoretic interpretation (Montague (2) I saw [a / every [boy [who fed Fido ]]]
1970) and two-stage interpretation via a language of intensional
(3) I saw [[ a / every boy] [who fed Fido ]]
logic (Montague 1973). Many linguists work with some interme-
diate semantic representation. Either approach can be composi- here were also debates about the semantics of the relative
tional: A two-stage interpretation procedure is compositional if clause, with some arguing that in (1a) who fed Fido means and
the syntax-to-semantic-representation mapping rules are com- he fed Fido, whereas in (1b) it means if he fed Fido, creating ten-
positional and the model-theoretic semantics of the representa- sion between the uniform surface structure of who fed Fido in
tion language is also compositional. When those conditions are (1a) and (1b) and the very diferent underlying semantic inter-
met, the intermediate language is, from a model-theoretic per- pretations posited for them (see Stockwell, Schachter, and Partee
spective, in principle eliminable. But linguists hypothesize that it 1973 ). he formal semantics perspective suggests searching for
may have some psychological reality: It may represent an aspect a unitary syntax and meaning for who fed Fido and locating the
of the means by which humans compute the mapping between semantic diference between (1a) and (1b) in the semantics of
sentences and their meanings. But it is a major challenge to ind a and every. he solution (due to Quine [1960] and Montague
empirical evidence for or against such a hypothesis. [1973]) requires structure (2): he noun and relative clause
It is worth noting that it is possible to advocate direct mod- denote sets, and their combination denotes the intersection of
el-theoretic interpretation without being antipsychologistic, those two sets. hen the phrase boy who fed Fido denotes the set
315
Formal Semantics
{x: x is a boy and x fed Fido}. Diferent theories of the semantics vague words with particular less vague meanings) on what
of determiners give diferent technical implementations of the we have in mind when we speak (division of linguistic
rest of the solution, but that irst step settles both the syntactic labor). How the word elm in Putnams mouth can refer to
question and the core of the semantics. Sentence (1a) asserts elms is a major topic in the philosophy of language (see natu-
that the set of boys who fed Fido and the set of boys that I saw ral kind terms ). Such issues distinguishing E-language from
overlap; (1b) says that the set of boys I saw is a subset of the set of I-language are especially acute for proper names (Kripke 1972).
boys who fed Fido. See Partee (1995) for a fuller argument, and Formal semanticists emphasize that human semantic compe-
Barwise and Cooper (1981), Heim (1982), and Kamp (1981) for tence includes the higher-order intention to use words with
treatments of the semantics of determiners. their shared conventional meanings, unlike Humpty Dumpty
in hrough the Looking-Glass (When I use a word, Humpty
A Second Example Dumpty said, in a rather scornful tone, it means just what I
Also informally, we illustrate the beginnings of the more dynamic choose it to mean neither more nor less.) At the same time,
semantics of Kamp (1981) and Heim (1982). Consider the con- formal semantics and pragmatics make room for the fact that
trasting minidiscourses in (4) and (5): in a discourse context, conversational partners may mutually
understand possibly mistaken speakers meanings, so that a
(4) A baby was crying. It was hungry. false sentence can sometimes be used to express a true prop-
(5) Every baby was crying. #It was hungry. (# means osition. For instance, hat beech is dying might successfully
anomalous.) communicate the fact that a certain tree that is in fact an elm
is dying, since the demonstrative that may be enough to enable
On the Kamp-Heim theory, an indeinite noun phrase (NP) like the hearer to recognize which tree is meant.
a baby in (4) introduces a novel discourse referent into the con- Some argue that formal semantics does not belong within
text, and the pronoun it in the second sentence of (4) can be linguistics if formal semantics is concerned with truth, since
indexed to that same discourse referent, whose lifespan can linguistics is concerned only with mental representations. Such
be a whole discourse, not only a single sentence. he discourse an attitude relects both a methodological decision (Noam
referent introduced by an essentially quantiicational NP like Chomskys position excludes model-theoretic semantics because
every baby in (5), however, cannot extend beyond its clause, so it extends beyond autonomously linguistic levels of represen-
the pronoun it in (5) is anomalous. he Kamp-Heim theory also tation) and a misunderstanding, since formal semanticists are in
includes an account of the famous donkey-sentences of Peter fact not interested in truth but in truth conditions, and one can
Geach (1962), variants of which are given in (6ab). very well know what a situation must be like for a sentence to be
(6) a. If a farmer owns a donkey, he always beats it. true in it, without knowing what the facts are and so not know-
b. Every farmer who owns a donkey beats it. ing the actual truth value. For most of us, this is the case with a
sentence like here is now a white cat sitting in a window of an
hese sentences had previously resisted compositional analysis, apartment in Baltimore in which a piano teacher gave piano les-
even with the tools of Montague grammar and its early exten- sons in the 1950s.
sions. On Kamps and Heims theories, the indeinite a donkey he compositionality principle is arguably one aspect of
introduces a discourse referent into the local context, but has no universal semantic competence that distinguishes human lan-
quantiicational force of its own; it ends up being bound by the guage from all animal languages (see compositionality).
unselective quantiiers always in (6a) and every in (6b). he cognitive linguists sometimes charge that compositional-
theories thus involve the interdependent areas of quantiica- ity is incompatible with the prevalence of metaphor. Formal
tion and anaphora, and relating sentence semantics to discourse semanticists reply that metaphor principally involves shifts in
semantics and pragmatics, giving rise to much new work in these lexical meanings, and that the interpretation of metaphor is
areas. driven in part by compositionality, that is, by the need to ind
suitable meanings for the parts so that they can be combined
Formal Semantics and Cognitive Science into a meaningful whole. Compositionality is also a driving force
Formal semantics as developed within linguistics is as much a in childrens acquisition of lexical meanings: When context sup-
part of cognitive science as any other part of linguistics, but there plies a plausible meaning for the whole and syntax supplies a
are a number of misunderstandings and controversies surround- suitable structure, the child can use knowledge of the meaning
ing this issue. his concluding section touches on a number of of the whole and of the other parts to solve for the meaning
broader issues and controversies. of the unknown part. (Cf. the carefully structured context for the
In semantics, possibly unlike syntax or phonology, ones idi- learning of novel color words in experiments reported in Carey
olect is not determined solely by ones own semantic knowl- [1978, 271]): Bring me the chromium tray not the blue one, the
edge. Meanings are conventional social constructs (Lewis chromium one.)
1975), and as Hilary Putnam (1975) observed, his inability A word about the relation between word meaning (see
to distinguish beeches from elms does not render the words also lexical semantics) and compositional meaning: Some
beech and elm synonymous or vague in his language, since the semantic theories have focused largely on word meaning, while
meanings of our words and the truth conditions of our sen- formal semantics has focused on compositional meaning, but
tences depend in signiicant part on the conventions of our in principle every semantic theory must address both, as well
language community and only in part (for instance, in using as their integration. Formal semanticists give special attention
316
Formal Semantics Forms of Life
to the semantic types of lexical items, since that is crucial to the Kratzer, Angelika. 1989. An investigation of the lumps of thought.
combining principles. A classic work on lexical meaning and Linguistics and Philosophy 12: 60753.
word-formation rules is Dowty (1979); see also Partee (1995). . 2007. Situations in natural language semantics. In he Stanford
As a inal observation, we note that in contemporary work, Encyclopedia of Philosophy (Spring ed.), ed. Edward N. Zalta. Available
online at: http://plato.stanford.edu/archives/spr2007/entries/
attention to context dependence has reached such a point that
situations-semantics/.
formal semantics and formal pragmatics are close to becoming a
Lappin, Shalom, ed. 1996. he Handbook of Contemporary Semantic
single discipline, and the role of context in determining meaning heory. Oxford: Blackwell. Contains articles by leading semanticists on
is so important as to require a richly context-dependent notion of wide spectrum of topics, representing both formal semantics and other
meaning (semantics-pragmatics interaction). approaches.
Lewis, David. 1970. General semantics. Synthse 22: 1867.
Barbara H. Partee
. 1975. Language and languages. In Language, Mind, and
Knowledge, ed. K. Gunderson, 335. Minneapolis: University of
WORKS CITED AND SUGGESTIONS FOR FURTHER READING Minnesota Press.
Bach, Emmon. 1989. Informal Lectures on Formal Semantics. New Montague, Richard. 1970. English as a formal language. In Linguaggi
York: State University of New York Press. A user-friendly nontechnical nella Societ e nella Tecnica, ed. Bruno Visentini et al., 189224.
introduction. Milan: Edizioni di Comunit.
Barker, Chris, and Pauline Jacobson, eds. 2007. Direct Compositionality. . 1973. he proper treatment of quantiication in ordinary English.
Oxford Studies in heoretical Linguistics. Oxford: Oxford University In Approaches to Natural Language, ed. K. J. J. Hintikka et al., 22142.
Press. Dordrecht, the Netherlands: Reidel.
Barwise, Jon, and Robin Cooper. 1981. Generalized quantiiers and nat- Muskens, Reinhard. 1993. A compositional discourse representation
ural language. Linguistics and Philosophy 4: 159219. theory. In Proceedings of the 9th Amsterdam Colloquium, ed. P. Dekker
Barwise, Jon, and John Perry. 1983. Situations and Attitudes. Cambridge, and M. Stokhof, 46786. Amsterdam: ILLC, University of Amsterdam
MA: MIT Press. Press.
Carey, Susan. 1978. he child as word learner. In Linguistic heory and Partee, Barbara. 1995. Lexical semantics and compositionality. In An
Psychological Reality, ed. M. Halle et al., 26493. Cambridge, MA: MIT Invitation to Cognitive Science. Vol. 1: Language. Ed. L.Gleitman and
Press. M. Liberman, 31160. Cambridge, MA: MIT Press.
Chierchia, Gennaro, and Sally McConnell-Ginet. 1999. Meaning and . 1996. he development of formal semantics in linguistic theory.
Grammar: An Introduction to Semantics. Cambridge, MA: MIT Press. In he Handbook of Contemporary Semantic heory, ed. S. Lappin,
An accessible introduction to formal semantics and pragmatics. 1138. Oxford: Blackwell.
Cresswell, M. J. 1978. Semantic competence. In Meaning and Partee, Barbara H., with Herman L.W. Hendriks. 1997. Montague gram-
Translation: Philosophical and Linguistic Approaches, ed. F. Guenthner mar. In Handbook of Logic and Language, ed. J. van Benthem and A.
and M. Guenthner-Reutter, 943. London: Duckworth. ter Meulen, 591. Amsterdam and Cambridge, MA: Elsevier and MIT
Davidson, Donald. 1967. Truth and meaning. Synthese 17: 30423. Press.
Dodgson, Charles (Lewis Carroll). 1871. hrough the Looking-Glass and Portner, Paul, and Partee, Barbara H., eds. 2002. Formal Semantics: he
What Alice Found here. London: Macmillan and Co. Essential Readings. Oxford: Blackwell. A collection of classic papers
Dowty, David. 1979. Word Meaning and Montague Grammar: he from the beginnings of the ield to the late 1980s, with an introductory
Semantics of Verbs and Times in Generative Semantics and in overview essay.
Montagues PTQ. Dordrecht, the Netherlands: Reidel. Putnam, Hilary. 1975. he Meaning of Meaning. In Minnesota Studies
Frege, Gottlob. 1892. ber Sinn und Bedeutung. Zeitschrift fr in the Philosophy of Science. Vol. 7: Language, Mind, and Knowledge. Ed.
Philosophie und philosophische Kritik C: 2250. Keith Gunderson, 13193. Minneapolis: University of Minnesota Press.
Gamut, L. T. F. 1991. Logic, Language, and Meaning. Vol. 2. Intensional Quine, Willard van Orman. 1960. Word and Object. Cambridge, MA: MIT
Logic and Logical Grammar. Chicago: University of Chicago Press. An Press.
excellent introduction to formal semantics and its logical foundations; Stalnaker, R. 1976. Propositions. In Issues in the Philosophy of Language, ed.
requires some prior logic. A. Mackay and D. Merrill, 7999. New Haven, CT: Yale University Press.
Geach, Peter. 1962. Reference and Generality. Ithaca, NY: Cornell Stockwell, Robert P., Paul Schachter, and Barbara H. Partee. 1973. he
University Press. Major Syntactic Structures of English. New York: Holt, Rinehart and
Groenendijk, Jeroen, and Martin Stokhof. 1991. Dynamic predicate Winston.
logic. Linguistics and Philosophy 14: 39100.
Heim, Irene. 1982. he semantics of deinite and indeinite noun
phrases. Ph.D. diss., University of Massachusetts. FORMS OF LIFE
Heim, Irene, and Angelika Kratzer. 1998. Semantics in Generative Forms of life, an expression associated with Ludwig
Grammar. Oxford: Blackwell. A systematic introduction to selected
Wittgenstein, is not one he made that much use of, employing it
topics in semantics that lays solid foundations for constructing and
only ive times in his Philosophical Investigations. It is not clear
arguing for formal semantic analyses.
Johnson-Laird, Philip N. 1983. Mental Models. Cambridge: Cambridge
from his usage exactly what he had in mind. Here are statements
University Press. in which the phrase occurs:
Kamp, Hans. 1981. A theory of truth and semantic representation. In It is what human beings say that is true or false; and they
Formal Methods in the Study of Language: Mathematical Centre Tracts
agree in the language they use. hat is not agreement in opin-
135, ed. J. A. G. Groenendijk et al., 277322. Amsterdam: Mathematical
ions, but in forms of life (1958, 38), and
Centre.
Kaplan, David. 1979. On the logic of demonstratives. Journal of What has to be accepted, the given, is so one could say
Philosophical Logic 8: 8198. forms of life (1958, 226), as well as
317
Forms of Life
I would like to regard this certainty, not as something akin understood as a challenge: Try to imagine understanding a lan-
to hastiness or supericiality, but as a form of life (hat is guage without knowing something about how those who use the
very badly expressed and probably badly thought as well). language live their lives, some of the diferent sorts of things they
(1969, 46) do. he language as a totality is an entirely contingent assem-
blage, accumulated ways of speaking that come from diferent
Its use may nonetheless be associated with Wittgensteins
parts of life (as add, subtract, divide come from calculation, or
attempt to humanize matters especially logic that philosophers
swing, putt, birdie from golf).
had been prone to idealize to the extent of making them seem to
he direction in which natural human responses can be culti-
surpass all possibility of creation by merely human beings, mean-
vated varies greatly, according to the multifarious ways in which
ing that their objectivity must surely reside elsewhere than in the
circumstances impact upon them, but Wittgenstein emphasized
contingent facts and incidental variety of human life. He seeks to
both that and the importance of the natural, animal responses
reassert the connection between seemingly immutable necessi-
of human beings. For example, consider the way in which the
ties (of logic and mathematics especially) and the contingencies
human animal naturally responds to a pointing gesture (follow-
of human life. To counteract the idealization of logic into some-
ing the direction of the inger) when other animals (cats, cows)
thing almost superhuman, Wittgenstein is prepared to say that I
do not, and the role that gesture can play when embedded in a
want to conceive of it [logic] as something that lies beyond being
parentchild relation in associating a color name with a color
justiied or unjustiied; as it were, something animal (1969, 47).
sample. his is an efective way of teaching words because chil-
dren naturally respond to the rulelike connection made between
Arguments over Interpretation
word and sample, and after a very few examples, can make the
From the previous quotations, it can be seen that Wittgenstein
same association for themselves. Such natural, and common,
does not enumerate the kinds of things that he has in mind, gives
reactions are what enable practices, as well as the development
no examples of the forms of life nor speciies what makes them
of their associated language forms, to ind purchase as standard
such. Inevitably, there is disagreement over what he meant by
practices: Agreement in the way color samples are extended to
forms of life. Is it that human beings, of every time and place,
cases is the basis for the possession of standard color names.
have the same forms of life or, alternatively, that cultural varia-
On that same basis, of course, varied color vocabularies have
tion amongst them is possible? (Emmett 1990, 213).
arisen.
Gertrude Conway (1989, 423) lists four rival
interpretations: those
Necessities and Certainties
(1) equating forms of life with language games, Forms of life can be considered as points of reference for two key
(2) interpreting forms of life on an organic model, philosophical concerns, necessities and certainties. he obser-
vations on such forms of life as a childs responses to teaching
(3) equating forms of life with cultural systems, and
are not contributions to some general and systematic account of
(4) presenting human nature as a form of life. the nature of human life and its practices, for they are taken by
Among the sort of things that forms of life might cover, then, Wittgenstein to be utterly obvious. hey are the sort of incontest-
are a) the human creature, which is a form of life when compared able truths that cause philosophical confusion because they are
with other forms of life (such as plants or ish), and b) activities apt to be left out of account when someone moves into theoriz-
characteristic of a kind of creature: Periodically going to sleep ing mode. Wittgensteins common method is to draw attention
is one form of life in which humans engage, digging burrows back to these apparent matters to show that what seems deeply
characteristic of rabbits. Wittgenstein certainly does not want to puzzling when considered in isolation from the hurly-burly of
lose sight of the fact that human beings are animals but, equally, ordinary lives may cease to seem puzzling when set against the
is aware that human beings are a distinctive sort of animal, a background of practical activities from which it has been cut of.
language-using social one. In consequence, c) groups of people
may difer in cultural forms, as rural dwellers might have very Necessities
diferent forms of life than urban ones, or d) may difer in rela- here can be a strong temptation to think that our institu-
tively speciic ways of acting, including ways of speaking, within tions, practices, and ways of acting can (sometimes at least) be
a cultural community their ways of measuring, for example. explained in terms of necessities. Logic was especially important
to Wittgensteins relections throughout his life. It was widely
The Natural History of Human Beings assumed that the power of logical forms derived from the fact
he development of language is connected with human biology that they relected necessities intrinsic to the way things are
(but it is the physiognomy of the organism and the organization logic could be no other way than it is because it corresponds to
of its life that Wittgenstein has in mind, not the physiology of its unalterable features of reality. Wittgensteins was no attempt to
nervous system). Language is made possible by characteristic do away with logical necessities altogether, only to dereify them,
human capacities and responses, but the language developed is and to persuade, more generally, that our institutions and prac-
not a direct outgrowth of those characteristics. he development tices do not follow from external necessities. Rather, our idea
of language is interwoven with the formation of practical life, of necessity lows from our institutions, practices, and so on.
for this involves the creation of ways of speaking integral to the Getting things the right way round is, as Peter Winch summa-
practical activity. he remark that to imagine a language means rized it, a matter of understanding that logical relations among
to imagine a form of life (Wittgenstein 1958, par. 19) can be propositions themselves depend on social relations between
318
Forms of Life Frame Semantics
men (1958, 126). Wittgenstein suggested that when it seemed doubts about the length of a table by recommending the use of a
that things could not be conceived of as being otherwise, this tape measure as a dependable way of precisely determining this,
was, if not an illusion, at most a function of the part that they but what sorts of serious doubts are there about a tape measures
played in our ways of acting, not a sign of their essential relation dependability in such a case? Our notion of reliable measure-
to any necessitating external reality. he temptation to appeal to ment is very much tied up with the ways we use tape measures,
such externally imposed necessities could be alleviated by draw- but, at the same time, we should not think that our practices of
ing attention to the degree to which seemingly transcendental measurement are the only ones that could possibly make sense,
necessity is connected with some familiar, unobtrusive feature of and that other people might not have ways of measuring for
the human organism or its life, by imagining the consequences which our tape measures would be useless.
of some gross change in the character of the human organism,
Wes Sharrock
or by imagining a cultural practice signiicantly diferent from
our own. hus, one need only imagine children not naturally
responsive to the rulelike link between instruction and sample to
quickly realize that color vocabularies and arithmetic (for exam- Conway, Gertrude.1989. Wittgenstein on Foundations. New
ple) depend on the trainable susceptibilities of children for their Jersey: Humanities Press International.
existence as stable practices. Alternatively, imagine how another Emmett, Kathleen. 1990. Forms of life. Philosophical Investigations 13
(July): 21331.
people might have diferent practices from ours in counting or
Garver, Newton. 1990. Form of life in Wittgensteins later works.
measuring, and then appreciate that the speciic forms that serve
Dialectica 44.1/2: 175201.
these tasks in our society do not comprise the unique, eternal, or Hanling, Oswald. 2002. Wittgenstein and the Human Form of Life.
universal essence of counting or measuring, and work as well as London: Routledge.
they do only because they have connections with the practical Hunter, John. 1968. Forms of life in Wittgensteins Philosophical
needs of our distinctive way of life. Investigations. American Philosophical Quarterly 5: 23343.
Winch, Peter. 1958. he Idea of a Social Science. London: Routledge.
Certainties Wittgenstein, Ludwig. 1958. Philosophical Investigations 2d ed.
Certainty, too, is to be connected to natural human reactions, Oxford: Blackwell.
to the way in which we commonly act without doubts or hesita- . 1969. On Certainty. Oxford: Blackwell.
tions we go about many of our activities in a way in which doubt
is simply absent; it does not occur to us. he philosophers stock FRAME SEMANTICS
idea of certainty is something that we are entitled to only after
his term refers to a wide variety of approaches to the system-
we have arrived at some inal, unquestionable justiication, have
atic description of natural language meanings. he one com-
given grounds for our practices that allow no logical possibility
mon feature of all these approaches which, however, does not
of error. Rather than accepting the skeptics query as to whether
suiciently distinguish frame semantics from other frameworks
it is logically possible to doubt, Wittgenstein prefers, instead,
of semantic description is the following slogan from Charles
to relect on where we do doubt. We do not possess certainties
Fillmore (1977a):
because we have arrived at them through critical relections, but
acquire them as an integral part of coming to mastery in various Meanings are relativized to scenes.
activities. Is this not, however, simply a complete concession to
According to this slogan, meanings have internal structure that is
the skeptic? All our practices lack justiication; we cannot iden-
determined relative to a background frame or a scene. he eas-
tify any ultimate, unquestionable self-evident truth that under-
iest way to understand this thesis is by way of example. he fol-
pins them. No, it is a counterskeptical move: he certainties that
lowing one is from Fillmore (1977c):
are incorporated into our learning and thus into our practices
Suppose that two identical twins Mark and Mike are both in
are ones that serve as conditions of intelligibility, that shape our
a hospital sitting on the edge of their beds in exactly the same
capacity to understand what a doubt could be in an actual case.
position. When a nurse walks by Marks room, she says: I see that
hese certainties are not held in place by an utterly uncritical atti-
Mark is able to sit up now, and when she walks by Mikes room
tude toward them but by their connection with our ways of doing
she remarks: I see that Mike is able to sit down now. Drawing on
practical things, in which they act as the setting within which the
what we know about hospitals our hospital background scenes
notions of doubt and justiication can have content.
or frames we will interpret the two remarks of the nurse rather
he doubts that the skeptic tries to induce are pretend ones
diferently, thereby relativizing the meanings of her remarks to
only, ones that by their very design are empty of empirical
the relevant scenes.
content, and are, as genuine doubts, unintelligible. Skepticism
Another often-cited example of Fillmore (1977c) that clearly
is right only insofar as its continuing pressure for further justii-
demonstrates the previous thesis is the diference in meaning
cation until a rock-bottom certainty is reached leaves us (quite
between the following two sentences:
soon) in a position where we have no further justiications. It is
misguided to think that this shows that we have been forced to (1) I spent three hours on land this afternoon.
admit our inability to respond to legitimate demands for justi-
(2) I spent three hours on the ground this afternoon.
ication. Rather, we run out of justiications at the same point at
which the skeptics demands for justiication themselves pass he background scene for the irst sentence is a sea voyage, while
the limits of what can make sense as a doubt. We can settle the the second sentence refers to an interruption of air travel. his
319
Frame Semantics
illustrates Fillmores use of the term frame as an idealization of a Table 1.

coherent individuatable perception, memory, experience, action,
BUYER buy GOODS (SELLER) (PRICE)
or object (1977c).
In order to understand frame semantics, it is helpful to begin Subject object From for
with a brief history. From here we turn to an overview of the most Angela bought the owl from Pete for $ 10
important theoretical concepts. Next, the relationship of frame
Eddy bought them for $ 1
semantics to one speciic version of construction grammar
is introduced and some examples analyzed. he entry ends with a Penny bought a bicycle from Stephen
short summary of applications of frame semantics and a note on
Table 2.
formalization. Usually, frame semantics is taken to be a very infor-
mal approach to meaning, but nevertheless, some approaches VERB BUYER GOODS SELLER MONEY PLACE
relating frame semantics to formal semantics exist. Buy subject object From For at
Sell to
History
Cost indirect subject Object at
here are at least two historical roots of frame semantics; the irst
object
is linguistic syntax and semantics, especially Fillmores case
grammar; the second is artiicial intelligence (AI) and the notion Spend subject on Object at
of frame introduced by M. Minsky (1975) in this ield of study.
A case frame in case grammar was taken to characterize a
small abstract scene that identiies (at least) the participants of prototypical descriptions of scenes. A prototype has the advan-
the scene and thus the arguments of predicates and sentences tage that it does not have to cover all possible aspects of the
describing the scene. In order to understand a sentence, the lan- meaning of a phrase; in other words, a prototype does not have to
guage user is supposed to have mental access to such schema- provide necessary and suicient conditions for the correct use of
tized scenes. a phrase. Fillmore (1977b) illustrates the use of prototypes within
he other historical root of frame semantics is more diicult to frame semantics by an analysis of the concept widow. he word
describe. It relates to the notion of frame-based systems of knowl- widow is speciied with respect to a background scene in which
edge representations in AI. his is a highly structured approach people marry as adults, they marry one person, and their lives
to knowledge representation that collects together information are afected by their partners death and perhaps other proper-
about particular objects and events and arranges them into a tax- ties. he advantage of a theory of meaning based on the prototype
onomic hierarchy familiar from biological taxonomies. However, concept, compared to a theory that insists on stating necessary
the speciic formalism suggested in the aforementioned paper by and suicient conditions for the meaning of a phrase, is that it
Minsky was not considered successful in AI . does not have to care about certain boundary conditions; that is,
it does not have to provide answers for questions like Would you
Some Basic Theoretical Principles call a woman a widow who has lost two of her three husbands but
he central theoretical concepts characterizing frame seman- who had one living one left? Fillmore (1977b). In a case like this,
tics originated with Fillmore and did not change much since his whether the noun widow applies or not is unclear since certain
irst writings on this approach. In order to understand the most properties of the background frame for this concept are missing.
important notions of frame semantics, let us briely consider a he concept prototype is not unproblematic either, however.
typical example of a frame, the commercial transaction frame that Note that Fillmore does not use this concept with respect to words
demonstrates the origin of frame semantics from Fillmores case but with respect to frames or scenes. Some words like bird cer-
frames as well. In Table 1, the concept frame is applied to verbs tainly have prototypes, but others may not have a corresponding
like buy with the intention to represent the relationsips between prototype. What is a prototypical vegetable, for instance, or a pro-
syntax and semantics. he verb buy, according to Table 1, requires totype corresponding to the adjective small? Moreover, applica-
obligatorily a buyer and goods and optionally a seller and a price. tions of prototype theory often involve two diferent measures for
Verbs with related meanings, such as sell, are expected to have category membership. A penguin, for example, is certainly not a
the same meaning slots but in a syntactically diferent order. his prototypical bird, but nobody hesitates to judge it as a bird. he
clearly shows the relation to Fillmores case frames. Combining other measure of category membership is typically used in the
these frames results in the commercial transaction frame about analysis of vague predicates, for instance, color adjectives. It may
which Table 2 provides partial information. Of course, the PLACE- sometimes be hard or even impossible to assign a given object to
feature just marks the beginning of an open-ended list, since every the category of pink or red entities .
event in Table 2 can be further speciied for instance, with respect Another central notion within frame semantics is the concept
to time. Moreover, the collection of frames forms an ordered struc- proiling. R. Langacker (1987) uses the example of hypotenuse for
ture. For instance, the commercial transaction frame itself is part explaining this concept. One can easily draw a mental picture
of the more general transaction frame prototypically expressed by of the concept hypotenuse. he interesting question concerning
the ditransitive verb give. his indicates that the system of depen- this mental picture is this: Can you imagine what a hypotenuse is
dencies between frames forms an intricate hierarchical structure. without imagining the whole right triangle? he answer is clearly
he concept prototype is one of the most important con- no. he triangle and the plane in which it is included is a frame,
cepts of frame semantics. Frames should be understood as and the terms hypotenuse and right triangle are interpreted
320
Frame Semantics
with respect to this frame, but they proile diferent parts of the uses of verbs in speciic constructions. In (6), the intransitive
frame. verb sneeze has to be integrated into the caused motion construc-
he following example taken from Goldberg (1995) illustrates tion and, therefore, is forced to be interpreted as some kind of
lexical proiling of participants. Consider the following difer- action.
ences between the closely related verbs rob and steal. Both verbs and constructions are associated with frame
semantic meanings. However, in contrast to the rich frame
(3) a. Jesse robbed the rich (of all their money).
semantic representations of verbs, the basic constructions are
b. *Jesse robbed a million dollars (from the rich).
associated with a more abstract semantics. hese basic con-
structions and their frames are supposed to be independent of
(4) a. Jesse stole money (from the rich).
a particular language. hey are cross-cultural structures that are
b. *Jesse stole the rich (of money).
deeply entrenched in human experience. his is the content of
hese distributional facts can be explained by a semantic dif- Goldbergs scene encoding hypothesis.
ference in proiling. In the case of rob, the victim and the agent
Scene Encoding Hypothesis: Constructions that correspond to
(the thief) are proiled; in the case of steal, the agent and the valu-
basic sentence types encode as their central senses event types
ables are proiled. Representing proiled participants in bold-
that are basic to human experience. (Goldberg 1995)
face, A. Goldberg proposes the following argument structure for
rob versus steal:
Applications
rob <thief target goods> Frame semantics has a wide range of applications reaching
steal <thief target goods> from subields of linguistic theorizing such as morphology,
However, Goldbergs main concern is with constructions, for to typolology, linguistic discourse analysis, and
which she uses frame semantics in order to provide highly struc- language acquisition. However, the central and most success-
tured rich meanings for them. ful application seems to be (computational) lexicography. In a
frame-based lexicon, the frame accounts for related senses of a
Construction Grammar: A Closely Related Framework single word and its semantic relations to other words. A frame-
What are constructions? Here is Goldbergs deinition: A con- based lexicon, therefore, ofers more comprehensive informa-
struction is deined to be a pairing of form with meaning/use tion than the traditional lexicon. An example is Petruck (1986), a
such that some aspects of the form or some aspect of the mean- study of the vocabulary of the body frame in modern Hebrew. An
ing/use is not strictly predictable from the component parts or example of computational lexicography is the FrameNet-System
from other constructions already established to exist in the lan- (see Boas 2002).
guage (1995, 68).
here is no doubt that constructions exist. Morphemes, for
Formalization
Although frame semantics does not lend itself easily to formaliza-
instance, satisfy Goldbergs deinition. But do constructions dif-
tion there is an early approach by J. M. Gawron (1983) in which
ferent from morphemes exist? his is, of course, what defendants
basic insights of frame semantics were formalized by notations
of construction grammar try to show. Here, we take the existence
like those of the LISP programming language, in combination
of constructions other than morphemes simply for granted.
with situation semantics. A more recent approach is presented in
Consider the following examples:
van Lambalgen and Hamm (2005) in which scenarios a concept
(5) Margaret baked Peter some cookies. closely related to the frame concept are formalized as certain
kinds of logic programs. An explicit formalization of the combi-
(6) Martin sneezed the napkin of the nightstand.
nation of frame semantics and construction grammar based on
he peculiarity of example (5) is due to the fact that the verb this work can be found in Andrade-Lotero (2006).
bake, which normally has two arguments, is used with three
Fritz Hamm
arguments here. Peculiar as this sentence is, we nevertheless can
make sense of it. Margaret baked some cookies with the inten-
tion to give them to Peter. Note that this interpretation helps us
to make sense of the recipient role, which is not provided by the Aitchison, J. 1994. Words in the Mind. Malden, MA: Blackwell.
verb bake; that is, we think of this sentence as an instance of the Andrade-Lotero, E. 2006. Meaning and Form in the Event Calculus.
ditransitive construction of which a more standard example is M.A. thesis, University of Amsterdam, MOL-200601.
Atkins, B. T. S. 1995. he role of the example in a frame semantics dic-
(7) John gave Mary a present. tionary. In Essays in Semantics and Pragmatics, ed. M. Shibatani and
S. hompson, 2542. Amsterdam: John Benjamins.
he crucial claim of construction grammar is that this is not
Boas, H. 2002. Bilingual FrameNet dictionaries for machine transla-
due to diferent basic meanings of the verb bake but due to the
tion. In Proceedings of the hird International Conference on Language
integration of this verb plus its meaning into the ditransitive con- Resources and Evaluation. Vol. 4. Ed. M. G. Rodriguez and C.P.S.
struction, which has a meaning of its own. herefore, construc- Araujo, 136471. Las Palmas de Gran Canaria, Spain: University of Las
tion grammar distinguishes the semantics of argument structure Palmas de Gran Canaria.
constructions from the semantics of the verbs that instantiate Fillmore, C. 1977a. he case for case reopened. In Syntax and Semantics 8:
them. An advantage of this approach is that it accounts for novel Grammatical Relations, ed. P. Cole, 5981. New York: Academic Press.
321
Framing Effects
. 1977b. Scenes-and-frames semantics. In Linguistic Structure beef described as 75% lean was given higher ratings than beef
Processing, ed. A. Zambolli, 5582. Amsterdam: North Holland. described as 25% fat (Levin and Gaeth 1988); similarly, research
. 1977c. Topics in lexical semantics. In Current Issues in and development (R&D) teams are allocated more funds when
Linguisticheory, ed. R. W. Cole, 76138. Bloomington: Indiana their performance rates are framed in terms of successes rather
University Press.
than failures (Duchon, Dunegan, and Barton 1989). he valence-
Fillmore, C., and C. Baker 2000. Frame Net. Available online at: http://
consistent shift in attribute framing is a robust efect, observed in
www.icsi.berkeley.edu/framenet.
Gawron, J. M. 1983. Lexical representation and the semantics of comple-
a large range of experimental environments, with obvious impli-
mentation. Ph.D. diss., University of California, Berkeley. cations for marketing and persuasion.
Goldberg, A. 1995. Constructions: A Construction Grammar Approach to In risky choice framing, subjects are presented with two
Argument Structure. Chicago: University of Chicago Press. options in a forced-choice task. he two options are typically
Hamm, F., H. Kamp, and M. van Lambalgen. 2006. here is no opposi- gambles that can be described in terms of proportions and prob-
tion between formal and cognitive semantics. heoretical Linguistics abilities of gains or losses. Usually, one of these options is a sure
32: 140. thing (in which an intermediate outcome is speciied as certain),
Langacker, R. 1987. Foundations of Cognitive Grammar. Vol. 1. heoretical while the other is a risky gamble (in which extreme good and bad
Prerequisites. Stanford, CA; Stanford University Press. values are both assigned non-zero probabilities). he gamble and
Minsky, M. 1975. A framework for representing knowledge. In he
sure thing are both described either in terms of gain outcomes
Psychology of Computer Vision, ed. P. H. Winston, 21177. New York:
and probabilities or in terms of equivalent loss outcomes and
McGraw-Hill.
Petruck, M. 1986. Body part terminology in Hebrew. Ph.D. diss.,
probabilities. he two options are usually equated in expected
University of California, Berkeley. value (i.e., the mean outcome expected over many repeated tri-
van Lambalgen, M., and F. Hamm. 2005. he Proper Treatment of Events. als), enabling the framing researcher to interpret observed pat-
Malden, MA: Blackwell. terns of preference in terms of subjects risk attitudes. Within this
rubric, preferences for the sure thing indicate risk aversion, and
For more information, the interested reader is advised to consult the preferences for the gamble indicate risk seeking. he best-known
Frame Semantics Bibliography drawn up by Jean Mark Gawron. risky choice framing problem is the so-called Asian Disease
Problem (Tversky and Kahneman 1981). In it, subjects irst read
the following background blurb:
FRAMING EFFECTS
Imagine that the U.S. is preparing for the outbreak of an unusual
A framing efect is usually said to occur when equivalent descrip-
Asian disease, which is expected to kill 600 people. Two possible
tions of a decision problem lead to systematically diferent
programs to combat the disease have been proposed. Assume
decisions. Framing has been a major topic of research in the psy-
that the exact scientiic estimates of the consequences of these
chology of judgment and decision making and is widely viewed
programs are as follows:
as carrying signiicant implications for the rationality debate
(e.g., Shair and LeBoeuf 2002). Framing efects are commonly Some subjects are then presented with options A and B:
taken as evidence for incoherence in human decision making
and for the empirical inapplicability of the rational actor models (A) If program A is adopted, 200 people will be saved.
used by economists and other social scientists. he irst part of (B) If program B is adopted, there is a one-third probability
this entry presents a brief review of the empirical phenomena; that 600 people will be saved and a two-thirds probability that
the second part describes the standard normative interpretation no people will be saved.
of these empirical efects. Although the literature has not typi-
cally focused on the structure of human conversational environ- Other subjects are presented with options C and D:
ments, framing efects involve utterances selected by a speaker (C) If program C is adopted, 400 people will die.
for a listener. A inal section considers the possible implications
(D) If program D is adopted, there is a one-third probabil-
of communicative factors for a normative and descriptive under-
ity that nobody will die and a two-thirds probability that 600
standing of framing efects.
people will die.
Empirical Review he robust experimental inding is that subjects tend to prefer

In this section, we follow I. P. Levin, S. L. Schneider, and G. J. Gaeths the sure thing when given options A and B but tend to prefer the
(1998) taxonomy of framing efects into three categories: attribute gamble when given options C and D. Note, however, that options
framing, risky choice framing, and goal framing. A and C are equivalent, as are options B and D. Subjects thus
In attribute framing, a single attribute of a single object is appear to be risk averse for gains and risk seeking for losses, a
described in terms of either a positively valenced proportion central tenet of prospect theory (Kahneman and Tversky 1979).
or an equivalent negatively valenced proportion. he subject In prospect theory, it is the decision makers private framing of
is then required to provide some evaluation of the object thus the problem in terms of gains or losses that determines his or
described. he typical inding is a valence-consistent shift (Levin, her evaluation of the options; the framing manipulation is thus
Schneider, and Gaeth 1998): Objects described in terms of a viewed as a public tool for inluencing this private frame.
positively valenced proportion are generally evaluated more In goal framing, subjects are urged to engage in some activ-
favorably than objects described in terms of the correspond- ity (e.g., wearing seatbelts). his plea involves a description
ing negatively valenced proportion. For example, in one study, of either the advantages of participating in the activity or the
322
Framing Effects
corresponding disadvantages of not participating. he most described in the previous section, applies to listener efects with-
common result is that subjects are more likely to engage in the out any consideration of associated speaker phenomena (i.e.,
activity when the disadvantages of not engaging, rather than the regularities in how speakers choose frames in typical linguistic
advantages of engaging, are emphasized (Levin, Schneider, and environments). Researchers have tended to interpret the experi-
Gaeth 1998). mental efects as if the experimenter had somehow surgically
implanted a framing of the decision problem into the subjects
Normative Analysis brain. However, because linguistic utterances are employed,
Risky-choice framing efects have been put forward as positive regularities in speaker behavior may be relevant to the norma-
evidence for prospect theory (Kahneman and Tversky 1979), tive and descriptive understanding of listener behavior: If speak-
a theory of choice that aims to be both formally tractable and ers tend to choose diferent frames as a function of background
cognitively realistic. However, the focus in the framing litera- conditions, then listeners may reasonably draw inferences from
ture has been largely on the negative evidence that framing the speakers choice of frame. If knowledge of these background
efects allegedly raise against classical expected utility theory conditions is relevant to the listeners choice, then the frames,
and other so-called rational actor models. he literature on while logically equivalent, would not be information equivalent.
attribute framing, in particular, is concerned almost exclusively S. Sher and C. R. M. McKenzie (2006; cf. McKenzie and Nelson
with the normative and practical implications of the empiri- 2003) argued that the frames studied in the attribute framing
cal efects. Framing efects, D. Kahneman has noted, are literature are commonly information nonequivalent, because
less signiicant for their contribution to psychology than for speakers tend to frame options in terms of attributes that are
their importance in the real world and for the challenge they relatively salient. For example, a generally impressive R&D team
raise to the foundations of a rational model of decision mak- is more likely to be described in terms of its success rate than a
ing (2000, xv). his raises the important questions: Are fram- generally incompetent team with the same success/failure rate.
ing efects always counternormative? And if so, what norm or A positive frame thus highlights the salience of the positive attri-
norms do they violate? bute in the speakers conception of the option information rel-
In an important paper, A. Tversky and Kahneman argued that evant to its evaluation.
framing efects violate a bedrock normative condition of descrip- Experiments convey information to subjects in framed state-
tion invariance [a]n essential condition for a theory of choice ments, and researchers have generally assumed that the only
that claims normative status so basic that it is tacitly assumed information content is logical information content. he framing
in the characterization of options rather than explicitly stated as of the logical content is assumed not to convey information but
a testable axiom (1986, S253). Any theory of rational choice, they simply to inluence the listeners construal of the logical content.
argued, must stipulate that the same problem will be evaluated In this way, the usual normative analysis of framing experiments
in the same way, regardless of how the problem is described leans on an implicit assumption of the information equivalence
thus, equivalent descriptions should lead to identical decisions. of logically equivalent frames. However, while the logical equiva-
Expected utility theory, for example, satisies this principle: it lence of a pair of frames can usually be determined on inspection
evaluates choice options strictly as a function of probability and (though see Jou, Shanteau, and Harris 1996), a determination of
outcome, with no speciication of probability-outcome fram- information equivalence requires empirical study of the human
ing. his reducibility of decision problems to a canonical form communicative environments in which speakers typically frame
is clearly a theoretical convenience; the principle of description objects and options. At least in the domain of attribute framing,
invariance states that it is also a normative requirement. Because logical equivalence does not imply information equivalence.
the framing phenomena observed both in the laboratory and in Whether the study of communicative environments will have
real-world situations violate the description invariance prin- similar implications for traditional normative conclusions drawn
ciple, these efects are taken to imply that no theory of choice in risky choice and goal framing is an open question. here also
can be both normatively adequate and descriptively accurate remain important questions about how, and how lexibly, listen-
(ibid., S251). ers use subtle information that is, in principle, available in par-
ticular framing experiments. However, an analysis of speaker
Framing, Communication, and Rational Norms regularities in human communicative environments is likely to
Although framing efects are investigated mainly in relation to be of some signiicance in any research area in which informa-
normative choice models, such efects are clearly bound up with tion presented to experimental subjects is evaluated against a
human language, and closely related phenomena have been normative standard of information equivalence (cf. Hilton 1995;
investigated by language scholars. For example, markedness McKenzie 2004; Sher and McKenzie 2008; Schwarz 1996).
theorists have documented subtly diferent information con-
veyed by opposing polar adjectives. he school of cognitive Shlomi Sher and Craig R. M. McKenzie
linguistics has drawn on a more general notion of frame in
its treatment of fundamental issues in semantics (see frame WORKS CITED AND SUGGESTIONS FOR FURTHER READING
semantics). Framing, in the broad sense, enters crucially into Duchon, D., K. J. Dunegan, and S. L. Barton. 1989. Framing the problem
many processes of communication and can only be fully and making decisions: he facts are not enough. IEEE Transactions on
understood in the context of those processes. Engineering Management (February): 257.
Experimental framing efects involve utterances selected by Hilton, D. J. 1995. he social context of reasoning: Conversational infer-
speakers for listeners, but the standard normative analysis, ence and rational judgment. Psychological Bulletin 118: 24871.
323
Frontal Lobe
Jou, J., J. Shanteau, and R. J. Harris. 1996. An information processing reorganization. he primary role of the frontal lobes has devel-
view of framing efects: he role of causal schemas in decision mak- oped from behavior regulation through language, giving rise to
ing. Memory and Cognition 24: 115. abstract thinking and the ability to plan and execute complicated
Kahneman, D. 2000. Preface. In Choices, Values, and Frames, ed. actions. It may very well be that the development of language
D. Kahneman and A. Tversky, ixxvii. Cambridge: Cambridge
and the frontal lobes propelled humans from a hunting and
University Press.
predatory species to a social species, capable of modifying and
Kahneman, D., and A. Tversky. 1979. Prospect theory: An analysis of
decision under risk. Econometrica 47: 26391.
manipulating their environments.
Levin, I. P., and G. J. Gaeth. 1988. How consumers are afected by the
framing of attribute information before and after consuming the prod- Neuroanatomical and Neurophysiological Overview
uct. Journal of Consumer Research 15: 3748.
In humans, the frontal lobes are the largest lobes of the brain
Levin, I. P., S. L. Schneider, and G. J. Gaeth. 1998. All frames are not
and comprise about one-third of the cerebral cortex. As seen in
created equal: A typology and critical analysis of framing efects.
Figure 1, the frontal lobes lie anterior to the central (Rolandic)
Organizational Behavior and Human Decision Processes 76: 14988. A
thorough taxonomy, review, and analysis of attribute, risky choice, and sulcus and superior to the lateral (Sylvian) issure. Anatomically,
goal framing efects. there are several important landmark sulci and gyri in the frontal
McKenzie, C. R. M. 2004. Framing efects in inference tasks and lobe. he precentral sulcus lies anterior and parallel to the cen-
why they are normatively defensible. Memory and Cognition 32: tral sulcus and comprises a vertically placed precentral gyrus.
87485. he superior and inferior frontal sulci, extending from the pre-
McKenzie, C. R. M., and J. D. Nelson. 2003. What a speakers choice of central sulcus, separate the lateral surface of the frontal lobe into
frame reveals: Reference points, frame selection, and framing efects. three horizontally parallel gyri: the superior, middle, and infe-
Psychonomic Bulletin and Review 10: 596602. rior frontal gyrus. he inferior frontal gyrus is divided into three
Schwarz, N. 1996. Cognition and Communication: Judgmental
parts: he pars orbitalis, the pars triangularis, and pars opercu-
Biases, Research Methods, and the Logic of Conversation. Mahwah,
laris constitute brocas area, an important region for speech
NJ: Erlbaum. A summary of empirical evidence indicating that many
and language.
supposed shortcomings of human judgment are due to experimen-
tal subjects going beyond the literal meaning of the information Functionally, the frontal lobe can be divided into motor areas
provided by the researcher and drawing on the pragmatic meaning. and prefrontal cortex. he motor areas include the primary
Shair, E., and R. A. LeBoeuf. 2002. Rationality. Annual Review of motor cortex, the premotor cortex, the supplementary motor
Psychology 53: 491517. cortex, frontal eye ield, and Brocas area. he primary motor
Sher, S., and C. R. M. McKenzie. 2006. Information leakage from logi- cortex (Brodmanns area, BA 4), also called the motor cortex, is
cally equivalent frames. Cognition 101: 46794. located anterior to the central sulcus and the adjacent portion of
. 2008. Framing efects and rationality. In he Probabilistic the precentral gyrus. he primary motor cortex executes skilled,
Mind: Prospects for Bayesian Cognitive Science, ed. N. Chater and M. voluntary movement on the contralateral (opposite) side of the
Oaksford, 7996. Oxford: Oxford University Press. A summary of prob-
body. hat is, the right hemisphere controls the left side of the
lems of information nonequivalence in the framing literature and of
body and the left hemisphere controls the right side. he entire
the application of information equivalence to other areas of psycho-
human body is represented in the primary motor cortex, and
logical research.
Tversky, A., and D. Kahneman. 1981. he framing of decisions and the these representations are arranged somatotopically, such that
psychology of choice. Science 211: 4538. each area of the body is related to a speciic area on the motor
. 1986. Rational choice and the framing of decisions. Journal of cortex (see Figure 2). his somatotopic mapping, also know as
Business 59: S25178. An inluential discussion of risky choice framing the homunculus (in Latin, little man), is mapped according to
efects and their implications for models of rational choice. the amount of brain matter devoted to each particular body part,
rather than by the proportional body-part size. hus, there is a
relatively larger portion of the motor cortex dedicated to laryn-
FRONTAL LOBE
geal, tongue, and inger areas that involve ine and complex
he evolution of the frontal lobes in humans is strongly associ- movements, and a relatively smaller portion of the motor cortex
ated with the advent of language, the primary behavior distin- dedicated to the trunk and limbs that have relatively gross and
guishing humans from other primate species. Although not simpler movements.
much is known about how the frontal lobes evolved, we do know he premotor cortex lies anterior and parallel to the primary
that there was a marked expansion 3.3 to 2.5 million years ago motor cortex. he premotor cortex stores motor schemata (ele-
in Australopithecus Africanus, a prehistoric form of human. his mentary motor acts) and selects movements to be executed for
frontal lobe development, coupled with growth of the temporal externally cued movements. Together with the prefrontal cor-
lobes also associated with language function, coincides with the tex, the premotor cortex prepares, initiates, selects, and learns
creation and use of tools. Tool use may have given rise to the movement patterns. For language, thus, the premotor cortex
development of hand preference and eventual lateralization of determines how humans are going to move their articulators
language functions. Verbal communication likely evolved from to generate speech. here is also a supplementary motor area
hand signals to vocalizations, which progressively came to sig- located in the mesial prolongation of the premotor area. his
nify common referents (i.e., words) used in spoken language. supplementary area helps to execute spontaneous or self-initi-
hus, the genesis of a linguistic symbolic communication sys- ated movements, and damage to this area causes reduced spon-
tem inluenced, and in turn was inluenced by, brain growth and taneous speech and apraxia of speech.
324
Frontal Lobe
Figure 1. Frontal Lobe (left hemisphere).

Source: Adapted from Gray, Henry. Anatomy of the
Human Body. Edited by Warren H. Lewis. Philadelphia: Lea
& Febiger, 1918. New York: Bartleby.com, 2000.
Figure 2. The motor homunculus.

Source: Penield, W, and Rasmussen T. The Cerebral Cortex of
Man. New York: Macmillan, 1950.
325
Frontal Lobe
Brocas area consists of the foot of the third frontal convolu- Neurological Underpinnings of Our Knowledge
tion (the inferior frontal gyrus), just in front of the motor strip. of Frontal Lobe Responsibilities
Traditionally thought to be responsible for the production of lan- Lesions of the frontal lobe have resulted in a number of speech
guage, it has also been implicated in comprehension of syntax. and language deicits, as well as motor, memory, behavioral,
As with any of the perisylvian regions involved in language, it emotional and cognitive problems, that afect speech and lan-
appears to contribute to lexical access as well. guage functioning. he main speech and language disorders
he second major portion of the frontal lobe, in addition to include dysarthria, apraxia of speech, mutism, and aphasia.
the motor areas just discussed, is the prefrontal cortex, located Damage to the motor areas in the frontal lobe causes weakness
in the foremost part of the frontal lobe, in front of the motor or paralysis of the face and articulators, such as the larynx, phar-
area. he ratio of the prefrontal cortex to the rest of the cortex is ynx, palate, tongue, lips, and jaw. Dysarthria is a speech pro-
larger in humans than in any other species. he prefrontal cortex duction disorder due to impaired control of these articulatory
is divided into three major regions: dorsolateral, orbitofrontal, systems, and the patients may show abnormal acoustic and
and medial prefrontal cortex. Although there is some contro- phonetic patterns, hypophonic speech (decreased vocal vol-
versy, currently many researchers associate diferent behavioral ume), deviant intonation, and prosody disturbances in their
changes and cognitive disturbances with each of these three speech. Apraxia of speech, a disorder of motor planning and
prefrontal regions. Dorsolateral prefrontal cortex is suggested to programming of speech without muscle weakness or paralysis, is
have a role in working memory, attention, planning, and prob- characterized by an inability to execute appropriate movement
lem solving. Lesions in this area cause an inability to plan cor- for articulation of speech. Apraxic patients phonological
rectly and diiculty with multistep tasks. he orbitofrontal cortex errors are inconsistent, and articulatory accuracy is better
is associated with emotion and the initiation of behavior. Lesions for automatic speech than volitional speech. Also, patients with
of this area lead to disinhibition, socially inappropriate behav- frontal lobe damage may lack voluntary initiation of conversa-
ior, and change of afect. Finally, the medial/cingulate prefron- tion and evidence marked reduction in language production.
tal cortex is related to motivation and drive. Lesions in this area In some cases, frontal lobe damage results in mutism, that is,
result in apathy, loss of interest in ones life, and reduced sponta- preserved comprehension but no verbal or gestural intent to
neous speech and movement. communicate with others. Patients with mutism show a lack of
he prefrontal cortex is extensively connected to the motor, frustration about their breakdown of expression, unlike patients
perceptual, and limbic systems of the brain. It sends and receives with Brocas aphasia.
great amounts of information and controls cognitive processes Brocas aphasia is traditionally associated with a lesion to
so that appropriate behaviors are carried out at the correct time Brocas area in the left frontal lobe, and characterized by rela-
and place. It takes part in higher aspects of motor control and tively intact comprehension but efortful, short sentence produc-
various cognitive functions, such as inhibition, problem solv- tion with morphosyntactic errors and few functors: his form of
ing, abstract thinking, emotional control, and appropriate social speech production is often called telegraphic speech. However,
behavior. Neuropsychologists consider the prefrontal cortex contemporary brain imaging studies suggest that more extensive
particularly responsible for executive function, by which they left frontal lobe lesions are required to cause persistent Brocas
mean the ability to plan, monitor, and make inferences, as well aphasia. Also, transcortical motor aphasia, with good repeti-
as related self-governing behaviors. he old notion of a com- tion and comprehension but impaired speech abilities, can also
puter with a central processor that determines how the rest of result from damage of the frontal lobe.
the systems computational abilities are to be used is sometimes Patients with frontal lobe damage in whom Brocas area is
employed when the role that executive function plays vis--vis spared generally do not produce grammatical errors. hey do,
other cognitive abilities is discussed. When distinction is made however, usually show a deicit in verbal luency. Generally, ver-
between automatic and controlled processes, it is the controlled bal luency is examined by so-called phonemic (e.g., list words
processes that the prefrontal cortex controls. Cognitive lexibility beginning with the letter s) or semantic category naming tasks
is also considered a role played by the frontal lobes. working (e.g., list animals). Also, perseveration (uncontrollable repetition
memory the ability to keep a limited amount of information of a particular word or phrase) is a frequently observed symptom
available for analysis is either assumed to be a part of executive in patients with frontal lobe lesions.
function or to be closely linked to it. Although reading and writing disorders (alexia and agraphia,
Language behaviors that involve executive function are the less respectively) are more often associated with posterior brain
automatic ones, such as appreciating humor, making inferences lesions, frontal lobe lesions can also manifest them. Frontal lobe
from nonexplicit phrasing, appreciating the pragmatic aspects of alexia, a reading disorder, involves diiculty in decoding words
communication, monitoring ones speech for speech errors and by the grapheme-to-phoneme method. Frontal lobe agraphia, a
correcting them, monitoring ones interlocutors to rephrase or writing disorder, results in spelling errors, including inappropri-
shift register if they appear not to be comprehending, selecting ate letter repetition and erroneous word selection.
the right language or mix of languages to speak in bilingual and Deicits of behavioral and emotional problems due to fron-
polyglot situations, avoiding culturally taboo words or topics that tal lobe damage may afect language and communication func-
would be inappropriate in a given context, and reanalyzing input tion as well. he close association of personality changes and
when garden-pathed or on other occasions when it appears not prefrontal cortex damage is best evidenced by the famous case
to make sense. One language task that appears to put substantial of Phineas Gage. Gage was a railroad construction worker who
burden on working memory is simultaneous interpretation. sufered a penetrating wound to his frontal cortex. Although he
326
Functional Linguistics
survived the serious accident, he was no longer himself. Before assumptions for a variety of empirical reasons (see papers in
the accident, Gage had been known as a hard-working, capable, Elman et al. 1996; MacWhinney 1999, and references therein).
and sociable man, but his personality changed radically after the Skepticism toward these tenets of formal linguistics has con-
accident. He uttered profanities, becoming impatient, obstinate, sequences for the kinds of data and research methodologies
itful, and antisocial. Indeed, in Gages case and similar ones, employed in functional approaches, many of which eschew
diiculty in socializing follows frontal lobe brain damage. hus, introspective grammaticality judgments and focus instead
while no overt language problems regarding the form of lan- on more inductive methodologies, such as evidence from cor-
guage, such as grammar, or regarding language content, such as pora of naturally occurring language, data gleaned from ield
semantic meaning, were reported in these patients, their use of elicitation, experimentation, statistical correlations, and ethno-
pragmatic language skills was impaired. graphic observation (see corpus linguistics).
In sum, the frontal lobes play a crucial role in various speech Widely known functional theories of grammar include sys-
and language processing tasks, including initiation of articula- temic functional linguistics (Halliday 1994), the theory of func-
tory movement, phonetic/phonological selection, morphosyn- tional grammar (Dik 1997), and the functional approach to
tactic production, syntactic manipulation, and integration of syntax explicated by Talmy Givn (2001), as well as role and
pragmatic function. Depending on the location and depth of the reference grammar. For the purposes of this entry, a func-
frontal lobe lesion, a mixture of cognitive and behavioral symp- tionally oriented approach is any approach that holds that
toms can appear with the disorders of these speech and language structural aspects of language (phonology, morphology,
functions as well. syntax, discourse organization [see discourse analysis
(linguistic)], etc.) are motivated and constrained by func-
JungMoon Hyun, Elizabeth Ijalba, Teresa M. Signorelli,
tional concerns. hese can be broadly divided into at least four
Peggy S. Conner, and Loraine K. Obler
overlapping, closely allied factors: 1) the role of communica-
tion and discourse, 2) the centrality of meaning, 3) human
cognitive, neurological, and physiological capacities, and 4) the
Anderson, Steven W., Hanna Damasio, Daniel Tranel, and Antonio social and interactional nature of human beings (see papers in
R. Damasio. 2000. Long-term sequelae of prefrontal cortex dam- Tomasello 1998 and 2002 for a thorough overview.) To arrive at a
age acquired in early childhood. Developmental Neuropsychology full understanding of the functional constraints and motivations
18: 28196.
that act on the form of language, it is necessary to consider the
Baddeley, Alan. 2003. Working memory and language: An overview.
joint contribution of all of these areas.
Journal of Communication Disorders 36: 189208.
Damasio, Hanna, Daniel Tranel, homas Grabowski, Ralph Adolphs, and
Functional research in the early twentieth century focused
Antonio R. Damasio. 2004. Neural systems behind word and concept primarily on the way in which communicative pressures orga-
retrieval. Cognition 92: 179229. nize sentence structure. For example, Prague School linguists
Miller, Bruce L., and Jefrey L. Cummings. 2006. he Human Frontal noted regular patterns of the placement of the theme (informa-
Lobes: Functions and Disorders. New York: Guilford. tion known by the recipient) and the rheme (information new to
Penield, W., and T. Rasmussen. 1950. he Cerebral Cortex of Man A the recipient). In pragmatically unmarked sentences in English,
Clinical Study of Localization of Function. New York: Macmillan. for example, known information tends to come early in a sen-
tence as its subject, while new information comes later, and
both information statuses tend to co-occur with certain types of
FUNCTIONAL LINGUISTICS
prosody. he key insight of this approach is that the functional
Functional linguistics, broadly deined, includes a wide range of need to organize and diferentiate known information from new
diverse approaches that highlight the interdependence of lan- information leads to prosodic and structural consequences for
guage structure and language function. In this view, structural the ways in which sentences are organized an illustration of
features of languages have evolved and continue to develop as function afecting form. Prague School work directly inluenced
a result of competing cognitive, communicative, ecological, and the approach known as functional sentence perspective (Firbas
social pressures. (See also grammaticalization.) Functional 1992), the functional theory of Andr Martinet (1962), and was a
research treats language neither as a purely formal object nor as a precursor to the cognitively based information low theory fully
closed autonomous system; rather, it considers language form to articulated by Wallace L. Chafe (1994; see also information
be tightly integrated with both the uses and users of language (see structure in discourse.)
also usage-based theory). As work in this area began to develop and mature, research
Within the context of current theories of linguistics, many expanded beyond the boundaries of simple communicative
functional approaches difer sharply from so-called formal theo- pressure to encompass wider and more complex discourse and
ries, such as government and binding or minimalism. While pragmatic contexts. Such work has become widely known as
lines of demarcation between formal and functional approaches discourse-functional linguistics. Current discourse-functional
are, of course, not always clear-cut, formal theories tend to begin work seeks to describe the ways in which particular grammati-
with the premise that grammar is innate, to assume the auton- cal constructions are used in discourse, to propose explanations
omy of syntax, to hold to the modularity of language from for the way those forms may have come into being through lan-
other cognitive faculties, and to impose a distinction between guage use, and to explicate the consequences of this research in
competence and performance. Functionally oriented lin- shaping the linguists understanding of the nature of grammar.
guistics, on the other hand, tends to question the validity of these For example, Sandra A. hompson and Anthony Mulac (1991)
327
Functional Linguistics Games and Language
describe the conditions for the occurrence and nonoccurrence social actions and, conversely, to understand in what ways social
of the complementizer that in a corpus of English conversation activities, such as turn-taking, conversational repair,
(e.g., in sentences like You can tell that its going to rain versus sequential organization, assessments, and preference structure,
I think its going to rain). hey ind the distribution of that to are directly implicated in shaping syntax and prosody. (For an
be probabilistically determined by functional factors, such as overview, several concrete examples, and further references, see
topic management and strength of epistemic commitment. he Ford, Fox, and hompson 2002).
explanation of these indings rests in the observed blurring of
Robert Englebretson
the distinction between main and subordinate clauses in these
contexts, whereby the so-called main clause in constructions
without that are actually best analyzed as epistemic adverbial
phrases (see also hompson 2002). hese indings suggest that Bolinger, Dwight L. 1977. Meaning and Form. New York: Longman.
the traditional binary distinction between main and subordi- Browman, Catherine P., and Louis Goldstein. 1986. Towards an articula-
nate clauses is not categorical, as it emerges out of speciic com- tory phonology. Phonology Yearbook 3: 21952.
Bybee, Joan L. 2001. Phonology and Language Use. New York: Cambridge
municative situations in natural discourse. (See Cumming and
University Press.
Ono 1997 for a thorough explication of the discourse-functional
. 2006. From usage to grammar: he minds response to repeti-
approach, including further examples and references; see also tion. Language 82.4: 71133.
emergent structure.) Chafe, Wallace L. 1994. Discourse, Consciousness, and Time: he Flow
In addition to communicative and discourse factors, func- and Displacement of Conscious Experience in Speaking and Writing.
tional research of the last several decades has also focused on the Chicago: University of Chicago Press.
role of semantics in shaping grammar (Bolinger 1977 inter alia; Cumming, Susanna, and Tsuyoshi Ono. 1997. Discourse and grammar.
Halliday 1994). In these views, meaning is central in motivating In Discourse Studies: A Multidisciplinary Introduction, Vol. 1. Ed. Teun
form, and, therefore, particular grammatical constructions have A. van Dijk, 11237. London: Sage.
the forms they do because of the particular meanings that they Dik, Simon C. 1997. he heory of Functional Grammar. 2 vols. 2d rev. ed.
convey. Two noteworthy paradigms emerging from this tradi- Ed. Kees Hengeveld. New York: Mouton de Gruyter.
Elman, Jefrey, Elizabeth Bates, Mark Johnson, Annette Karmilof-
tion include cognitive linguistics, which views language
Smith, Domenico Parisi, and Kim Plunkett, eds. 1996. Rethinking
form as motivated by meaning and conceptualization, and the
Innateness: A Connectionist Perspective on Development. Cambridge,
cross-linguistic approach known as typological-functional lin- MA: MIT Press.
guistics, which explores the ways in which similar meanings are Firbas, Jan. 1992. Functional Sentence Perspective in Written and Spoken
expressed in the grammars of languages around the world (see Communication. New York: Cambridge University Press.
cognitive grammar and typology.) Ford, Cecelia E., Barbara A. Fox, and Sandra A. hompson. 2002. Social
Other strands of functional research investigate the ways in interaction and grammar. In he New Psychology of Language. Vol. 2.
which neurological, cognitive, and physiological faculties of the Ed. Michael Tomasello, 11943. Mahwah, NJ: Lawrence Erlbaum.
human user motivate and constrain grammar and phonology. Givn, Talmy. 2001. Syntax: An Introduction. 2 vols. Rev. ed.
For example, Sidney M. Lamb (1999) outlines a connectionist, Amsterdam: John Benjamins.
relational network model of language based in what is known Halliday, M. A. K. 1994. An Introduction to Functional Grammar. 2d ed.
London: E. Arnold.
about neurons and connections among neurons and the neuro-
Lamb, Sydney M. 1999. Pathways of the Brain: he Neurocognitive Basis of
logical processes of activation and inhibition. Functional theories
Language. Amsterdam: John Benjamins.
of phonology likewise take seriously the conceptual, neurologi- MacWhinney, Brian, ed. 1999. he Emergence of Language. Mahwah,
cal, and physiological aspects of language users, for example, NJ: Lawrence Erlbaum.
articulatory phonology (Browman and Goldstein 1986), which Martinet, Andr. 1962. A Functional View of Language. Oxford: Clarendon
views the abstract phonological system as constrained by the Press.
physical system of articulatory phonetics. More recently, hompson, Sandra A. 2002. Object complements and conversa-
Joan L. Bybee (2001) has proposed an exemplar-based theory tion: Towards a realistic account. Studies in Language 26.1: 12564.
of phonology, which likewise treats phonology as grounded in hompson, Sandra A., and Anthony Mulac. 1991. he discourse condi-
the concrete physiology of phonetics and considers speech as tions for the use of the complementizer that in conversational English.
Journal of Pragmatics 15: 23751.
a neural-motor activity subject to the same processes observed
Tomasello, Michael, ed. 1998, 2002. he New Psychology of
in other complex motor activities such as routinization, simpli-
Language: Cognitive and Functional Approaches to Language Structure.
ication, and entrenchment based on frequency. (See also Bybee
2 vols. Mahwah, NJ: Lawrence Erlbaum.
2006 for an extension of this approach to grammar in general.)
Recent functional research has also begun to concentrate on
ways in which the inherent social and interactional nature of lan-
guage users likewise motivates and constrains language form. One
G
socially oriented approach has become known as interactional
linguistics, incorporating insights from sociology especially the
GAMES AND LANGUAGE
subields of ethnomethodology and conversation analysis.
Interactional linguistics views language as a form of highly sys- Rules and Language Games
tematic social action. Interactional research seeks to describe the One traditional view in philosophy and linguistics is that with-
prosodic and syntactic forms that regularly accomplish speciic out rules of usage common to the speaker and the listener,
328
Games and Language
communication would be impossible. According to it, every Strategic Games and Nash Equilibria
linguistic expression has a meaning, which is determined by One of the most legendary games in classical game theory is the
the rules for its correct use. his obviously brings language and Prisoners dilemma: Two criminals, 1 and 2, are interrogated in
games together, for it is in the latter that rules are explicitly given. separate cells. If they both confess, each of them will be punished
Here are two examples of language-games, which illustrate to stay three years in prison. If only one of them confesses, he will
in a very simple and ideal way how a communication language be free while the other will be punished with four years in prison.
could emerge out of language games. hey are from the Finnish If neither of them confesses, each will stay one year in prison. he
logician Erik Stenius, who thought that they are typical examples following igure depicts the choices and payofs of the players of
of the view of language advocated by Ludwig Wittgenstein in his the Prisoners dilemma:
later period.
he Garden Game is played by a gardener A and his assistant D C
B. here are pieces in the game, the letters a, b, c, P, and Q and D (1,1) (4,0)
a lower bed divided into squares as in the following igure. In C (0,4) (3,3)
every square there is a plant.
D stands for dont confess and C stands for confess.
As we see, a complete description of a strategic game with
3rd day two players requires a list of the players action repertoires A1
2d day and A2, and a speciication of their utility functions u1 and u2.
u1 speciies, for each possible sequence of choices (a,b) (action
c B a 1st day
proile) player 1s payof. In our example, we have: u1(D,D) = 1,
u1(D,C) = 4, u2(D,C) = 0, and so on.
he game amounts to this: Every day B writes on a piece of
Many of the games studied in classical game theory are strictly
paper the letters a, b, c, and to the left of any of these letters he
competitive (zero-sum games): he players payofs are strictly
writes either the letter P or the letter Q, according to whether the
opposed; that is, for each action proile (a,b), u1(a,b) + u2(a,b) =
plant in the square corresponding for that day to the lowercase
0. Matching pennies is a typical example of such a game: Each
letter is in lower or not. For instance, if in the rectangle for that
player chooses either Head or Tail. Player 2 pays player 1 one
day the plant next to the path is in lower, whereas the two others
euro if the two choices match. Otherwise, it is player 1 who pays
are not, B will write:
player 2 one euro in Matching pennies:
Pa Qb Qc
he teaching of the game is done by simple gestures of H T

approval and disapproval, depending on whether B writes the
H (1,1) (1,1)
correct tokens on the piece of paper.
Once the assistant masters the Garden Game, A and B move T (1,1) (1,1)
to play the Report Game. A does not need to accompany B to
the lowerbed any longer. A now takes part in the game only Given a strategic game, we are interested in optimal plays of the
by receiving the written tokens from B. If B really follows the game, that is, in every players action being the best response
rules of the game, A can read of certain facts from what B has to the actions of his opponents. For simplicity, we consider the
written. Prisoners dilemma game, but the points we make are general.
It is obvious that by means of the report game, A and B have Consider an arbitrary action proile (a,b). It is a Nash equilibrium
created a small language for communication: a, b, and c are used in the strategic game if none of the players would have been bet-
to denote certain squares, and so on. hey get a meaning. ter of by making a diferent choice:
Steniuss language games had more of a philosophical purpose, uI(a,b) uI(c,b), for any c in A1
namely, to give concrete examples of Wittgensteinian language-
games. hey inspired the philosopher David Lewis, who formulated uII(a,b) uII(a,d), for any d in A2.
them in a more precise way. In doing so, Lewis thought to respond hus, in the Prisoners dilemma game, (C,C) is a Nash equilib-
to a challenge launched by another philosopher and logician, rium, but there is no Nash equilibrium in the Matching pennies
Quine. W. V. O. Quine regarded with distrust conventional views game. here are games that have two Nash equilibria, like the
of language and doubted that one can give a coherent account of following one where both (L,L) and (R,R) are Nash equilibria:
how communication takes place without already presupposing
some degree of linguistic competence. Lewis formulated signal-
L R
ing games, that is, communication games played by two players,
the sender and the receiver, the former sending messages or sig- L (1,1) (1,1)
nals about the situation he or she is in, and the latter undertaking a R (1,1) (1,1)
certain action after receiving it. he point to be emphasized is that
the messages do not have a prior meaning: Whatever meaning they In this case, the problem is to devise a criterion for selecting
are going to acquire, it will be the result of the interactive situation between them. One such criterion is Pareto optimality:
in the signaling game, or in other terms, they will be optimal solu-
tions in the game. Let us have a closer look at the game-theoretical he action proile (a,b) is Pareto more optimal than (a,b), if a > a
setting. and b > b.
329
Games and Language
Lewiss Signaling Games between what is said and what is conveyed or implied. he former
Lewis modeled signaling games using the game-theoretical is more or less conventional, semantic meaning, while the latter
apparatus just described. he game models an ideal communica- is something the speaker wants the hearer to understand from
tive situation, which involves a sender and a receiver, the former what is said, although not explicitly stated. In a seminal paper,
sending messages that the latter tries to interpret. More precisely, Paul Grice tried to account for such pragmatic inferences by mak-
whenever he or she is in a state t (one of the many from the set ing use of maxims of conversation, like Be relevant, Always say
T of possible states), the sender selects a message or form f from the truth, or Be as informative as possible, and so on. Recently,
a set F that he or she sends to the receiver. he receivers task is some attempts have been made to reduce and explicate these
one of interpretation; that is, whenever a message f is received, maxims in terms of rational principles of communication, which
he or she will associate it with a state in T. he signaling games advise the speaker to say as much as he or she can to fulill com-
are cooperative games; that is, both players try to achieve a communicative goals, and to say no more than he or she must to ful-
mon goal, communication. For this reason, whenever in state t ill those communicative goals. Gricean pragmatics found its way
the sender sends f and the receiver interprets f as t, both players recently into optimality theory, a linguistic theory that basi-
receive an equally rewarding payof. As in Steniuss Report cally compares alternative syntactic inputs to one another and
Game, the receiver does not have full knowledge about the state selects as the optimal meaning the one associated with the syntac-
the sender is in. he situation becomes even more complicated tic form that expresses it in the most eicient way. It should come
when, for instance, the sender may use more than one form in as no surprise that the ranking and judging of representations and
the same state. he optimal situation is the one in which com- meanings in optimality-theoretic interpretation has a structure,
munication is achieved: he receiver associates with each mes- which resembles principles developed in strategic games.
sage f the state t in which the sender was when he or she sent f.
he game-theoretical analysis is meant to show that the optimal Hintikkas Semantical Games
situation can be obtained as one of the solutions (Nash equi- Games are also used to characterize diferent notions of depen-
libria, Pareto optimality) to the game. We may, of course, won- dence in logic and language. In contrast to communication
der how the game can achieve this somehow miraculous result games whose task it is to model how expressions of the language
without the receivers having full knowledge of the situation the acquire an interpretation, in semantical games associated with
sender is in. Well, in a way, the game cannot achieve it: In the natural or formal languages, it is presupposed that expressions
setting just described, the game will yield several solutions (Nash already have an interpretation. What we want, instead, is a way
equilibria); that is, there will be more than one way to pair mes- to characterize the dependence (and independence) of certain
sages with states. If we want to discriminate among them, more expressions of the language on other expressions in terms of the
information is needed. It may come in diferent layers. interaction of the players in a semantical game. Here is a typical
One kind of additional information the players may have is a example from the mathematical vernacular:
prior probability distribution over the states in T: Some of them A function y = f(x) is continuous at x0 if given a number however
are more probable than others. But even so, it is often the case small, we can ind such that |f(x) f(x0)|< , given any x such
that there are several Nash equilibria in the game. Lewis would that |xx0| < .
say that in this case, one of those is chosen, the most salient one.
In game-theoretical terms, we can ind is represented by an
existential player, , and given any is represented by a universal
Anyhow, the existence of several equilibria shows, according to
player, , both choosing individuals from the relevant universe

him, the conventional character of the formsmeanings pairs.
If one does not like this form of conventionalism, then some
more discriminative information is needed. For instance, we may of discourse. hus, the property of the function f being continu-
ous at x0 is characterized by a game in which the universal player
chooses an individual from the universe, after which the exis-
assign costs to messages and assume that it is more rational for
tential player chooses an individual , and inally the universal

the sender to send less expensive messages. It can be shown that
in some cases where these conditions are fulilled, the combina-
tion of Nash equilibria and Pareto optimality leads to a unique player chooses an individual x. he game stops here. Unlike
solution. in the strategic games, which are one-shot games, semantical
games have a sequential element: here is a sequence of choices,
Signaling Games and Gricean Pragmatics with later choices depending on earlier ones, and so on. he cru-
We saw that signaling games are useful for modeling communi- cial notion is not any longer that of Nash equilibrium but that of
cative situations in which the players extract information from winning strategy. In other words, the game-theoretical paradigm
linguistic messages according to some general principles of in this case is that of extensive games.
rational behavior. We mentioned the case of the senders being
forced to consider alternative expressions he or she could have Extensive Games
used, together with their costs, and so on. It was then assumed It is customary to exhibit extensive games as a sequence
that it is more rational for the sender to send less expensive
messages. G = (N,H,P,(ui)iN)
One inds similar features in the so-called Gricean where N is a collection of players, H is a set of histories, P is a
pragmatics (see conversational implicature). In this function attaching to each nonmaximal history the player whose
case, one is not so much concerned with the question of how turn it is to move, and ui is the utility function for player i, that is,
expressions acquire their meanings, but rather with the distinction a function that associates with each maximal history in H a payof
330
Games and Language Gender and Language
for player i. In other words, each maximal history represents a Van Rooy, Robert. 2002. Optimality-theoretic and game-theoretic
play of the game, at the end of which each of the players is given approaches to implicatures. In Stanford Encyclopedia of Philosophy.
a payof. Unlike communication or signaling games, semantical Available online at: http://plato.stanford.edu.
games are strictly competitive 0-sum games: For each maximal
play, one of the players is winning and the other is loosing. GENDER AND LANGUAGE
he crucial notion is that of a strategy for a given player, a
method that gives the player the appropriate choice depending he term gender in this discussion refers to the social condition of
on the elements chosen earlier in the game. Such a strategy is being a woman or a man as distinct from sex, biological female-
codiied by a mathematical function g that takes as arguments ness or maleness. Sex may be relevant in areas of inquiry where bio-
the partial histories (a0,,an-1) in H where it is the players turn logical mechanisms are at issue (e.g., the organization of language
to move, and gives him or her an appropriate choice g(a0,,an-1); in the brain), but in most research, the issue is the social difer-
g is a winning strategy for the player in question if it guarantees a entiation of men and women. Gendered linguistic behavior arises
win in every maximal play in which he or she uses it. not because men and women are innately diferent but because of
the way the diference is made signiicant in the local organization
Semantical Games, Quantiiers, and Anaphora of social life. he forms and precise social signiicance of gender
Our informal description of the game associated with the dei- can vary considerably across cultures and through time.
nition of a continuous function should be suicient to con- Gender in this sense is also distinct from the use of the term
vey the idea that the game in question can be rephrased as an gender to denote a grammatical category. he relationship
extensive game. A maximal play of the game is any sequence between linguistic and social gender across languages has been
(,,x), with and x chosen by the universal player, and by studied extensively (e.g., Hellinger and Bussman 20013), but for
the existential player. As for utilities, if the chosen elements reasons of space, this body of work will not be considered here;
stand in the appropriate relations, that is, if whenever |xx0| < the focus will instead be on research investigating patterns of
we also have |f(x) f(x0)|< , then we declare the play to be a language use linked to the gender of the user.
win for and a loss for . Otherwise, it is a win for and a loss he relationship of gender to language has been of interest to
for . But now any winning strategy of has to be a function g scholars for a variety of reasons. On the one hand, the social fact
whose arguments are all the individuals chosen by earlier in of gender diferentiation (apparently universal in human cul-
the game. In other words, the logical priority of the quantiied tures) inluences processes such as language variation, change,
expression given any over we can ind an is captured by and shift and is, therefore, relevant for linguists understanding
the strategy g of the existential player, which is deined over any of those phenomena. On the other hand, language use is part
element chosen by . And the fact that this strategy is a win- of the process whereby gender is produced and reproduced as
ning one amounts to any of the sequence of elements (,g(),x) a social fact. his makes language of interest to scholars whose
satisfying the appropriate conditions; that is: If |xx0| < g(), main interest is in the social organization of gender relations,
then |f(x) f(x0)|< . language and identity, or inequality.
Another dependency phenomenon modeled by semantical he modern ield of language and gender studies emerged in
games is pronominal anaphora, as in the following sentence: the 1970s when the advent of second-wave feminism prompted
sympathetic researchers in language-related disciplines to under-
1. A woman is sitting on a bench. She smiles.
take a systematic examination of the language used by and about
We witness here a phenomenon of semantical depen- women. Adopting a broadly feminist political standpoint and a
dence: Before an expression (She) gets an interpretation, modern sociolinguistic perspective, these researchers reacted
another expression that is its head (A woman) must get an against the assumptions pervading previous discussions, which
interpretation. he semantical games in this case are completely had stereotyped women language users as simultaneously
analogous to the quantiier game (in fact, the game involves exotic and inferior. Questions about sex diferences in language
quantiiers). he rules of the game will contain not only choices were reframed as questions about social identity, diference, cul-
prompted by quantiied expressions but also choices prompted ture, and power.
by the anaphoric pronoun: She prompts a move by the exis- In the early phase of the ields development (roughly 1975 to
tential player, who must now choose the same individual chosen 1990), researchers worked largely within a framework of inter-
earlier as a possible value of the indeinite A woman. est in identifying and explaining diferences between men and
women. his work continued the earlier tradition of treating
Gabriel Sandu
womens linguistic behavior as marked with respect to mens,
but the questions that researchers asked were diferent and so
were their motivations. Some gender diference studies were
Dekker, Paul, and Robert van Rooy. 2000. Bi-directional optimality the- animated by a desire to establish the (in)validity of sexist folk-
ory: An application of game-theory. Journal of Semantics 17: 21742. linguistic stereotypes like women talk incessantly or women
Grice, Paul. 1975. Logic and conversation, In Syntax and Semantics 3: cant tell jokes. Other scholars were interested in exploring how
Speech Acts, ed. P. Cole and J. L. Morgan. New York: Academic Press.
diferences between mens and womens ways of speaking might
Hintikka, Jaakko, and Gabriel Sandu. 1991. On the Methodology of
arise from the social reality of male dominance (Lakof 1975).
Linguistics. Oxford: Basil Blackwell.
Lewis, David. 1969. Conventions. Cambridge: Harvard University Press.
his dominance current sought to raise consciousness about
Stenius, Erik. 1967. Mood and language-game. Synthese 17: 25474. the fact that in language as elsewhere, women were relegated to
331
Gender and Language
second-class citizenship by the way they were socialized to speak avoided by women and men. Some of the reasons for this difer-
and write, the way they were judged as speakers and writers, and entiation have to do with the inluence of gender norms and ste-
the way they were conventionally represented in speech and reotypes. Girls may be instructed by parents and teachers that the
writing. An alternative, cultural diference current placed more use of features that index modesty or deference is appropriate for
emphasis on the idea that women and men (and, importantly, them, whereas boys may be ridiculed for using those same fea-
girls and boys) grew up in diferent social worlds in which they tures; speakers in each gender group may develop an investment
learned diferent rules for verbal communication (Maltz and in using the features to the extent that they also have an invest-
Borker 1982). ment in being judged as gender appropriate. he approach, how-
By the end of the 1980s, however, many researchers were turn- ever, allows for the possibility that not everyone does develop
ing away from the gender diference paradigm and abandoning such an investment: here have been various studies of groups
what was increasingly seen as an unproductive quest for global whose behavior appears to be shaped by a conscious refusal of
generalizations. he more that empirical indings accumulated, gender appropriateness (e.g., Abe 2004; Bucholtz 1999; Okamoto
the more apparent it became that women and men could not use- 1995.) here are also speakers (e.g., some transgendered or
fully be treated as internally undiferentiated populations. It was transsexual individuals) whose investment in using features that
forcefully argued (notably by Eckert 1990) that since intragroup will index their adopted gender identity is such that they produce
diferences were as signiicant as intergroup ones, and since the extreme gender stereotypes (Kulick 1999).
variable of gender did not exist in isolation but always interacted Other reasons for using or avoiding features that indirectly
with other social variables, such as class, race/ethnicity, and age, index gender, though, have more to do with the demands of the
general statements to the efect that women do X and men do Y activities in which speakers are engaged. Bonnie McElhinny
were unenlightening, if not meaningless. (1995) reports that women police oicers in Pittsburgh adopt a
he traditional focus on binary gender diference began relatively afectless style of interaction regarded by some observ-
to yield to an approach that was more concerned with gender ers as defeminizing, but that they are quite clear that they are not
diversity in other words, with the use of linguistic variability as trying to talk like men; they are trying to talk like police oicers.
a resource for producing a range of gendered styles in diferent hey also believe that the style of talk required is not simply a
communities or contexts. Researchers followed the injunction to contingent norm relecting the historical domination of policing
look locally (Eckert and McConnell-Ginet 1992) at the forms by men but is intrinsically demanded by the nature of the work.
that gender identities and relations take in speciic communities Although some styles do have gendered connotations (and his-
of practice (CoPs), on the grounds that gender-linked patterns tories) that can pose problems for individuals whose gender is
of language use will emerge from the localized social practices stereotypically incongruent with them, it is clear that the way
in which women and men are engaged. his led, among other men and women behave in diferent contexts has as much to do
things, to a wave of empirical research conducted with more with the nature of those contexts as with gender per se.
socially and linguistically diverse groups of subjects, and deal- It is also clear that gender itself is not always and everywhere
ing with masculinity as well as femininity (Johnson and Meinhof indexed in similar ways because there is cross-cultural and his-
1997). torical variation in the social roles allotted to men and women
he shift was also theoretical in nature, as language and gen- and the qualities ascribed to them (linguistic markers of which
der scholars were inluenced by the more general critique of gen- become secondary indices of gender). he Japanese association
der essentialism (belief in masculinity and femininity as ixed and of femininity with delicacy may seem natural to Westerners,
invariant essences). Some adopted the performative account of too, but the association is not made by, for instance, the villag-
gender put forward by Judith Butler (1990) (see also sexuality ers of Gapun in Papua, New Guinea, who characterize womens
and language), while others took up alternative theoretical language, like women themselves, as blunt, direct, and aggres-
approaches exemplifying the shift from essentialism to a more sive (Kulick 1993). Nor can it be automatically assumed that such
radical social constructionism. One inluential theoretical con- associations, ideologically powerful though they may be, deter-
tribution was made by linguistic anthropologist Elinor Ochs mine the actual behavior of most speakers. For instance, recent
(1992), who used the concept of indexicality (see indexicals to research analyzing the speech of working-class and rural women
give an account of the relationship between language and gen- in diferent parts of Japan points to the practical irrelevance for
der that could accommodate empirical observations about its many Japanese women of the idealized normative construct
locally variable and context-embedded nature. Pointing out that womens language (Okamoto and Shibamoto Smith 2004).
few features of languages directly and exclusively index gender, Whereas early language and gender researchers often took
she suggested that masculinity and femininity were most often issue with prefeminist generalizations about mens and womens
indexed indirectly by the use of linguistic features whose primary language use, more recent researchers inluenced by the shifts
meaning related to particular roles or qualities (e.g., mother- just outlined have revisited many of the generalizations made
hood or modesty) but which had come to connote masculin- during the 1970s and 1980s. he classic claims of variationist
ity or femininity by association. sociolinguistics about gender (that women are generally closer
he implication of this line of argument is that we should not than men to prestige norms and lead in change from above
expect direct and unmediated correlations between speakers because of their greater status consciousness, but are otherwise
gender and features of their language use. he correlations are conservative) have been substantially revised (e.g., Labov 2001).
typically indirect, the results of a process whereby features with While the new variationist orthodoxy is still a gender generaliza-
other primary meanings are diferentially appropriated and/or tion (that women tend to lead in both change from above and
332
Gender and Language
change from below), it does not permit stereotypical explana- as men is one way to deal with this marginal social position-
tions in terms of roles or psychological dispositions shared by all ing. If so, it is evident that inequality, rather than just diference,
women but, on the contrary, requires an account to be given of shapes the relationship of language use to gender.
womens nonuniform sociolinguistic behavior. here continues A recent external development to which researchers are now
to be debate on the claim that women are by and large more beginning to respond is the rise of scientiic paradigms such
polite speakers than men (see politeness) by reason of their as evolutionary psychology, which, in addition to being
subordinate social positioning (Lakof 1975; Brown 1980), with generally critical of feminist social constructionism, have made
some researchers suggesting that this generalization still has speciic claims (some of them empirically ill-founded see,
value (e.g., Holmes 1995), while others are more skeptical (e.g., e.g., Hyde 2005) about malefemale linguistic diferences as
Mills 2003). the hardwired products of millennia of natural selection. he
here is also debate on the theoretical assumptions of the recent resurgence of biological essentialism in both academic
new paradigm itself: It can be asked whether the emphasis on and popular culture constitutes a challenge, both intellectual
looking locally, stressing the diversity and variability of gen- and political, that language and gender researchers, in my view,
dered behavior, is resulting in a reluctance to think globally, should not ignore. Yet while in the future there may well be more
which risks throwing the feminist baby out with the essentialist discussion of the relationship between sex and gender, I think
bathwater. For some commentators, caution about treating gen- it is unlikely that researchers will abandon the commitment to
der as an overarching social category or making generalizations (some variant of) the social constructionism that has proved so
about it is problematic, in that it obscures or downplays inequali- productive in recent years.
ties that, although they may not be global in the sense of univer-
Deborah Cameron
sal and exceptionless, are not localized to just one community
of practice. here is also concern that some current approaches
overemphasize the agency of subjects in constructing gendered WORKS CITED AND SUGGESTIONS FOR FURTHER READING
personae while downplaying the structural and institutional fac- Abe, Hideko. 2004. Lesbian bar talk in Shinjuku, Japan. In Okamoto
tors that in reality constrain their performances. his concern is and Shibamoto Smith 2004, 20521.
addressed in recent research dealing with womens use of lan- Baxter, Judith, ed. 2006. Speaking Out: he Female Voice in Public
guage in the workplace and in other public domains a tradi- Contexts. Basingstoke, UK: Palgrave.
tional feminist research topic that is now being revisited from Brown, Penelope. 1980. How and why are women more polite? In
newer theoretical perspectives (e.g., Baxter 2006; Holmes 2006; Women and Language in Literature and Society, ed. Sally McConnell-
Ginet, Ruth Borker, and Nelly Furman, 11149. New York: Praeger.
Walsh 2001).
Bucholtz, Mary. 1999. Why be normal? Language and identity practices
Penelope Eckert (2000) is among those researchers who
in a community of nerd girls. Language and Society 28: 20323.
believe that looking locally can and should be combined with
Butler, Judith. 1990. Gender Trouble: Feminism and the Subversion of
thinking globally about gender. Eckert carried out research in Identity. New York: Routlege.
a suburban high school near Detroit, where identity and social Cameron, Deborah. 2006. On Language and Sexual Politics.
practice were organized around the contrast between jocks London: Routledge
(who embrace mainstream deinitions of school success, e.g., Eckert, Penelope. 1990. he whole woman: Sex and gender diferences
participating actively in both academic and extracurricular pur- in variation. Language Variation and Change 1: 24568.
suits) and burnouts (who reject the schools values and resist . 2000. Gender and sociolinguistic variation. In Language and
active participation in its oicial culture). Ailiation in these Gender: A Reader, ed. Jennifer Coates, 6475. Oxford: Blackwell.
groups was marked linguistically as well as in other ways: Jocks Eckert, Penelope, and Sally McConnell-Ginet. 1992. hink practically
and look locally: Language and gender as community-based practice.
made more use of conservative vowel pronunciations, whereas
Annual Review of Anthropology 12: 46190.
burnouts used more innovative urban variants. In both groups,
. 2003. Language and Gender. Cambridge: Cambridge University
however, it was girls who were more advanced in the use of the Press.
variants that indexed group membership. Eckert suggests that Hellinger, Marlis, and Hadumod Bussman, eds. 20013. Gender Across
they were symbolically claiming status as good jocks or good Languages: he Linguistic Representation of Women and Men. 3 vols.
burnouts and that this relected the difering terms on which Amsterdam: John Benjamins.
the two sexes participated in their CoPs. Males gained status by Holmes, Janet. 1995. Women, Men and Politeness. London: Longman.
displaying ability (e.g., in sports or ighting), but females status . 2006. Gendered Talk at Work. Malden, MA: Blackwell.
was more dependent on appearance and personal style: hey Holmes, Janet, and Miriam Meyerhof, eds. 2003. he Handbook of
were obliged to work harder to assert in-group status by means Language and Gender. Malden, MA: Blackwell.
of symbolic details like the styling of their jeans and the pronun- Hyde, Janet Shibley. 2005. he gender similarities hypothesis. American
Psychologist 60: 58192.
ciation of their vowels.
Johnson, Sally, and Ulrike H. Meinhof, eds. 1997. Language and
Eckert argues that the pressures to which these high school
Masculinity. Oxford: Blackwell.
girls were responding are not conined to adolescent subcul- Kulick, Don. 1993. Speaking as a woman: Structure and gender in domes-
tures. Women, as the subordinate gender, may perceive their static arguments in a Papua New Guinea village. Cultural Anthropology
tus and legitimacy to be in question in all kinds of CoPs; making 8: 51041.
a symbolic display of in-group credentials pointedly present- . 1999. Transgender and language. GLQ 5: 60522.
ing oneself as, say, a real lawyer/athlete/truck driver or using Labov, William. 2001. Principles of Linguistic Change. Vol. 2. Social
resources such as language that are accessible to women as well Factors. Oxford: Blackwell
333
Gender Marking Generative Grammar
Lakof, Robin. 1975. Language and Womans Place. New York: Harper dependent on nouns of diferent classes, regardless of whether
and Row. or not the nouns themselves bear gender markers.
Maltz, Daniel, and Ruth Borker. 1982. A cultural approach to male- Agreement in gender with the head noun can be found in
female misunderstanding. In Language and Social Identity, ed. John J. other words in the noun phrase (adjective, determiner, demon-
Gumperz, 196216. Cambridge: Cambridge University Press.
strative, numeral, etc., even focus particle), in the predicate of the
McElhinny, Bonnie. 1995. Challenging hegemonic masculini-
clause, an adverb, and arguably in an anaphoric pronoun
ties: Female and male police oicers handling domestic violence.
In Gender Articulated, ed. Kira Hall andMary Bucholtz, 21743.
outside the clause boundary. For example, in Polish the femi-
London: Routledge. nine noun skarpeta sock (the controller) requires that many
Mills, Sara. 2003. Gender and Politeness. Cambridge: Cambridge other elements (targets) in the clause agree with it in gender: ta
University Press. jedna stara porwana skarpeta, ktra leaa na pododze this.f
Ochs, Elinor. 1992. Indexing gender. In Rethinking Context: Language one.f old.f torn.f sock(f) which.f lay.f on loor. Markers of
as an Interactive Phenomenon, ed. Alessandro Duranti and Charles gender often do not mark gender alone but may be portmanteau
Goodwin, 33558. Cambridge: Cambridge University Press. markers that combine information about gender with number,
Okamoto, Shigeko. 1995. Tasteless Japanese: Less feminine speech person, case, or other features.
among young Japanese women. In Gender Articulated, ed. Kira Hall If antecedentanaphor relations are accepted as agreement,
and Mary Bucholtz, 297325. London: Routledge.
languages in which free pronouns present the only evidence
Okamoto, Shigeko, and Janet Shibamoto Smith, eds. 2004. Japanese
for gender (gender distinctions being absent from noun phrase
Language, Gender and Ideology. New York: Oxford University
Press.
modiiers and from predicates) can be counted as having a
Walsh, Clare. 2001. Gender and Discourse: Language and Power in (pronominal) gender system. Such languages are rare the best
Politics, the Church and Organizations. London: Longman. known example is English, which is typologically unusual (see
typology) in this respect (Corbett 2005); another is Defaka
(Niger-Congo).
GENDER MARKING
Anna Kibort
Almost all languages have some grammatical means of divid-
ing up their noun lexicon into distinct classes, with devices
or markers occurring in surface structures under speciiable
conditions and providing information about the semantic Corbett, Greville G. 1991. Gender. Cambridge: Cambridge University
characteristics of the referent of the nominal head of the noun Press.
phrase. Gender marking is one such device, typically found . 2005. Number of genders. In he World Atlas of Language
in languages with a fusional or agglutinating proile; other Structures, ed. Martin Haspelmath, Matthew S. Dryer, David Gil, and
Bernard Comrie, 1269. Oxford: Oxford University Press.
devices are frequently grouped under the term classiiers and
Hockett, Charles F. 1958. A Course in Modern Linguistics. New York:
are typically found in isolating languages. he term gender is
Macmillan.
used both for the particular classes of nouns (a language may
have two or more genders, or noun classes) and for the whole
grammatical feature (a language may or may not have the fea- GENERATIVE GRAMMAR
ture of gender). he approach to linguistics known as generative grammar
here is always some semantic basis to gender classiication, (GG) was initially introduced by Noam Chomsky in the 1950s,
though it may be supplemented with additional formal (pho- and though it has developed continuously ever since, the core
nological and morphological) criteria. he semantic assumptions have remained remarkably constant. For instance,
criteria include humanness, animacy, sex, shape, form, con- at the highest level, GG has always maintained that a grammar
sistency, and functional properties. A minimal gender system efectively constitutes a set of formal rules that recursively enu-
consists of two genders (e.g., French), and this is the most com- merate all, and only all, the well-formed (i.e.,grammatical)
mon system; it was found in 50 languages of a sample of 256 sentences of a language (see recusion, iteration, and
(Corbett 2005). hree-gender systems (e.g., Russian) appear to metarepresentation ). As part of the process of construct-
be roughly half as common, and larger systems are increasingly ing such a model of linguistic knowledge, GG research has
less common. he largest system found so far is Nigerian Fula consistently attempted to describe how a speaker-hearer can
with around twenty genders (the exact count depending on the generate and comprehend an ininite number of grammatical
dialect). However, 144 of the 256 languages had no gender sentences despite encountering only a inite amount of primary
system. linguistic data (PLD) while learning any given language. In
Semantic distinctions between classes of nouns, even lexical order to account for this apparent conundrum, GG standardly
derivations (e.g., the English poet versus poetess), do not in them- assumes that the language capacity is a genetic endowment (see
selves make genders. his is because it is taken as the deinitional innateness and innatism ) that is distinctive to homo sapi-
characteristic of gender that some constituent outside the noun ens and which speciies those aspects of linguistic knowledge
itself must agree in gender with the noun. hus, gender refers that are genetically determined, as opposed to those that must
to classes of nouns within a language that are relected in the be acquired via contact with PLD. As a result of this emphasis
behavior of the associated words (Hockett 1958, 231), and a lan- on language acquisition, GG is often closely associated with
guage has a gender system only if we ind diferent agreements biolinguistics.
334
Generative Grammar
he following sections summarize the main stages in the rules (e.g., S NP VP) generates strings of symbols (the kernel
development of GG from the 1950s to the present. sentences of the grammar), and that a set of transformational
rules subsequently operates upon these strings, modifying them
The Early Years in order to derive further sentences (see transformational
During the 1930s and 1940s, prominent linguists in North grammar).
America sought to discover, in a systematic fashion, the gram-
matical rules that regulated the sentential structure of utterances
in any given corpus. For instance, in his Methods in Structural Development and Transition
Linguistics (1951), Zellig Harris identiied distributional discov- In the early 1960s, the irst generation of linguists who had
ery procedures that could determine the structure of a given lan- encountered GG as students came to maturity, and this group
guage, specifying rules that would enable (for instance) knife and included such inluential igures as John R. Ross, Paul Postal,
knives to be associated with a single underlying morphophone- James McCawley, and George Lakof. However, Chomsky con-
mic sequence. Such concerns came to typify the corpus-driven tinued to guide the development of GG, and a revised version
structural linguistics of the 1940s. was presented in Aspects of the heory of Syntax (1965; hence-
While still a student, Chomsky became dissatisied with forth ATS). While various techniques from 1950s-style TGG
the decompositional methodology that Harris (and others) had been retained, there were also conspicuous diferences.
were advocating, and he developed an alternative approach, For instance, the topic of language acquisition was now explic-
transformational generative grammar (TGG), that was itly addressed, and Chomsky suggested that the generation
designed to overcome limitations in the work of his contem- of grammatical structures was determined partly by innate
poraries. Although the rudiments of TGG were summarized in knowledge of language, thereby stressing the connection
Chomskys celebrated 1957 publication Syntactic Structures, between linguistics and cognitive psychology (see Chomsky
this monograph merely provided a high-level overview of 1965). In order to clarify this idea, he distinguished between
various techniques and theoretical assumptions that had been competence (i.e., a speaker-hearers knowledge of the for-
presented in earlier work, especially his then-unpublished mal aspects of language) and performance (i.e., a speaker-
manuscript he Logical Structure of Linguistic heory ([1955] hearers actual use of language in concrete situations), and he
1975; henceforth LSLT). suggested that the task of linguistic research was to provide a
he theory presented in LSLT assumes a hierarchy of analyti- description of the former. While elaborating this revised per-
cal linguistic levels (e.g., the phonemic level, the morphemic spective, Chomsky contrasted descriptive and explanatory
level). Consequently, smaller linguistic units (e.g., morphemes) adequacy (see descriptive , observational, and explana-
can be combined in a rule-driven manner to create larger lin- tory adequacy) and argued that a valid grammatical theory
guistic units (e.g., words). A fully articulated grammar of this must be both descriptively and explanatorily adequate. As a
kind would be able explicitly to produce all the grammatical sen- result, a theory of universal grammar (UG) became pos-
tences in a given language, and therefore the model is generative sible. Speciically, since ATS encouraged linguists to explain
rather than decompositional. Eventually, TGG came to be asso- how an idealized speaker-hearer eventually achieves linguistic
ciated with a number of distinctive and inluential theoretical competence while encountering only a inite amount of PLD,
stances such as the following: researchers began to focus more on the task of identifying those
properties that are common to all known languages, rather than
Syntax can be analyzed independently of semantics that merely producing isolated grammars for speciic languages.
is, sentences such as Colorless green ideas sleep furiously In the ATS framework, in addition to TGG-style phrase struc-
are grammatical though they are meaningless (Chomsky ture rules such as
[1955] 1975, 57; 1957b, 15).
Statistical techniques (such as inite state machines and S NP Aux VP
stochastic grammars) cannot generate all, and only all, the VP V NP
grammatical sentences in a given language, and therefore NP Det N
they cannot usefully be incorporated into comprehensive lin-
NP N
guistic theories (Chomsky 1957a).
Linguistic theories can be developed and presented in a Det the
rigorously axiomatic-deductive framework like that stan- Aux M
dardly used by mathematicians and logicians (Chomsky
[1955] 1975, 83). Chomsky also included subcategorization rules that contained
explicit information about sublexical features (1965, 85):
he detailed arguments that Chomsky developed in order to jus-
tify such beliefs caused many of his contemporaries to claim that N [+N, Common]
TGG was a more scientiic linguistic theory than any of its pre- [+Common] [Count]
decessors, and this partly explains why it was eventually received [+Count] [ Animate]
with such enthusiasm by linguists who were keen to establish
[Common] [Animate]
their discipline as a scientiic enterprise (see Tomalin 2006).
Focusing speciically on the syntactic level of analysis, the stan- [+Animate] [Human]
dard TGG framework assumes that a set of phrase structure [Count] [Abstract]
335
Generative Grammar
Figure 1.
LEXICON LEXICON
D-STRUCTURE NARROW SYNTAX
Move
PHONETIC FORM LOGICAL FORM

S-STRUCTURE
Figure 3.
where S0 is the initial state and Ss is the resulting stable state

with ixed parameter settings. Also during this period, the term
PHONETIC FORM LOGICAL FORM
E-language began to be used to refer to actual manifestations of
Figure 2. language in the external world, while I-LANGUAGE referred to
the ideal speaker-hearers internal, tacit knowledge of language.
Although it developed out of previous GG research, the GB
Such rule sets enable structures such as those found in model certainly introduced a new framework for linguistic
Figure 1 to be generated. Base-generated trees of this kind analysis. For instance, while deep structure (i.e., D-structure)
constituted deep structure representations, and the transfor- and surface structure (i.e., S-structure) were retained, a single
mational rules operated on them to produce surface structure rule, Move- (i.e., move anything anywhere), was used to gen-
representations (see underlying structure and sur- erate S-structures from D-structures, rather than a set of speciic
face structure ). movement transformations. In addition, the GB phrase struc-
ture component used x-bar theory, which posited structural
similarities between diferent phrasal categories, such as noun
Principles and Parameters
phrase (NP) and verb phrase (VP), and, crucially, Chomsky
By the late 1970s, GG had started to change once again, gradually
(1986) later extended these structural insights to functional cat-
emerging in the early 1980s as the modular government and
egories as well. Consequently, the basic GB framework can be
binding (GB) theory. he GB framework associates UG with a
represented as in Figure 2.
inite set of principles that are common to all languages, and a
During the early 1990s, the P&P approach was reformu-
inite set of parameters the settings of which vary from language
lated as the minimalist program (MP) (see minimalism ),
to language it therefore began to be referred to as the prin-
and in an attempt to reduce the theory to its bare essentials,
ciples and parameters (P&P) approach. In the GB formal-
some familiar GB elements (e.g., D-structure, S-structure,
ism, UG is understood to constitute a characterization of the
X-bar theory) were rejected in favor of a simpler, more eco-
childs pre-linguistic initial state and the parameters are ixed
nomical methodology. Accordingly, the basic schema for the
as PLD are encountered, thus creating a stable-state grammar
MP framework can be presented as in Figure 3. Specifically, in
(Chomsky 1981, 7). Schematically,
the MP, an I-language generates expressions that pair instruc-
S0 + PLD Ss tions for the articulatory-perceptual (A-P) system interface
336
Generative Grammar Generative Poetics
with instructions for the conceptual-intentional (C-I) system Newmeyer, F. J. 1986. Linguistic heory in America. Orlando, FL: Academic
interface. CHL (the computational component of UG) contains Press.
a small set of operations (e.g., Select, merge ), which manipu- Tomalin, M. 2006. Linguistics and the Formal Sciences.
late lexical items (LIs). LIs are defined in terms of irreducible Cambridge: Cambridge University Press.
features, and (crudely) CHL combines LIs in various princi-
pled ways in order to create larger syntactic objects, with all
GENERATIVE POETICS
superfluous machinery (e.g., projections, labels) being omit-
ted. A computation converges if it converges at the A-P and Generative poetics comprises all theories that seek to explain the
C-I interface levels, and, crucially, the MP hypothesizes that production and reception of literary works by reference to a set
natural language constitutes an optimal solution to the var- of rules or algorithmic procedures. It is closely related to cogni-
ious demands imposed by the external interfaces. In other tive poetics. However, generative poetics has tended to draw
words, the MP seeks to determine just how perfect natural inspiration from Chomskyan generative grammar. In con-
language actually is (Chomsky 1995, 9). trast, many writers in cognitive poetics have aligned their analy-
Chomsky was initially rather vague about the nature of this ses with cognitive linguistics.
perfection. However, he has recently attempted to deine Early work in generative poetics tended to track developments
this notion with reference to the speciic requirements that in generative grammar. In Noam Chomskys usage, a generative
are imposed upon an I-language by the A-P and C-I interfaces. grammar produces all and only the grammatical sentences of
Accordingly, it is possible to identify various degrees of the essen- a language. For the early theorists of generative poetics, then,
tial minimalist thesis, with the most stringent being the strong a narrative grammar should produce all and only well-formed
minimalist thesis (SMT). In essence, if the SMT is correct, then narratives. Moreover, in both cases, it was commonly thought
there are no elements of S0 that cannot be accounted for in terms that this goal is best accomplished through a transforma-
of interface requirements and general (nonlinguistic) computational grammar.
tional properties; therefore, there are no inherently unexplain- Much of this early work was illuminating and important.
able aspects of S0. If this hypothesis is shown to be true, then However, there are several problems with modeling, for exam-
natural language would be, in this sense, perfect. ple, narratology on linguistic theory. First, linguistic theories
may change rapidly. If one bases ones narrative theory on any
Marcus Tomalin
speciic syntactic theory, ones narrative theory will probably be
outmoded by the time it is published. Second, there is no reason
to believe, a priori, that rules for the generation of stories should
Chomsky, N. [1955] 1975. he Logical Structure of Linguistic heory. directly parallel rules of syntax anyway. Finally, it is not clear that
there is any narrative counterpart to grammaticality. In other
. 1957a. Review of Hocketts A Manual of Phonology. International
words, the ambition of generating all and only well-formed
Journal of American Linguistics 23: 22334.
stories may be misguided. (he situation is, of course, diferent
. 1957b. Syntactic Structures. he Hague: Mouton.
. 1965. Aspects of the heory of Syntax. Cambridge, MA: MIT Press. for areas of poetics that are directly governed by linguistic rules.
. 1981. Lectures on Government and Binding. Floris, the Indeed, much of the most important work in generative poet-
Netherlands: Dordrecht. ics has been done in such areas; see meter, verse line, and
. 1986. Barriers: Linguistic Inquiry Monograph 13. Cambridge, poetic form, universals of.)
MA: MIT Press. he point about well-formedness is worth considering in fur-
. 1995. he Minimalist Program. Cambridge, MA: MIT Press. ther detail. here are, of course, speech actions that are clearly
. 2000a. New Horizons in the Study of Language and Mind. not stories and speech actions that are. he diiculty is that
Cambridge: Cambridge University Press. there is a gradient of more or less marginal cases, rather than a
. 2000b. Minimalist inquiries: he framework. In Step by
strict division between stories and nonstories. In short, story is
Step: Essays on Minimalist Syntax in Honor of Howard Lasnik, ed.
a prototype concept. Given the prototype nature of story, it
D. Michaels, J. Uriagereka, and R. Martin, 89115. Cambridge, MA:
is not clear just how the relevant data are organized, thus, just
MIT Press.
. 2004. Beyond explanatory adequacy. In Structures and what needs explaining. In other words, it is not clear precisely
Beyond: he Cartography of Syntactic Structures. Vol. 3. Ed. A. Belletti, what structures the generative rules should generate.
10431. Studies in Comparative Syntax. New York and Oxford: Oxford More exactly, the outputs of a generative system clearly
University Press. need to match the relevant data. At least two aspects of the
Harris, R. A. 1993. he Linguistic Wars. Oxford: Oxford University Press. Chomskyan approach are generalizable here. First, the system
Harris, Z. S. 1951. Methods in Structural Linguistics. Chicago: University should not overgenerate. For example, a generative grammar
of Chicago Press. might produce all grammatical sentences. But it is still invalid if it
Jackendof, R. 1977. X-Bar Syntax. Cambridge, MA: MIT Press. also produces he the of at as a grammatical sentence. Second,
Johnson, D. E., and S. Lappin. 1997. A critique of the minimalist pro-
the grammar cannot produce only sentences that have already
gram. Linguistics and Philosophy 20: 273333.
occurred. Such a grammar would be falsiied as soon as someone
. 1999. Local Constraints vs. Economy. Monographs in Linguistics
Series. Stanford, CA: CSLI.
uttered a new sentence. In short, the data comprise all possible
Lees, R. 1957. Review of Syntactic Structures. Language 33: 375408. speech actions of the relevant type (sentences, narratives) and
Matthews, P. H. 1993. Grammatical heory in the United States from no impossible ones. his returns us to the issue of intermedi-
Bloomield to Chomsky. Cambridge: Cambridge University Press. ate cases. However we determine the data, a generative system
337
Generative Poetics
should produce structures in keeping with the categorial organi- speciic to a group, 3) speciic to an individual, and 4) speciic to
zation of the data. If the data have a sharp well-formed/not well- a speech action. Generative grammar focuses almost entirely on
formed division with few intermediate cases, the system should the irst and second levels. Generative poetics, however, must be
generate structures divided in this way. If the data involve a more concerned equally with the third and fourth levels. his is con-
gradual gradient from excluded to included cases as we ind nected with the fact that verbal art is a normative category. We
with stories then the system should generate structures orga- care about instances of verbal art, and we care about the people
nized in this way. who produce works of verbal art. (We commonly care less about
In addition to this diference in the data, there is a normative individuals who interpret works of verbal art. In keeping with this
component to our concept of stories that is largely absent from preference, writers in generative poetics and related areas have
our concept of sentences. We routinely refer to some stories as tended to focus on production rather than reception. Ultimately,
better than others. However, we do not commonly think of some however, a research program treating these issues will need to
sentences as better than others. address reception as well; see competence and perfor-
Although it is not usually referred to as generative poetics, mance, literary, for one inluential approach to reception in
some recent work in cognitive science and literature does fall this context.)
into that category as deined here. his work avoids the prob- Consider a simple rule system for producing narrative. Such
lems of earlier approaches by developing theoretical principles a system might have three types of rules: 1) basic plot construc-
independently of particular grammatical theories and by recog- tion rules, 2) development principles that serve to specify the
nizing the prototype nature of our ordinary language concept of basic plot, and 3) evaluation rules, a form of adjustment rules
stories. that operates in authorial revision. here are two levels of basic
In order to understand this recent work, we need to consider plot construction rules. At the irst level, a few simple processes
what constitutes a generative rule system for a speech action. generate a story from a goal. hese processes might involve con-
Most obviously, a generative system needs a productive com- structing an agent who pursues the goal and the development
ponent and a receptive component. Parts of these components of some problem that prevents the achievement of the goal.
will be directly parallel. In other words, many rules of produc- Repeated over enough instances, such stories themselves form
tion and reception have to be systematically coordinated so that a second constructive level narrative prototypes. Narrative
when I say Could you ask John to pass the salt? my addressee prototypes do not deine basic conditions for being a story. hey
will understand the question in such a way that I will most often crystallize what counts as a good case of a story. hese proto-
end up getting the salt. On the other hand, some parts of these types help us to account for the gradient between central and
components will necessarily be diferent. For example, not every- marginal cases of stories. One may put the point simply by refer-
thing involved in producing an alliterative, rhyming, metered ence to acquisition. A child learns that Nice day! is not a story
verse line is also involved in reading that line. at all; I went out to buy a loaf of bread is a very marginal sort of
Whether speaking of productive or receptive speech actions, story; I went out to buy a loaf of bread, but there was a ifteen-
there are several ways in which we could organize the rules that car accident on the highway is more prototypical and so on,
constitute a generative system. No matter how we do this, we are up to the experiences of Bambi, which are highly prototypical.
likely to have processes (e.g., activation), structures in which pro- Some narrative prototypes are narrative universals.
cesses operate (e.g., episodic memory), and elements on which hree in particular recur cross-culturally heroic, romantic, and
processes operate (e.g., memories of particular experiences that sacriicial tragi-comedy. hese are speciied and combined in
we activate from episodic memory). Note that these divisions particular genres that vary from tradition to tradition.
need not be absolute. Processes could themselves be the ele- he nature of prototypes is such that they are relatively abstract
ments on which other processes (meta-rules) operate. Moreover, and common or ordinary. In contrast, successful literary works
some elements may incorporate processes operating on other are both concrete and distinctive. Development principles serve
elements. We see this in the case of scripts, such as the script to particularize the prototypes. In many, perhaps most, cases,
for eating at a restaurant. When these scripts are activated, they development principles are universal or common to a group,
guide our speech and behavior by integrating various preexisting though their precise operation involves individual idiosyncrasy.
processes and other elements. For example, when we decide to For example, in specifying characters, one common develop-
eat at a restaurant and follow our script for doing so, such pro- ment principle is to draw on exemplars, instances of particu-
cesses as activating our prototype for a menu are involved so lar people, real or literary. While this principle seems likely to be
that we can recognize menus and respond appropriately when universal or near universal, its results will vary with the precise
the server holds one out to us. Finally, diferent processes may exemplars employed. hese exemplars will, in turn, vary cultur-
only operate at diferent derivation levels. A common case of this ally and individually. hus, we ind that diferent traditions com-
sort would be the distinction between basic construction rules monly have a limited number of exemplary characters who serve
and rules that adjust the outputs of the basic construction rules. as important models to later writers. he principle is cross-cul-
For example, certain rules of politeness might not afect our tural, but of course the exemplars themselves difer (e.g., Jesus in
initial production of a sentence, but may enter as adjustments the Christian tradition versus Rama in the Hindu tradition).
performed while we are speaking. Evaluation rules include self-conscious processes of adjust-
As the preceding analysis suggests, there are diferent levels ment for a projected audience or readership. However, they
of commonality for the components and operations of any rule more importantly involve unself-conscious sensitivity to pattern-
system. he main levels would be as follows: 1) universal, 2) ing, suggestiveness (see dhvani and rasa and suggestion
338
Generative Poetics Generative Semantics
structure), and other complex features of the work. here are WORKS CITED AND SUGGESTIONS FOR FURTHER READING
obviously diferences in the nature of these rules and the way Hogan, Patrick Colm. 2006. Narrative universals, heroic tragi-comedy,
they operate in writing and oral composition. and Shakespeares political ambivalence. College Literature
A brief example from Shakespeare should help to clarify these 33.1: 3466.
points. Like any other author, Shakespeare had a diverse set of Pavel, homas G. 1985. he Poetics of Plot: he Case of English Renaissance
principles that constituted his generative poetic system. hese Drama. Minneapolis: University of Minnesota Press.
principles were multiple and stood in complex relations with one Prince, Gerald. 1973. A Grammar of Stories. he Hague: Mouton.
another. he relations are probably best understood roughly in
terms of connectionism. hey were linked to one another with
GENERATIVE SEMANTICS
diferent degrees of strength. Diferent principles were activated
at diferent times and in diferent degrees, depending on what Generative semantics (GS) began as an orthodox development
Shakespeare was reading, experiencing in his personal life, and within the standard theory (ST) of the transformational-
so forth. hese diferent activations would sometimes produce generative grammar (TGG) framework developed by Noam
very diferent cascades of activation within the system, leading Chomsky and his collaborators in the late 1950s and early 1960s.
to diferent products. Its most prominent voices include one of those collaborators,
Obviously, we can never have a sense of all these particulars. Paul Postal, two of Chomskys students, James D. McCawley and
However, the particulars are not without patterns. For example, Hj (John Robert) Ross, and others in the general circle, chiely
Shakespeare largely conined himself to the universal narrative George Lakof and Robin Tolmach Lakof. While the developers,
prototypes. Moreover, there are patterns to his development and many onlookers, appeared to regard GS as a natural contin-
of those prototypes cultural patterns (relating, for example, uation of ST, Chomsky clearly did not; antagonisms soon arose.
to character types of Renaissance English drama) and individ- GS had considerably more adherents for several years, but within
ual patterns, patterns that are more distinctive of Shakespeare a decade the situation had reversed, with more linguists adopt-
himself. One standard development principle, a function of the ing Chomskys extended standard theory, or EST. GS rapidly
maximization of relevance (see the essay Elaborating Speech disintegrated, leaving many proposals of lasting interest.
and Writing: Verbal Art in this volume), is alignment, where GS was a highly streamlined TGG model, with only two rep-
the author parallels diferent levels of the narrative world (e.g., resentations, meaning and form the irst a syntactic structure
presenting society and nature as in simultaneous turmoil). comprised of semantic primes and heavily inluenced by
Shakespeare sometimes intensiies such parallelism, extending symbolic logic, the second a phonetically completed syntac-
it to three or four levels for example, mental health and family tic structure linked by a set of transformations, governed by
relations, along with society and nature. he most famous case a set of derivational conditions called global rules. While the
of such alignment in Shakespeare is in King Lear, when Lears generative role of semantics was both originary and titular, it
madness is paralleled with the division of his family, strife in his quickly became peripheral to GS, as proponents began tackling
kingdom, and a terrible storm. a wide range of issues and phenomena previously discounted
here are other, less common, principles employed by or unnoticed within TGG. For instance, grammaticality had
Shakespeare as well. Some are localized. In several cases, for been deined exclusively in terms of a speciic grammar, but
example, he represents rebels as having suicidal thoughts, even GSers regarded it as a psychosocial notion, relative to language
when their rebellion seems entirely justiied. Others range more users and their contexts. Similarly, lexical categories had been
broadly across a work. For instance, a number of Shakespeares assumed to be discrete, but GSers explored categorization
development principles serve to foster ambivalence toward vari- in fuzzy and context-sensitive terms resonant with prototype
ous actions and characters, complicating our sense of who is right theory, developments which have been inluential in cogni-
and who is wrong in particular conlicts. hus, we sympathize with tive grammmar (e.g., Ross 1973; Lakof 1972). GSers also
Hamlet, knowing that Claudius is a murderer. But Shakespeare brought performative analyses into TGG, typically positing a
makes Hamlet, too, commit murder, if in somewhat more ambig- deletable hypersentence carrying the illocutionary force,
uous circumstances. (See Hogan 2006 on this and other cases.) such as I warn you, into which a locution like Dont make a move
Without access to Shakespeares drafts and revisions, it is dif- was embedded (Sadock 1969). his research helped bring prag-
icult to isolate his evaluation rules. However, we might guess matics into formal linguistics.
that they involved systematizing the execution of such devel- he legacy of GS is extensive, albeit notoriously unacknowl-
opment principles (e.g., enhancing ambivalence through such edged (Postal 1988). Frederick J. Newmeyer ofers this partial but
means as the addition of humanizing speeches for otherwise vil- signiicant catalog of topics introduced to TGG by GS: capturing
lainous characters). semantic regularities in a representation utilizing symbolic logic;
Generative poetics is certainly not the only way of theoriz- the early exploration of phenomena that led to the development
ing literature. However, it is highly promising, when dissociated of mechanisms like indexing devices (see indexicals), traces,
from particular linguistic theories and tied instead to a broader and filters; lexical decomposition; and several speciic propos-
understanding of rule systems in a cognitive context. As such, it als, such as the nonexistence of extrinsic rule ordering, postcy-
has great potential for helping us to understand both the univer- clic lexical insertion, and treating anaphoric pronouns as bound
sal patterns of verbal art and the speciicity of particular works. variables (1986, 138; see anaphora).
Patrick Colm Hogan Randy Allen Harris
339
Generic- and Speciic-Level Metaphors
WORKS CITED AND SUGGESTIONS FOR FURTHER READING to but also in their internal structure. Speciic-level metaphors
Harris, Randy Allen. 1993. he Linguistics Wars. New York: Oxford
are held to involve a set of speciic mappings between a source
University Press. domain and a target domain (see source and target). In the
Huck, Geofrey J., and John A. Goldsmith. 1995. Ideology and metaphor LOVE IS A JOURNEY, abstracted from conventional
Linguistic heory: Noam Chomsky and the Deep Structure Debates. expressions, such as our relationship has hit a dead-end street,
London: Routledge. look how far weve come, or the relationship isnt going anywhere
Lakof, George. 1972. Hedges: A study in meaning criteria and the logic (Lakof 1993, 206), the lovers correspond to the travelers, the dis-
of fuzzy concepts. Proceedings of the Chicago Linguistics Society tance traveled corresponds to the duration of the relationship,
8: 183228. and so on. However, generic-level metaphors do not involve
Newmeyer, Frederick J. 1986. Linguistic heory in America. 2d ed. New such a ixed set of correspondences. For example, the metaphor
EVENTS ARE ACTIONS does not specify the events that can be
Postal, Paul. 1988. Advances in linguistic rhetoric. Natural Language
metaphorically understood as actions or the actions that can be
and Linguistic heory 6: 12937.
Ross, J. R. 1973. Nouniness. In hree Dimensions of Linguistic heory,
used as metaphor vehicles, and, consequently, it does not spec-
ed. O. Fujimura, 137258. Tokyo: TEC. ify any particular correspondences between such events and
Sadock J. 1969. Hypersentences. Paper in Linguistics 1: 283371. actions.
he idea of generic-level metaphors was a signiicant turn-
GENERIC- AND SPECIFIC-LEVEL METAPHORS ing point in the development of conceptual metaphor theory
because it marked a step away from the extrapolation of hypoth-
The distinction between generic-level and specific-level met- esized conceptual metaphors from linguistic data toward the
aphors was introduced in conceptual metaphor theory postulation of more abstract schemas with loose relations to
(Lakoff 1993; Lakoff and Turner 1989). It identifies hierar- observable linguistic metaphors (e.g., Lakof and Johnson 1999).
chical relations between metaphorical concepts that are Whereas in earlier work (Lakof and Johnson 1980), conceptual
hypothesised to be used in the understanding of figurative metaphors were treated as generalizations over metaphors
language. in communication on the part of the analyst and, presum-
Examples of speciic-level metaphors are LOVE IS A JOUR ably, the speaker, the direction of reasoning was reversed with
NEY, PEOPLE ARE PLANTS, or DEATH IS A THIEF. Examples respect to generic-level metaphors: hese were explicitly held
of generic-level metaphors are EVENTS ARE ACTIONS and to constrain the particularities of metaphor in language and
GENERIC IS SPECIFIC. he motivation for introducing this dis- communication.
tinction was the observation that some topics are commonly he idea of generic-level metaphors has attracted some debate.
talked about using a small group of metaphors that share impor- One contention has been that the relation between actions and
tant characteristics. George Lakof and Mark Turner (1989), in an events (EVENTS ARE ACTIONS) or between a speciic instance
analysis of poetic language, noticed that time was metaphorically and a general phenomenon (GENERIC IS SPECIFIC) is not itself
personiied as a destroyer, a thief, or a devourer, as in Miltons metaphoric (Jackendof and Aaron 1991; Stern 2000). Also, it
line time, the subtle thief of youth (cited on p. 35). While such remains an open question whether abstract schemas such as
personiications might be common and easily understood, per- generic-level metaphors appropriately model metaphor under-
soniications of time such as, say, a child or as a shop clerk do standing (e.g., Murphy 1996) or whether they are better regarded
not occur in the analyzed poems. he reason for this, Lakof and as descriptive tools for the analyst. Some current cognitive-lin-
Turner argued, is that metaphorical understanding in terms of guistic approaches to metaphor are more concerned with the
relatively speciic ideas (such as thieves or shop clerks) is con- development of metaphoric ideas in discourse communities (e.g.,
strained by generic-level metaphors, in this case EVENTS ARE Musolf 2004).
ACTIONS. Generic-level metaphors ensure that topics are meta- he inluence of the idea of generic-level metaphors can be
phorically conceptualized using vehicles that share relevant seen in two developments. he notion of primary metaphors
generic structure with the topic. In the example from Milton, the (e.g., Grady and Johnson 2003) takes up the idea that metaphors
event of middle age involves, among other things, the absence in verbal communication might be constrained by other met-
of a previously experienced youth. Actions involving thieves as aphoric schemas that are not themselves manifest in language.
agents typically result in the absence of previously held posses- he notion of a generic space in conceptual blending the-
sions. Evidently, the two share generic structure, which licenses ory preserves the idea of a cognitive structure that identiies the
the personiication of time as a thief via the events are actions shared generic structure of arguments brought together in a
metaphor. Typical activities involving shop clerks as agents, on blended space, while abandoning the notion that such generic
the other hand, might be more diicult to relate to the experience structure is itself metaphoric.
of middle age: Such activities do not share generic structure with
the onset of middle age. his is why, theoretically, the EVENTS Jrg Zinken
ARE ACTIONS metaphor should prevent poets from attempting
the line time, the subtle shop clerk of youth. he generic-level WORKS CITED AND SUGGESTIONS FOR FURTHER READING
metaphor EVENTS ARE ACTIONS constrains the range of viable Grady, Joseph, and Christopher Johnson. 2003. Converging evidence
speciic-level personiication metaphors. for the notions of subscene and primary scene. In Metaphor and
Generic-level metaphors difer from speciic-level metaphors Metonymy in Comparison and Contrast, ed. R. Dirven and R. Prings,
not only in the generality of the topics and vehicles they apply 53354. Berlin: Mouton de Gruyter.
340
Genes and Language
Jackendof, Ray, and David Aaron. 1991. Review of More han Cool found (Fisher, Lai, et al. 2003). Speech and language disorders
Reason: A Field Guide to Poetic Metaphor, by G. Lakof and M. Turner. are repeatedly observed to cluster in families, and twin studies
Language 67: 32038. indicate that they are highly heritable (Bishop 2001). In recent
Lakof, George. 1993. he contemporary theory of metaphor. In years, geneticists have successfully located chromosomal sites
Metaphor and hought, ed. A. Ortony, 20251. Cambridge: Cambridge
within the human genome that are likely to harbor genetic risk
University Press.
factors involved in developmental language disorders (e.g., he
Lakof, George, and Mark Johnson. 1980. Metaphors We Live By.
SLI Consortium 2002). Moreover, they have even been able to
. 1999. Philosophy in the Flesh: he Embodied Mind and Its Challenge zero in on a speciic gene, FOXP2, that is implicated in one par-
to Western hought. New York: Basic Books. ticular disorder afecting speech and language (Lai, Fisher, et al.
Lakof, George, and Mark Turner. 1989. More han Cool Reason: A Field 2001).
Guide to Poetic Metaphor. Chicago: University of Chicago Press.
Murphy, Gregory. 1996. On metaphoric representation. Cognition FOXP2: What Is It and How Was It Discovered?
60: 173204. he FOXP2 gene, found on human chromosome 7, codes for a
Musolf, Andreas. 2004. Metaphor and conceptual evolu- special type of regulatory protein, technically known as a fork-
tion. Metaphorik.de 7: 5575. Available online at: http://www. head-box (or FOX) transcription factor. his class of proteins
metaphorik. de.
helps govern when and where genes are expressed (switched on
Stern, Joseph. 2000. Metaphor in Context. Cambridge, MA: MIT Press.
and of) during embryogenesis, in postnatal development, and
in the mature organism (Lehmann, Sowden, et al. 2003). Each
FOX protein contains a special structure, called a forkhead-box
GENES AND LANGUAGE
domain, which enables it to bind to the DNA of a target gene and
Why Should One Expect Genes to Play an Important Role in afect how much of the product of that target gene is made in
Language? the cell. Transcription factors like these may afect many down-
Unlike ofspring of any other species, ordinary human children stream targets in chorus and, thus, represent central components
routinely acquire complex language, characterized by open- of gene regulatory networks that are important for implementing
ended vocabularies and productive syntax. his cannot be a developmental programs, allowing cells to respond to signals,
result of input alone because juveniles of other closely related and so on.
primate species, such as chimpanzees, do not develop human- he discovery of the human FOXP2 gene originated in stud-
like languages even with extensive tutelage (Terrace, Petitto, ies of a large three-generational family (the KE family) sufering
et al. 1980). From the perspective of biology, the question is not from a rare form of speech and language impairment (Hurst,
whether genes play roles in language but how (Fisher and Marcus Baraitser, et al. 1990; Gopnik and Crago 1991). he disorder is
2006). characterized primarily by severe diiculties in the learning and
Clearly, the words and grammar that are speciic to any par- production of sequences of mouth movements that are neces-
ticular language are learned through exposure to appropriate sary for luent speech, usually referred to as developmental ver-
models. Nevertheless, the peculiar human capacity to acquire bal dyspraxia or childhood apraxia of speech (Vargha-Khadem,
and use language depends on a rich mixture of neural systems Watkins, et al. 1998). Afected individuals simultaneously dis-
that must be biologically constrained. hese include mecha- play problems in a wide range of language-related abilities,
nisms that need to simultaneously coordinate syntactic, in both oral and written domains, with impact on receptive as
semantic, phonological, and pragmatic representations well as expressive skills (Watkins, Dronkers, et al. 2002; Vargha-
with one another, with motor and sensory systems, and with Khadem, Gadian, et al. 2005). All 15 of the afected people in the
both speakers and listeners knowledge of the world. he func- KE family have inherited a mutation altering a single nucleotide
tional properties of the relevant neural systems are largely deter- letter in the DNA code of the FOXP2 gene (Lai, Fisher, et al. 2001).
mined by the cellular architecture of the human brain, which his change afects the structure of the encoded FOXP2 protein
is itself the product of ongoing interactions between genes and and prevents it from functioning properly (Vernes, Nicod, et al.
the environment (Marcus 2004). Genes contribute to the birth, 2006).
migration, diferentiation, patterning, and connectivity of neu- Although the mutation in question is private to the KE family,
rons during embryogenesis and development, and they con- diferent mutations disrupting FOXP2 function have been dis-
tinue to contribute to online functions in the mature brain, for covered in other families, showing comparable problems with
example, by mediating changes in the strengths of connections speech and language acquisition (MacDermot, Bonora, et al.
between neurons. It is likely that hundreds or even thousands of 2005). In all cases identiied thus far, the mutations have been
genes participate in the development and maintenance of the heterozygous; that is, people with the disorder have a muta-
neural systems that underlie language, some in ways that may tion in only one copy of FOXP2, while the other copy is intact.
be tailored to linguistic functions, others (like housekeeping (Humans are diploid organisms, carrying two copies of every
genes that govern the basic metabolic processes of all cells) that gene, one inherited from the father, the other from the mother,
clearly are not. with a few exceptions such as the genes on the sex chromo-
As yet, it is unknown how many of the genes in the human somes.) he consistent observations of heterozygosity in difer-
genome are closely tied to language, but studies of developmen- ent cases of FOXP2-related disorder suggest that afected people
tal syndromes that primarily disrupt speech and/or language have reduced amounts of working FOXP2 protein in brain cir-
skills give strong reason to believe that such genes are there to be cuits that are important for speech and language. herefore, the
341
Genes and Language
amount (dosage) of FOXP2 may be a critical factor in the proper Genetic studies of typical forms of speciic language impair-
development of speech and language skills. ment have identiied other genomic regions that are likely to be
relevant to language, and researchers are focusing consider-
Does That Make FOXP2 the Language Gene? able attention on those chromosomal sites in the hope of pin-
No. Although studies of people carrying damaged versions of ning down particular genes (e.g., he SLI Consortium 2002). For
FOXP2 are consistent with a role (or roles) in the development developmental dyslexia, a disorder primarily characterized by
and/or processing of language, it is already apparent from reading disability, but underpinned by subtle persistent deicits
genetic studies that no single gene is exclusively responsible for in language processing, it has been possible to home in on sev-
this distinctive human trait. Indeed, FOXP2 is implicated (thus eral candidate genes (DYX1C1, KIAA0319, DCDC2, and ROBO1).
far) only in one rare form of disorder, and not mutated in people hese genes difer from FOXP2 in that there have been no spe-
diagnosed with more common variants of specific language ciic causal mutations identiied instead, it is thought that the
impairment (SLI) (Newbury, Bonora, et al. 2002). Instead, most increased risk of dyslexia stems from as-yet unknown variants in
developmental language disorders are likely to be multifacto- the regulatory parts of those genes that govern their expression
rial: the product of multiple genetic risk factors, their interactions (Fisher 2006). Nevertheless, there are some striking parallels
with one another and interactions with the environment (Fisher, with FOXP2; each of the dyslexia candidate genes shows wide-
Lai, et al. 2003). It is also worth noting that mutation of FOXP2 spread expression patterns in multiple circuits in the brain, and
impairs not only aspects of speech and language but also aspects each is active in additional tissues, not only the brain. None of
of nonlinguistic orofacial motor control (Watkins, Dronkers, et the genes is unique to humans; for example, highly similar ver-
al. 2002; Vargha-Khadem, Gadian, et al. 2005). sions of each are found both in other primates and in rodents. At
More broadly, given what we know about the fundamen- this stage, there is little understanding of why alterations in these
tals of genetics, developmental biology, and neuroscience, it is genes should have relatively speciic efects on reading abilities,
highly unlikely that there is a single human-speciic gene whose although their basic neurobiological functions are beginning to
sole purpose is to endow our species with the capacity to acquire be deined; three of the genes (DYX1C1, KIAA0319, and DCDC2)
language. Individual genes do not specify particular behavioral have been linked to neuronal migration, and the fourth (ROBO1)
outputs or aspects of cognitive function. Rather, they contain the codes for a receptor protein involved in signal transduction,
codes for assembling individual molecules that act in a highly which helps regulate axon/dendrite guidance.
interactive fashion with other molecules in order to build and At the time of writing, there are indications that alterations
maintain a working human brain (Marcus 2004). Often, a gene in gene dosage may emerge as a general theme underlying overt
will have a primary function that is very clearly deined at the speech/language deicits. For example, it has been recently
cellular level for example, by encoding an enzyme, structural shown that duplications of a speciic region on chromosome
protein, ion channel, signaling molecule, or receptor but the 7 (far away from the site of the FOXP2 gene) can cause speech
pathways that link the gene to higher-order brain function will deicits (Somerville, Mervis, et al. 2005). What is especially inter-
nevertheless be complex, indirect, and diicult to disentangle esting about this inding is the fact that the relevant part of chro-
(Fisher 2006). mosome 7, which contains several diferent genes, corresponds
he language gene shorthand is also misleading because to the region that is most commonly deleted in cases of Williams
most, if not all, of the genes that are involved in language are syndrome, a well-studied disorder in which language skills can be
likely to play other roles, elsewhere in the brain and/or in other relatively well preserved as compared to other abilities. In other
tissues of the body. he expression of FOXP2 is not conined to words, while deletion of that part of chromosome 7 (i.e., reduced
classical language-related regions of the cortex, or even to the gene dosage) tends to spare language, duplication of this same
brain. Instead, it extends to additional brain structures, such set of genes (increased gene dosage) leads to speech disruptions.
as the basal ganglia, thalamus, and cerebellum (Lai, Similarly, there is evidence to suggest that the number of func-
Gerrelli, et al. 2003), and to other parts of the body (e.g., the lungs tional copies of a chromosome 22 gene called SHANK3, recently
[Shu, Yang et al. 2001]); it also has close counterparts in all verte- implicated in autism spectrum disorders, may be critical for
brates, as discussed in the section on evolution. In sum, FOXP2 speech development (Durand, Betancur, et al. 2007). Language,
can properly be a called a gene that participates in language like many aspects of biology, is likely to depend on a precise bal-
but not the language gene or even a gene that participates ance among many diferent molecules.
exclusively in language.
What Can Genes Tell Us about the Evolution of Language?
How Representative Is FOXP2? Are Other Genes Involved in Genes, like species, are the product of the process that Darwin
Language Likely to Act in Similar Ways? called descent with modiication. Each gene has an evolution-
It is diicult to say for sure; thus far, the FOXP2 gene represents ary history, with its current function a modiication of earlier
the only known example where point mutations have been linked functions. To the extent that the language system is the product
to a developmental disorder which primarily afects speech and of descent with modiication, most genes that are associated with
language. However, since disruptions of FOXP2 are found in only language can be expected to have counterparts in nonlinguistic
a very small subset of people with language-related disorders species. As such, comparisons of gene sequences and expression
(Newbury, Bonora, et al. 2002; MacDermot, Bonora, et al. 2005), patterns in diferent species can help cast light on language evo-
it is clear that there must be other genetic efects that remain to lution, identifying which of the relevant neurogenetic pathways
be discovered. are shared with other species and which have been modiied
342
Genes and Language
on the lineage that led to modern humans (Fisher and Marcus data that will soon emerge, both in terms of sheer data analysis
2006). and in terms of relating these data to linguistic functions.
FOXP2 again appears representative in this regard. Following Another exciting prospect is the use of genetic manipula-
the discovery of the gene, molecular studies have shown that it is tion in order to ind out more about the functions of genes that
present in similar form in many vertebrates, including mammals, are involved language, for example, by examining the function
birds, reptiles, and ish, where it is expressed in corresponding of nonhuman counterparts to those genes (White, Fisher et al.
regions of the brain to those observed in humans (reviewed by 2006). In this way, individual genes may provide the irst molecu-
Vargha-Khadem, Gadian, et al. 2005; Fisher and Marcus 2006). lar entry points into neural pathways involved in human commu-
On the basis of such data, it appears that FOXP2 is evolutionarily nication, and a direct way to understand how the twin processes
ancient, shared by many vertebrate species, regardless of speech of descent and modiication led to the remarkable and uniquely
and language ability, where it may have conserved functions in human faculty for complex language.
brain circuits involved in sensorimotor integration and motor-
Gary F. Marcus and Simon E. Fisher
skill learning (Fisher and Marcus 2006). For example, the stria-
tum in the basal ganglia is a conserved site of high FOXP2
expression, which shows reduced gray matter density in humans
carrying FOXP2 mutations (Vargha-Khadem, Gadian, et al. Bishop, D. V. 2001. Genetic and environmental risks for speciic lan-
2005). It is intriguing that in songbirds, changes in expression of guage impairment in children. Philos Trans R Soc Lond B Biol Sci
the gene in striatal Area X a key nucleus of the brain system 356.1407: 36980.
Durand, C. M., C. Betancur, et al. 2007. Mutations in the gene encoding
involved in song learning appear to relate to alterations in vocal
the synaptic scafolding protein SHANK3 are associated with autism
plasticity (see White, Fisher, et al. 2006).
spectrum disorders. Nat Genet 39.1: 257.
Despite such notable conservations across distantly related Enard, W., M. Przeworski, et al. 2002. Molecular evolution of FOXP2, a
species, a comparison of the locus in diferent primates has gene involved in speech and language. Nature 418.6900: 86972.
demonstrated that there was accelerated change in the FOXP2 Fisher, S. E. 2006. Tangled webs: Tracing the connections between genes
protein sequence during human evolution, most likely due to and cognition. Cognition 101.2: 27097.
positive selection (Enard, Przeworski, et al. 2002). Mathematical Fisher, S. E., C. S. Lai, et al. 2003. Deciphering the genetic basis of speech
analyses of genomic sequences from diverse human populations and language disorders. Annu Rev Neurosci 26: 5780.
suggest that the version of FOXP2 now ubiquitous in modern Fisher, S. E., and G. F. Marcus 2006. he eloquent ape: Genes, brains and
humans arose within the last 200,000 years, concordant with sev- the evolution of language. Nature Reviews Genetics 7: 920.
eral archaeological estimates of the time of emergence of pro- Gopnik, M., and M. B. Crago 1991. Familial aggregation of a develop-
mental language disorder. Cognition 39.1: 150.
icient spoken language (ibid.). We may never know for certain
Hurst, J. A., M. Baraitser, et al. 1990. An extended family with a domi-
why these modiications spread throughout the population, but
nantly inherited speech disorder. Dev Med Child Neurol 32.4: 3525.
it seems plausible that they proliferated due to some advantage Lai, C. S., S. E. Fisher, et al. 2001. A forkhead-domain gene is mutated in
inherent in enhanced vocal communication, perhaps achieved a severe speech and language disorder. Nature 413.6855: 51923.
through modiication of pathways already involved in motor- Lai, C. S., D. Gerrelli, et al. 2003. FOXP2 expression during brain devel-
skill learning. Still, this does not mean that changes in FOXP2 opment coincides with adult sites of pathology in a severe speech and
were the sole reason for the appearance of speech and language, language disorder. Brain 126 (Part 11): 245562.
even if they did represent an important factor in the evolution of Lehmann, O. J., J. C. Sowden, et al. 2003. Foxs in development and dis-
human communication. ease. Trends Genet 19.6: 33944.
MacDermot, K. D., E. Bonora, et al. 2005. Identiication of FOXP2 trun-
Whats on the Horizon? cation as a novel cause of developmental speech and language dei-
cits. Am J Hum Genet 76.6: 107480.
Technological advances provide one reason for optimism.
Marcus, G. F. 2004. he Birth of the Mind: How a Tiny Number of Genes
Techniques for characterizing genes and genomes are quickly
Creates the Complexities of Human hought. New York, Basic Books.
becoming more rapid, cost-efective, and eicient which can Newbury, D. F., E. Bonora, et al. 2002. FOXP2 is not a major susceptibil-
only speed up ongoing searches for genes involved in speech and ity gene for autism or speciic language impairment. Am J Hum Genet
language. For example, it has recently become possible to simul- 70.5: 131827.
taneously screen hundreds of thousands of genetic markers in Shu, W., H. Yang, et al. 2001. Characterization of a new subfamily of
people with a disorder of interest, and compare the data to those winged-helix/forkhead (Fox) genes that are expressed in the lung and
obtained from a control set of unafected individuals. Given an act as transcriptional repressors. J Biol Chem 276.29: 2748897.
adequate sample size, this kind of approach could uncover subtle SLI Consortium, he. 2002. A genomewide scan identiies two novel
genetic diferences that are correlated with developmental lan- loci involved in speciic language impairment. Am J Hum Genet
70.2: 38498.
guage disorders. Before long, it will even be feasible to sequence
Somerville, M. J., C. B. Mervis, et al. 2005. Severe expressive-language
the entire genome of every person participating in a study. We
delay related to duplication of the Williams-Beuren locus. N Engl J
might also expect to see developments in the ways we can image
Med 353.16: 1694701.
gene expression patterns in the human brain, with the hope that Terrace, H. S., L. A. Petitto, et al. 1980. On the grammatical capacity
it may one day be possible to observe on-line changes in gene of apes. In Childrens Language 2, ed. K. E. Nelson, 371495. New
expression in neural circuits during language processing. Still, it York: Gardner.
is also clear that we will need major conceptual advances in order Vargha-Khadem, F., D. G. Gadian, et al. 2005. FOXP2 and the neuro-
to make sense of the vast quantities of sequence and expression anatomy of speech and language. Nat Rev Neurosci 6.2: 1318.
343
Gesture
Vargha-Khadem, F., K. E. Watkins, et al. 1998. Neural basis of an

inherited speech and language disorder. Proc Natl Acad Sci U S A
95.21: 12695700.
Vernes, S. C., J. Nicod, et al. 2006. Functional genetic analysis of muta-
tions implicated in a human speech and language disorder. Hum Mol
Genet 15.21: 315467.
Watkins, K. E., N. F. Dronkers, et al. 2002. Behavioural analysis of an
inherited speech and language disorder: Comparison with acquired
aphasia. Brain 125 (Part 3): 45264.
White, S. A., S. E. Fisher, et al. 2006. Singing mice, songbirds, and
more: Models for FOXP2 function and dysfunction in human speech
and language. J Neurosci 26.41: 103769.
GESTURE
no ideas, just irritable mental gestures.

(remark attr. to Lionel Trilling,
New York Times, June 21, 2006, A1)
What Is Gesture?
Lionel Trilling, in this non-motto, invokes an all-too-common
view of gesture. he very phrase hand waving suggests triviality.
But let us imagine Trillings own gesture. It would have been (we
can predict) what Cornelia Mller has called the palm up open Figure 1. Gesture combining entity, upward movement, and interiority in
hand (PUOH), the hand seeming to hold a discursive object, one symbol. (Computer art by Fey Parrill.)
holding, in fact, Trillings view. hese kinds of gestures have been
linked to the conduit metaphor the metaphor whereby language Table 1. Gesture-speech binding resists interruption
or cognition is a container holding some content.
he PUOH is also one of a species of gesture termed by Kendon Domain Phenomenon
gesticulation, one of several kinds of gesture he distinguished Delayed auditory feedback Does not disrupt speech-gesture
and that I arranged on Kendons Continuum: synchrony.
Gesticulation Speech-Linked Pantomime Emblems Stuttering Gesture stroke onsets resist stuttering;
Sign Language stuttering cancels ongoing strokes.
Even though gesticulation is only one point on the continuum, Blindness Gestures occur when speaking to other
blind known as such.
in storytelling, living-space descriptions, academic discourse
(including prepared lectures), and conversations, gesticulation is Fluency Speech and gesture are complex or
the overwhelming gesture type 99+ percent of all gestures and simple in tandem.
it is the gesture ofering the greatest penetration into language Information exchange Information seen in gesture recalled as
itself. As one moves from gesticulation to sign language, two speech, and vice versa,
reciprocal changes take place. First, the degree to which speech
is an obligatory accompaniment of gesture decreases. Second, that are unalike, and this combination of unalikes occupies the
the degree to which gesture shows the properties of a language same psychological instant a fact of importance for creating an
increases. Gesticulations are obligatorily accompanied by speech imagerylanguage dialectic. I use gesture, rather than gesticula-
but have properties unlike language. Speech-linked gestures, tion, in the remainder of this entry.
such as the parents were ine but the kids were [inger across he gesture-irst theory of language origin holds that the irst
throat], are also obligatorily performed with speech but relate to form of language consisted largely of gestures, to be later sup-
speech as a linguistic segment sequentially, rather than concur- planted by speech an idea going back to tienne de Condillac in
rently, and in a speciic linguistic slot (standing in for the com- the eighteenth century. Gesture-irst has attracted much interest in
plement of the verb, for example). Pantomime, or dumb show, recent years. A diiculty, however, is that it predicts the wrong
by deinition is not accompanied by speech. Emblems such as gestures. he initial gestures would have been speechless panto-
the OK sign have independent status as symbolic forms. Signs mimes, nonverbal actions with narrative potential, but not the ges-
in American Sign Language (ASL) and other sign languages are ticulations that pose dialectic oppositions to language at the far end
not accompanied by speech and while simultaneously speaking of Kendons Continuum. Pantomime may indeed have been pre-
and signing is possible for ASL-English bilinguals, this is not typi- sent but, if so, did not lead to the evolution of speech and gesture
cal, and the languages themselves have the essential properties units (growth points). Such units would likely have had their own
of all languages. adaptive value. An implication is that diferent evolutionary tra-
Clearly, therefore, speech and gesticulations (but not the jectories landed at diferent points along the continuum, relected
other points along Kendons Continuum) combine properties today in diferent forms and timing patterns with speech.
344
Gesture
1 2740 2 2769
so he gets a / hold of a big [oak tree / and he
3 2780 4 2828
bends it way ba ck]

Figure 2. Phases of a gesture timed with and he bends it way back. The insert is a frame counter (1 frame = 1/30
sec.). The total elapsed time is about 1.5 seconds. Panel 1: Preparation. Panel 2: A prestroke hold while saying
he. Panel 3: Middle of stroke bends it way ba(ck). Panel 4: End of stroke and beginning of the poststroke
hold in the middle of back.
Simultaneous Semiotic Modes (meaning determination moves from the whole to the parts, not
Figure 1 illustrates one gesture and how it is simultaneous with from the parts to the whole). he efect is a uniquely gestural way
coexpressive speech. he example is taken from the narration of packaging meaning something like rising hollowness, which
of a cartoon story (the speaker had just watched the cartoon does not exist as a semantic package in the lexicon of English at
and was recounting it from memory to a listener; instructions all. hus, speech and gesture, at the moment of their synchroniza-
emphasized that the task was storytelling without mention of tion, were coexpressive but nonredundant, and this sets the stage
gesture). he speaker was describing an event in which one for doing one thing (conception of the cats climbing up inside the
character (Sylvester) attempted to reach another character pipe) in two forms analytic/combinatoric and global/synthetic.
(Tweety) by climbing up a drainpipe conveniently attached
next to the window where Tweety was perched. He entered the Properties of Gestures
pipe and traversed it on the inside adding stealth to his efort. THE UNBREAKABLE SPEECH-GESTURE BOND. Synchronized
he speaker said and he goes up thrugh the pipe this time speech and gesture comprise virtually unbreakable psycholin-
(the illustration captures the moment at which she is saying the guistic units, unbreakable as long as speech and gesture are
stressed vowel of thrugh). Coexpressively with up her hand coexpressive. A diverse range of phenomena show the insepa-
rose upward; coexpressively with through her ingers spread rability of the two modes; Table 1 summarizes some of them. In
outward to create an interior space. he upward movement and each case, some disruption to speech-gesture combination is
the opening of the hand took place concurrently, not sequen- resisted; it holds despite the disruption. To break this bond, one
tially, and these movements occurred synchronously with up has to drain the combination of meaning for example, through
through, the linguistic package that carries the same meanings. rote repetition.
he contrastive emphasis on thrugh, highlighting interiority, is
matched by the added complexity of the gesture, the spreading PHASES AND THEIR SIGNIFICANCE. Gesture phases are organized
of the upturned ingers. What makes speech and gesture coex- around the stroke: everything is designed to present it in proper
pressive is this joint highlighting of the ideas of upward motion synchrony with its coexpressive speech segment(s). Figure 2
and interiority. shows all gesture phases except retraction. he full span, from
Note the diferences, too. In speech, meanings are analyzed the beginning of preparation to the end of retraction, brackets
and segregated. Speech divides the event into semantic units a what can be thought of as the lifetime of a speciic idea unit in
directed path (up) plus the idea of interiority (through). Analytic language-geared imagery. We see the image in a state of activa-
segregation further requires that direction and interiority be com- tion that did not exist before and does not exist after this span.
bined in order to obtain the composite meaning of the whole. In he dawn of the idea unit is seen in the beginning of the prepa-
gesture, this composite meaning is fused into one symbol, and ration, and the idea unit itself is the unit formed of the synchro-
the semantic units are simultaneous there is no combination nized coexpressive speech and stroke (called a growth point).
345
Gesture Government and Binding
WHEN DO GESTURES OCCUR? Somewhat surprisingly, the timing gestures relating to idea units. On the contrary, they exhibit con-
of gestures in relation to speech has been the subject of contro- tinuity with ideas, as envisioned by L. Vygotsky.
versy. he question is whether gestures tend to anticipate their
David McNeill
linked linguistic material or coincide with it. he anticipation
view is often accompanied by a further idea that gestures take
place during speech pauses. he synchrony view, clearly, implies
that gestures and speech are co-occurring. When the question Beattie, G. 2003. Visible hought: he New Psychology of Body Language.
is examined with careful attention to the distinction between Hove, UK: Routledge.
preparation and stroke, the facts are clear: he preparation for Goldin-Meadow, S. 2003. Hearing Gesture: How Our Hands Help Us
the gesture precedes the coexpressive linguistic segment (with hink. Cambridge: Harvard University Press.
Kendon, A. 2004. Gesture: Visible Action as Utterance.
a pause or not); the stroke coincides with this segment about
90 percent of the time. Holds ensure that this synchrony is
McNeill, D. 1992. Hand and Mind: What Gestures Reveal about hought.
preserved. Chicago: University of Chicago Press.
McNeill, D.. 2005. Gesture and hought. Chicago: University of Chicago
Discourse and Social Interaction Press.
In addition to fueling idea units, a signiicant intersection of ges- McNeill, D. ed. 2000. Language and Gesture. Cambridge: Cambridge
tures with language is in the construction of discourse and social University Press.
interactions.
he gesture in Figure 1 was the second that this speaker had
performed for Sylvesters ascent of the pipe. In the cartoon, GOVERNMENT AND BINDING
Sylvester attempts to climb the pipe twice, irst on the outside, as
Government and binding (GB) theory, originally developed
a kind of ladder, second on the inside, the version in Figure 1. he
by Noam Chomsky (see Chomsky 1981, 1982), is an approach
outside gesture by this speaker, just before Figure 1, had been free
to the study of the syntax of human languages based on
of pipelike features; it was pure ascent. he Figure 1 gesture thus
abstract underlying representations and transformations
exhibited precisely what, in the immediate context, was distinc-
successively altering those structures. The approach is cen-
tive interiority creating communicative dynamism. Narrators
tered around universal principles argued to be innately rep-
who, due to error, do not mention the outside attempt but only the
resented in the mind and simple parameters , fixed by the
inside ascent tend not to include interiority. he fact of interiority is
language learner from simple evidence, determining how
not suicient; the gesture is sensitive to the distinctiveness of this
languages can differ.
information in the discourse context. Coexpressive speech and
GB theory developed out of Chomskys earlier work in trans-
gesture apparently synchronize at points of high communicative
formational grammar. Like all of his work, it centered on
dynamism (experiments by S. Duncan and D. Loehr are currently
two fundamental questions:
testing this hypothesis).
Gestures also code discourse frames by use of the second (1) What kind of capacity is knowledge of language?
hand. A two-handed gesture can initiate a discourse segment in
(2) How does this capacity arise in the individual?
which one hand depicts events while the other hand maintains
the shape and/or location it had in the launching gesture, and From his earliest work, Chomskys answer to (1) posited a com-
this frames the event in the continuing context. ASL exploits this putational system that provided statements of the basic phrase
device for discursive cohesion. structure patterns of languages (phrase structure rules) and
A further concept provides an empirical route for inding the operations for manipulating these basic phrase structures (trans-
context within which an idea unit is diferentiated. A catchment formations). GB strongly focused on question (2) by positing
comprises multiple gestures with recurring form features and heavier and heavier restrictions on the computational system,
exposes the discourse segment to which a growth point belongs thus limiting the choices available to the learner.
(the use of two hands for discourse frames comprises a catch- An innovation was the development of trace theory, which
ment, but catchments are formed in a wide variety of ways). proposed that when movement transformations operate, they
Catchments ofer a second insight for linguistics: Discourse itself leave behind traces, silent placeholders marking the position
takes on imagery form. from which movement took place, as schematized in (3).
In addition to discourse, gestures are sensitive to the social-
(3) Linguistics, I like t
interactive context of the speakers. Asli zyrek showed that
changing the number and the spatial loci of listeners has an efect Under trace theory, the earlier importance of deep struc-
on the speakers gestural imagery. Janet Bavelas has pioneered ture (the initial representation in the syntactic derivation in the
the study of a class of gestures she terms interactive gestures standard theory) for semantic interpretation is ultimately
whose signiicance lies in the structuring and management of eliminated. It was already known that some aspects of meaning
social interactions without yielding control of the loor. Along (including scope of quantiiers, anaphora, focus) depend on sur-
similar lines, gesture mimicry and joint speakerlistener gesture face structure. Once surface structure is enriched with traces, even
production cement social interactions. In roundtable discus- grammatical relations (subject of, object of, etc.) can be deter-
sions, gestures are parts of turn-taking and speaker dominance. mined at that level of representation. Using the term LF (logical
Gestures with an interactive focus are not discontinuous from form) for the syntactic representation that relates most directly
346
Government and Binding
to semantics and PF (phonetic form) for the one relating most object would fulill a necessary -role (theme in this instance)
directly to phonetics, we have the so-called (inverted) Y-model in determined by the meaning of the verb. Conversely, an intransi-
(4), which was at the core of GB theorizing. tive verb like sleep does not take a direct object since there would
be no -role for it to fulill. hese paired requirements on assign-
(4) D-Structure
ers and recipients of theta roles are called the -criterion: Every

-role must be assigned to one and only one argument, and every
Transformations
argument must receive one and only one -role.

S-Structure

Case Theory
S-structures result from the transformational component oper-
PF LF
ating on D-structures. Given the generality of Move , deriva-
tions often seem to yield ungrammatical sentences. One module
Modularity
reining in this overgeneration by regulating S-structure is case
he GB theory displayed a high degree of modularity. Complex
theory. here are characteristic structural positions that license
phenomena were seen as the result of interactions of simple
particular cases. In many languages (such as Latin, Russian,
modules. he phrase structure module was virtually reduced to
German), these case distinctions are overtly manifested. In
x-bar theory (originally developed in Chomsky [1970] 1972),
English, only pronouns show an overt distinction between nomi-
with speciic instantiations following from properties of particu-
native and accusative, for instance, but Case heory posits that all
lar lexical items. Further, the X-bar schema itself was extended
noun phrases (NPs) have abstract case (henceforth, Case), even
from just lexical categories (noun, verb, adjective, etc.) to func-
when it is not phonologically visible. he requirement that all
tional categories. For example, a sentence came to be analyzed
NPs occur in appropriate Case positions is the Case Filter, a well-
as the projection of an inlectional head, Inl, containing tense
formedness condition on the S-structure level of representation.
and agreement information.
he transformational module is also dramatically simpliied
in comparison with its predecessors. he GB framework replaced Government
the earlier numerous speciic transformations with very general he GB approach always sought regularities and generalizations.
operations, Move (displace any item anywhere), or even Afect he notion government is itself a generalization of the X-bar
(do anything to anything). here is thus very little transforma- theoretic head-complement relation. he basic deinition is as
tional syntax that the child has to learn. A grammar this simple follows:
and general would seem to massively overgenerate, producing (5) A head H governs Y if and only if every XP (highest projec-
countless numbers of unacceptable sentences. To deal with this tion of X) dominating H also dominates Y and conversely.
overgeneration problem, GB theorists, further developing a line [Domination is ancestry in a phrase structure tree
of research begun in the 1960s, posited general constraints on diagram.]
the operation of transformations (locality constraints, especially
subjacency, part of bounding theory) and also conditions By (5), a head governs its complement and also its speciier. Case
on the output of the transformational component (including licensing then is under government, with the governor licens-
filters). ing the governee. A transitive verb governs its direct object NP; a
preposition governs its complement NP; Inl governs its speciier
Parameters (the subject of the clause). hus, a Case-licensing head licenses
he postulated universal (wired-in) parts of the computa- Case on a nominal expression that it governs. For example, a
tional system are called principles. he (limited) ways in which transitive verb, such as prove licenses (accusative) Case on its
languages can difer syntactically are called parameters. he sys- complement direct object.
tem is fundamentally based on principles and parameters.
he child learning a language is preequipped with the principles Types of Movement
and needs only to set the values of the parameters. he standard he transformational module of the theory recognizes three
assumption is that there are few parameters, they are very sim- major subtypes of movement. A-movement is movement to an
ple, and their values can be determined by the child on the basis argument-type position (i.e., an A-position, especially subject
of readily available primary linguistic data. position. (non-A)-movement is movement of an XP (highest
projection of X, for variable X) to a non-A position. he move-
-Theory and the Lexicon ment of an interrogative expression, such as Who in (6) (WH-
he X-bar schema for phrase structure is one module of the movement) is a central exemplar:
theory. he lexicon is another. hese modules determine
(6) Who will they hire t
Dstructure conigurations via the regulation of a third mod-
ule, theta theory. Subcategorization properties follow, in large WH-movement is standardly analyzed as movement to the
measure, from semantic properties. hus, in a sentence with speciier of CP (complementizer phrase), a functional projec-
the verb solve, there is a semantic function for a direct object to tion above IP (inlectional phrase). Both types of movement are
fulill, while there is no such function in the case of sleep. hese regarded as instantiations of one very general operation: Move .
semantic functions that arguments fulill are called thematic he diferences follow from independent properties of the items
()roles. he verb prove demands a direct object since the moved and the positions moved to.
347
Government and Binding
(8) CP (13) *She thinks Mary will solve the problem [with She intended to
| refer to Mary]
C'
(14) Mary thinks she will solve the problem
C IP
The Role of Logical Form
NP I' In the core GB model schematized in (4), LF is not distinct from
| S-structure. However, more and more arguments were put for-
Susan I VP ward that transformational operations of the sort successively
modifying D-structure, ultimately creating S-structure, also
V NP apply to S-structure, creating a distinct LF. (See especially May
| | 1977, 1985.) One such operation, quantiier raising (QR), moves
is a linguist quantiiers from their surface positions to positions more trans-
parently representing their scope, with the traces of the moved
quantiiers ultimately interpreted as variables bound by those
Figure 1. quantiiers. Unlike the transformational operations mentioned
earlier, applications of QR exhibit no phonological displace-
ment. his follows from the organization of the grammar. When
he third major type of movement is head movement, where a transformation operates between D-structure and S-structure,
an X0, a minimal X-bar theoretic element, adjoins to a higher it will have an efect on the phonetic output, since S-structure
head (the very next higher head by the head movement con- feeds into PF. On the other hand, a transformational application
straint). One of the classic analyses of generative grammar was between Sstructure and LF will have no phonetic efect, since
restated in the GB framework in terms of head movement. Pairs LF does not feed into PF.
of sentences like those in (7) are related via movement of the Another covert operation is the analog of overt wh-move-
verb be/is to Inl, followed by movement of Inl to C, schematized ment. Assume that overt WH-movement positions an interroga-
in (8) in Figure 1. tive operator in its natural position for interpretation (with the
(7) a. Susan is a linguist trace it leaves behind in the natural position for a variable bound
b. Is Susan a linguist by the operator). hen in sentences with multiple interrogatives,
such as (15), at the level of LF all are in sentence initial operator
Similar head movement, along with WH-movement, is involved position, as illustrated in (16).
in the derivation of (6).
(15) Where should we put what
Binding (16) what1 [where2 [we should put t1 t2]
he binding part of government and binding theory has as
its core anaphoric relations, circumstances under which one (16) is then rather transparently interpreted as:
expression can or cannot take another as its antecedent, that is, (17) For which object x and which place y, we should put x at y
pick up its reference from the other. In (9), him can take John as
One of the most powerful arguments for covert WH-movement,
its antecedent, while in (10), it cannot.
from Huang (1981/82), involves constraints on movement. For
(9) John said Mary criticized him example, it is diicult to move an interrogative expression out of
an embedded question (a question inside another sentence):
(10) John criticized him
(18) *Why1 do you wonder [what2 [John bought t2 t1]]
hat is, (10) has no reading corresponding to that of (11), with the
pronoun him replaced by the anaphor himself (see anaphora). If (18) were acceptable, it would mean What is the reason such
that you wonder what John bought for that reason. In languages
(11) John criticized himself
where wh-phrases are in situ (unmoved) at S-structure, such as
A pronoun cannot have an antecedent that is too close to it. Chinese, their interpretation apparently obeys this same con-
his is Condition B of the binding theory. Conversely, an anaphor straint. So, in Chinese, an example like (19) is possible but one
requires an antecedent quite close to it (Condition A). Compare like (20) is impossible.
(11) with (12).
(19) ni renwei [ta weisheme bu lai]
(12) *John said Mary criticized himself you think he why not come
LF: [weisheme1 [ni renwei [ta t1 bu lai]
he pertinent locality is, roughly, being in the same clause
Why do you think he didnt come?
(though in certain instances a more complicated notion involv-
ing government is implicated, hence, Chomskys name govern- (20) *ni xiang-zhidao [Lisi weisheme mai-le sheme]
ing category for the relevant domain). you wonder Lisi why bought what
A third binding condition (Condition C) excludes an ana- LF: [weisheme1 [ni xiang-zhidao [Lisi t1 mai-le sheme]
phoric connection between the higher in the tree She and the *What is the reason such that you wonder what Lisi bought for
lower Mary in (13), as contrasted with (14). that reason?
348
Grammaticality Grammaticality Judgments
his argues that even though the weisheme is not phonetically grammaticality judgments ). If grammar is taken in the
displaced, it really is moving; that is why it is obeying move- sense of theory, grammaticality may be a purely binary prop-
ment constraints. But this movement is covert, occurring in erty, depending on the degree and type of formalization of the
the mapping from Sstructure to LF, hence not contributing to theory.
pronunciation. Noam Chomsky (1957, 15) introduces grammaticality in con-
he modular and very restrictive nature of the GB approach of trast to acceptability, illustrating them with the examples in (1).
syntax led to theories that went well beyond descriptive adequacy
(1) a. Colorless green ideas sleep furiously.
toward a high degree of explanatory adequacy, as it drastically
b. *Furiously sleep ideas green colorless.
limited the types and numbers of grammatical rules available to
c. Have you a book on modern music?
the learner. In fact, the success of the approach led Chomsky to
d. *Read you a book on modern music?
formulate a new program, minimalism, that aims to move even
beyond explanatory adequacy. Ungrammaticality in (1) is indicated by a star, following
a convention introduced in the early 1960s. Whereas (1ab)
Howard Lasnik
are diicult to interpret, (1cd) are readily understandable.
Nevertheless, (1a) is grammatical and (1d) not. he acceptability
problem in (1a) is entirely due to semantic incongruence. It can
Chomsky, Noam. [1970] 1972. Remarks on nominalization. In Readings be overcome by extending the meanings of the words in the sen-
in English Transformational Grammar, ed. Roderick A. Jacobs and tence. his is not possible for (1b). In (1d), the meaning is entirely
Peter S. Rosenbaum, 184221. Waltham, MA.: Ginn. Reprinted in transparent, but English grammar does not allow questions to be
Noam Chomsky, Studies on Semantics in Generative Grammar, 1161.
formed in this way.
he Hague: Mouton.
As discussed in detail by Frederick J. Newmeyer (1983),
. 1981. Lectures on Government and Binding. Dordrecht, the
Netherlands: Foris.
grammaticality is a theory-dependent property. he boundary
.1982. Some Concepts and Consequences of the heory of Government between the factors accounted for by the grammar and by other
and Binding. Cambridge, MA: MIT Press. components of knowledge is not given in advance. If a sentence
Chomsky, Noam, and Howard Lasnik. [1993] 1995. he theory of prin- violates a condition of the grammar, it is ungrammatical, but if
ciples and parameters. In Syntax: An International Handbook of it violates, for instance, only semantic or pragmatic condi-
Contemporary Research. Vol. 1. Ed. Joachim Jacobs, Arnim von Stechow, tions, as in (1a), it is grammatical. Grammaticality in relation to
Wolfgang Sternefeld, and heo Vennemann, 50669. Berlin: Walter competence is a matter of degree. hus, Liliane Haegeman (1994,
de Gruyter. Reprinted in Noam Chomsky, he Minimalist Program, 56573) discusses the analysis of contrasts such as (2).
13127. Cambridge, MA: MIT Press.
Haegeman, Liliane. 1994. An Introduction to Government and Binding (2) a. *Whomi do you know [the date [whenj [Mary invited ti tj]]]
heory. 2d ed. Oxford: Blackwell. b. **Whenj do you know [the man [whomi [Mary invited ti tj]]]
Huang, C.-T. James. 1981/82. Move wh in a language without wh-move-
ment. Linguistic Review 1: 369416. Both sentences in (2) are ungrammatical, but (2b) is worse
Lasnik, Howard, and Juan Uriagereka. 1988. A Course in GB Syntax: Lectures than (2a). As this result is fairly robust, it is worth trying to
on Binding and Empty Categories. Cambridge, MA: MIT Press. explain it in terms of grammatical theory. A fully formalized
May, Robert. 1977. he Grammar of Quantiication. Ph.D. diss., grammar will normally partition the set of possible sentences
Massachusetts Institute of Technology. into grammatical and ungrammatical ones, without any inter-
. 1985. Logical Form: Its Structure and Derivation. Cambridge, mediate degrees.
MA: MIT Press.
Webelhuth, Gert, ed. 1995. Government and Binding heory and the Pius ten Hacken
Minimalist Program. Oxford: Basil Blackwell.
GRAMMATICALITY Chomsky, Noam. 1957. Syntactic Structures. he Hague: Mouton.

. 1980. Rules and Representations. New York: Columbia University
A sentence that is well formed according to a given grammar Press.
is grammatical. By analogy, the property can also be assigned Haegeman, Liliane. 1994. Introduction to Government and Binding
to constituents of a sentence, for example, noun phrases. In heory. 2d ed. Oxford: Blackwell.
generative linguistics, the term grammar has been used with Newmeyer, Frederick J. 1983. Grammatical heory: Its Limits and Its
a systematic ambiguity: We must be careful to distinguish the Possibilities, Chicago: University of Chicago Press. Discussion of gram-
grammar, regarded as a structure postulated in the mind, from maticality in Chapter 2.
the linguists grammar, which is an explicit articulated theory
that attempts to express precisely the rules and principles of the
grammar in the mind of the ideal speaker-hearer (Chomsky
GRAMMATICALITY JUDGMENTS
1980, 220). Grammaticality judgments involve explicitly asking speakers
Each sense of grammar corresponds to a sense of gram- whether a particular string of words is a well-formed utterance of
maticality. If grammar is taken in the sense of competence, their language, with an intended interpretation stated or implied.
a component of the speakers mind, grammaticality is a grad- Among the many kinds of data available to linguistics, gram-
ual property that is realized in judgments by the speaker (see maticality judgments are particularly useful in distinguishing
349
Grammaticalization
possible from impossible utterances (the latter conventionally (je) chanterai I will sing (Benveniste 1968) came about when
marked *) among those that are not spontaneously produced. the Old French descendents of Latin cantare habeo I have to
Intermediate degrees of well-formedness may also be of inter- sing were collapsed into a single word: cantar ayo > cantarayo
est (e.g., ? questionable, ?? highly questionable). hese > chanterai. he end product of such changes is typically a para-
judgments can also bring to light knowledge of language in spe- digm in which the verb is decked out with person and number
cial populations whose production and even comprehension aixes that were formerly pronouns and tense, modality, and
may not evince it. Contra widespread belief, however, like all aspect aixes that were once auxiliaries.
performance data they bear on linguistic competence only In recent years, linguists have come to see forms undergoing
indirectly and call for the same attention to methodology as grammaticalization as spreading out into wider contexts and as
other data sources. increasing their pragmatic usefulness (Traugott 1995; Traugott
and Dasher 2002). For example, the Old English ancestor of the
Carson T. Schtze
modal auxiliary can know how to occurred exclusively with
human subjects. Later, the restriction to humans was modiied
GRAMMATICALIZATION to include a wider variety of forms, including inanimates: hese
trees can grow to a height of 100 meters. his widening distribu-
Grammaticalization is a historical process whereby ixed gram- tion presupposes a semantic change from knowledge to ability
matical forms, such as prepositions, conjunctions, suixes, and to possibility. he English going to/gonna construction provides
auxiliaries, and the constructions of which they are part arise out another example. here is a change from the sense of purpose,
of what were previously independent categorial forms in freer as in Shakespeares letters to my friends, And I am going to
arrangements. he negative construction with ne pas in French deliver them (Two Gentlemen of Verona 3.1, 51), that is, I am
provides a good example: pas step, pace was originally a noun on my way to deliver them, to a predictive future tense in which
that functioned to reinforce the ne that supplied the negative neither motion nor purpose is expressed, as in he ice carvings
meaning, as in il ne va pas he doesnt go (originally doesnt are going to melt.
go a step). Nowadays, pas has become grammaticalized as a he changes characteristic of grammaticalization are grad-
general marker of negation, as in il (ne) parle pas he doesnt ual (Lichtenberk 1991) and unidirectional (Haspelmath 1999;
speak the ne being increasingly dropped altogether. As a ield, Hopper and Traugott 2003, 88139). hey evolve in the single
grammaticalization is the study of those linguistic changes that direction of semantic difuseness and increased pragmatic range
result in speciically grammatical forms and construction. he and, often, phonological reduction and grammatical agglutina-
standard history of grammaticalization is Lehmann 1995. tion. he exceptions to this directionality are idiosyncratic and
Grammaticalization involves two aspects: structural change frequently turn out not to be true exceptions (see Hopper and
and semantic change. hese two kinds of change go hand Traugott 2003, 1308 for further discussion).
in hand, and it is impossible to assign priority to either of them. Grammaticalization, while studied principally as a subield
Structural changes include reanalysis and phonological reduc- of the study of change, has implications for general linguistics
tion. In reanalysis, adjacent forms are rebracketed: [I am going] in pointing to the essential luidity of grammar. It suggests that
[to sell my pig] Im on my way to sell my pig becomes [I am going the appearance of ixed forms and rules is illusory, that the
to][sell my pig] and eventually [going to] assumes the meaning grammarlexicon division is more blurred than is commonly
of future tense. Characteristically, too, a major category, such as assumed, and that grammatical structure itself is emergent,
verb or noun that was formerly the head of a phrase, is demoted to that is, subject to constant revision by speakers. While the paths
a minor category, such as auxiliary or preposition, and becomes of grammaticalization are constrained by universal and cogni-
a satellite to the new head; thus, when a cup full of lour becomes tive factors, their proximate causes are in discourse, through
a cupful of lour, the erstwhile head noun cup is reanalyzed as a frequency of use (Bybee and Hopper 2001) and the consequent
component of a quantifying expression, now reduced in status to routinization (Haiman 1994) of word combinations.
a determiner of the semantic head noun, and its place as seman-
tic head noun is usurped by lour. Similarly, full, previously the Paul J. Hopper
head of the adjective phrase full of lour, is reduced to the status
of a suix on cup. his decategorialization of grammaticalized
forms (Hopper 1991, 303; Hopper and Traugott 2003, 10614) Aitcheson, Jean. 2001. Language Change: Progress or Decay? 3d ed.
means that the forms lose the typical attributes of the older cat- Cambridge: Cambridge University Press. A readable introduction to
egory, such as ability to take modiiers and determiners, avail- the study of change, with an excellent chapter on grammaticalization.
ability as an argument of the verb, and independent referential Benveniste, Emile. 1968. Mutations of linguistic categories. In Directions
status. Parallel restrictions are placed on verbs that become for Historical Linguistics: A Symposium, ed. Winfred Lehmann and
Yakov Malkiel, 8594. Austin: University of Texas Press.
auxiliaries.
Bybee, Joan, and Paul Hopper, eds. 2001. Frequency and the Emergence of
Phonological reduction, or erosion (Heine and Reh 1984,
Linguistic Structure. Amsterdam: John Benjamins.
215), is a frequent but not inevitable accompaniment of gram- Haiman, John. 1994. Ritualization and the development of language.
maticalization. English lets is clearly derived from let us, but In Perspectives on Grammaticalization, ed. William Pagliuca, 328.
now serves to introduce an adhortative predicate, as in lets Amsterdam: John Benjamins.
leave. When [going][to sell] was reanalyzed as [going to][sell], Haspelmath, Martin. 1999. Why is grammaticalization irreversible?
going to became reduced to gonna. he French future tense in Linguistics 37: 104368.
350
Grooming, Gossip, and Language
Heine, Bernd, and Tania Kuteva. 2002. World Lexicon of an intermediate state of this type is that we, in fact, ind exactly
Grammaticalization. Cambridge: Cambridge University Press. this kind of vocal exchange in some species of monkeys (e.g.,
Heine, Bernd, and Mechthilde Reh. 1984. Grammaticalization in African the contact calls of baboons; see primate vocalizations). In
Languages. Hamburg: Buske. gelada baboons, these calls are exchanged preferentially between
Hopper, Paul. 1991. On some principles of grammaticalization. In grooming partners when they are feeding or traveling. However,
Traugott and Heine 1991, I: 1735. Amsterdam: John Benjamins.
an alternative (but not necessarily incompatible) hypothesis for
Hopper, Paul, and Elizabeth Traugott. 2003. Grammaticalization. 2d ed.
the origin of language might be that it evolved to allow us to com-
Cambridge: Cambridge University Press. An up-to-date survey of the
ield, with detailed discussion of current controversies.
ment on, or even organize, our internal thoughts and only later
Kuteva, Tania. 2001. Auxiliation: An Enquiry into the Nature of became externalized in response to a social context.
Grammaticalization. Oxford: Oxford University Press.
Lehmann, Christian. 1995. houghts on Grammaticalization.
Background
Munich: Lincom Europa. he standard work on the historical back- Over the past century, a great deal of work has been done on
ground to grammaticalization. questions about the anatomical and neural bases of language
Lichtenberk, Frantiek. 1991. On the gradualness of grammaticaliza- production (i.e., speech and phonetics; see brain and lan-
tion. In Traugott and Heine 1991, I: 3780. guage; speech anatomy, evolution of; phonetics and
Traugott, Elizabeth. 1995. Subjectiication in grammaticalization. In phonology, neurobiology of) and on the structural bases
Subjectivity and Subjectivisation in Language, ed. Dieter Stein and of language (i.e., grammar and related aspects of cognition).
Susan Wright, 3154. Cambridge: Cambridge University Press. However, while all of these are important to the grand story of
Traugott, Elizabeth, and Richard Dasher. 2002. Regularity in Semantic
language evolution, none addresses the question of what we
Change. Cambridge: Cambridge University Press.
actually do with language the reasons why it evolved in our
Traugott, Elizabeth, and Bernd Heine, eds. 1991. Approaches to
lineage in the form we now have. Understanding the function
Grammaticalization. 2 vols. Amsterdam: John Benjamins.
of language might help to explain some of the design features of
language since it is function that drives evolution.
One aspect on which almost everyone would agree is that
GROOMING, GOSSIP, AND LANGUAGE grammar plays a central role in language: It is what allows us
Language (and speech) are unique to humans and, of course, to express complex thoughts and convey information to one
play a crucial role in making possible the human social and cul- another. Without that capacity, language would be very impov-
tural worlds. Yet the question as to why language might have erished, and humans would not have been able to achieve the
evolved, or even why it evolved only in the human lineage, has remarkable accomplishments of science, culture, and architec-
seldom been asked. At best, there has been an implicit assump- ture that we have. However, the fact that grammar exists or even
tion that language evolved to allow our ancestors to create and has a particular structure does not necessarily tell us what it was
manufacture the stone tools that have been so important a part designed to do. he fact that we can use language to create sci-
of the story of human evolution or to plan the hunts required to ence now does not necessarily mean that it originally evolved
obtain meat with those tools. However, a number of alternative for this purpose. Grammatical structure is an all-purpose tool
views of language evolution have recently been proposed that that allows any kind of information to be transmitted. he issue,
emphasize the social aspects of language use. Perhaps the best then, is which kinds of information are evolutionarily primi-
known of these is the gossip hypothesis. Others include the need tive (i.e., were the initial driving force that selected for language
to make social contracts and the role of mate choice. capacity).
It is important to appreciate in this context that the use of the he suggestion that language evolved for essentially social
term gossip in the gossip hypothesis does not imply the kinds of purposes (but has since been exploited for the conveyance of
pejorative, often malicious, forms of gossip that are often associ- technical information) emerged from studies of social bond-
ated with the term today. Although conversations may include ing in monkeys and apes. Although grooming has an obvious
statements of this kind (in efect, it is a form of policing designed hygienic function in all animals (namely, removing debris and
to control others behavior), the term gossip is here being used in perhaps parasites from the skin), it is obvious that some spe-
its much broader original sense to refer to all kinds of topics that cies of monkeys and apes do far more grooming than is really
are essentially social in character (information about oneself, necessary for strictly hygienic purposes. Some of the more social
ones likes and dislikes, ones relationship with ones interlocu- species, for example, devote as much as 20 percent of their day
tor, other people, arrangements for future social events, etc.). It to grooming each other. In these species, grooming functions
has more to do with the kinds of casual conversation one might as a mechanism for social bonding: hrough the calming and
have around the hearth or over the garden fence. other physiological efects that grooming has on the recipient,
In one sense, a conversation is a statement of intent or interest it establishes the bases for friendship and cooperative alliances.
in the other party: I would rather be standing here talking to you More importantly, it turns out that the amount of time devoted
than over there talking to so-and-so (and it does not really mat- to social grooming by a given monkey or ape species is related to
ter what we talk about). In this respect, language can be seen as the size of its social groups: he bigger the group, the more time
a form of social grooming, and one might envisage that the ori- spent grooming. However, there seems to be an upper limit on
gins of language lie in some kind of wordless and contentless time spent grooming at about 20 percent of total daytime. his
chorusing when two individuals were physically separated and limit is set by the demands of other essential activities, such as
engaged in other activities like feeding. One reason for suggesting foraging and traveling between feeding sites.
351
Grooming, Gossip, and Language
he puzzle that emerged out of this research was the realiza- percent among them. A more detailed analysis of the social con-
tion that if humans were to bond their groups in exactly the same tent itself suggested that the vast majority was concerned with
way as other monkeys and apes, then the size of typical human factual personal experiences (~30% of all social conversation
social groups would require us to devote more than twice the time), personal social/emotional experiences (~30%), or with
maximum amount of time that any monkey or ape species has third-party social/emotional experiences (~30%). he balance
ever been known to devote to this activity which would involve was devoted to critical comments about third parties and seek-
nearly half of all the hours in the day. Since the realities of having ing/giving advice.
to ind food seem to impose an upper limit on grooming time for More recently, an experimental study of language transmis-
monkeys and apes, it seems unlikely that humans could bypass sion in which groups of four participants were asked to relay
this constraint. In addition, for primates, the amount of time that information passed on to them by a previous member of a chain
is free to devote to grooming limits group sizes. Hence, if there found that both gossip (in the racy sense) and social informa-
is an upper limit on the amount of time that could be devoted tion (about the actions of people in a story) were transmitted
to grooming, then human groups would be limited to the same much more reliably than was factual information about a social
sizes as those of other primates. he fact that they are quite event (with no motivational content) or descriptive information
obviously not so limited (however they are measured, modern about a nonsocial event (for example, tourist information about
human groups are clearly many times larger than the largest a location). his suggests that information with a strong social
groups found in any primate species) means that some other (and perhaps emotional) content is more memorable in some
mechanism has been brought into play to enable humans to way than other kinds of information. he transmission rates for
break through the glass ceiling imposed by time limits on groom- social and gossip content did not difer signiicantly, suggesting
ing. Language was suggested as the likely explanation. that the issue is not the raciness of the content but the fact that it
Language has several properties that make it more eicient concerns individuals social and emotional lives.
than grooming as a mechanism for social bonding. Grooming
is very much a one-on-one activity (as it still is, in fact, with us Alternative Hypotheses
today). hat means that if you have to invest a certain amount of Two alternative hypotheses as to how language might function
time in each social partner to create a working relationship, the in the social domain have been suggested: the symbolic con-
number of relationships is ultimately ixed by how much time tract hypothesis (proposed by Terrance Deacon in his book he
you can spare. But language has broadcast capacities that enable Symbolic Species) and the Scheherazade efect (proposed by
us to have a onemany relationship with our social partners, Geofrey Miller). Both are concerned with aspects of reproduc-
thus allowing us to service several relationships simultaneously. tive behavior.
Speech also has the useful property of allowing us to multi- Deacon observed that humans have a unique mating system
task: We can talk and walk, or talk and feed, at the same time, based on pairbonds that are embedded within a large multimale/
whereas grooming does not allow that. In addition, language multifemale social group. What makes this a particular problem
allows us to exhibit badges of group membership: Being able to is that the division of labor means that mates are often separated
use the local dialect or to understand subtle jokes or obscure for long periods of time in the presence of potential rivals. Since
allusions helps to label us as members of a community. the presence of rivals creates risks for the sexual idelity of a pair-
Over and above these properties, however, the information- bond, Deacon argued that language must have been needed to
transfer capacities of language allow us to do one thing that establish formal social contracts in which exclusive mating rights
grooming does not, and this is to pass on information about the are identiied and publicly agreed upon (his is my mate; you
state of the social network in the absence of irsthand knowledge. are not allowed to interfere with him/her while I am away).
For monkeys and apes, what they do not see they will never know Millers Scheherazade efect is also concerned with the busi-
about. But we can ask and be told about what others have been ness of mating, but in this case, the focus lies with the intrinsic
up to in our absence. hat way, we can keep track of our dynam- (as opposed to the extrinsic) dynamics of the pairbond: how to
ically changing social world, know about cases in which indi- woo and keep your mate interested, rather than merely guarding
viduals have reneged on their social obligations to us, and more him/her against rivals. Miller argued that the capacity to be witty
generally avoid the worst social faux pas by not knowing who is and entertaining (in the Arabian Nights sense) would have been
or is not now friends with whom. strongly selected for, especially in a context where more attrac-
tive rivals were readily available.
Evidence for the Gossip Hypothesis While both suggestions are undoubtedly plausible, for nei-
Although we can never know what actually happened when lan- ther has any substantive evidence yet been adduced. However,
guage irst evolved, the case for the gossip hypothesis would be an alternative view might be to see both of these mechanisms
strengthened if we could show that language use was heavily as being derivative of an initial situation for which language
dominated by social functions (in the loose sense, gossip). A had been selected as a more general mechanism for bonding
number of studies provide evidence of this kind. large social groups. One reason for this suggestion is simply that
An early study of freely formed, natural conversations indi- Deacons paradox (the risk to pairbonds posed by the presence
cated that around 65 percent of conversation time by both gen- of large numbers of rivals) can only be a problem when social
ders was devoted to social topics, with all other topics (including group size is large. But without some kind of bonding mecha-
sports, politics, religion, technical or work-related topics, nism over and above social grooming, it is diicult to see how
and factual matters of all kinds) accounting for only about 35 our distant ancestors would have been able to create and hold
352
Habitus, Linguistic
together large social groups. hus, we might see language evolv- communicative). Every act of language takes place in a space
ing initially as a social bonding device and then subsequently see governed by the rules of a ield, which itself both forms and regu-
the skills underpinning language having been exaggerated by lates the linguistic habitus. Linguistic habitus is intimately con-
either or both of these two mechanisms. nected to habitus as a whole, which it helps to form and express.
It is a kind of social personality and, in language, is expressed
R. I. M. Dunbar
in linguistic dispositions. Such dispositions can be conscious but
WORKS CITED AND SUGGESTIONS FOR FURTHER READING are mostly unconscious motives, behaviors, and tendencies to
act and, in this case, speak in a certain way a way originating
Deacon, T. 1997. he Symbolic Species: he Coevolution of Language and from social background. he objective background of the lin-
the Human Brain. Harmondsworth, UK: Allen Lane.
guistic environment establishes norms of behavior in language,
Dunbar, R. I. M. 1993. Coevolution of neocortex size, group size and lan-
which are both sanctioned and policed within the ield. In this
guage in humans. Behavioral Brain Science 16:681735.
Dunbar, R. I. M., N. Duncan, and A. Marriot. 1997. Human conversa-
way, dominant linguistic patterns are established, legitimated,
tional behaviour. Human Nature 8: 23146. and consecrated.
Mesoudi, A., A. Whiten, and R. I. M. Dunbar. 2006. A bias for social infor- Linguistic habitus can express itself at any level of language.
mation in human cultural transmission. British Journal of Psychology However, it has particularly strong markers in phonetics, syn-
97: 40523. tax, and paralinguistic features (see paralanguage), such
Miller, G. 1999. Sexual selection for cultural displays. In he Evolution as propensity to speak, interest, and expression. In efect, what
of Culture, ed. R. I. M. Dunbar, C. Knight, and C. Power, 7191. Bourdieu is attempting to do with this approach to language and
Edinburgh: Edinburgh University Press. linguistics is to integrate the study of objective social variation
from sociolinguistics (see Vann 2004) with the patterns of
subjective afective convergence and divergence found in social
H psychology. In linguistic habitus, objective patterns of social
variations can be found together with subjective dispositions
expressed in and against a linguistic ield background. Field here
refers to both overarching dominant ields and ields within
HABITUS, LINGUISTIC
ields. Because linguistic habitus is essentially formed as part
Habitus is one of Pierre Bourdieus two fundamental concep- of and expresses power relations, all language acts whether of
tual tools of analysis; the other being field. If ield relates to the comprehension or articulation, oral or aural need to be under-
objective conditions of social space, then habitus is an expression stood as power relations and as symbolizing a certain relation
of subjectivity and is deined by Bourdieu as both to language and the ield in which it arises. here are con-
sequently forms of both self-censure and selectivity in language
systems of durable, transposable dispositions, structured struc-
uses, which express the linguistic habitus and the logics of prac-
tures predisposed to function as structuring structures, that is,
tice that formed it and with which it is now confronted. here are
as principles which generate and organize practices and rep-
also strategies in language use, for example, euphemism in place
resentations that can only be objectively adapted to their out-
of direct expression. Such strategies can be seen to be employed
comes without presupposing a conscious aiming at ends or an
by those occupying positions of linguistic dominance in the
express mastery of the operations necessary in order to attain
ield: hey play with language as a part of linguistic mastery.
them. Objectively regulated and regular without being in any
Condescension is another of these features of linguistic dom-
way the product of obedience to rules, they can be collectively
inance. A further strategy is hypocorrection: he linguistically
orchestrated without being the product of the organizing action
dominant can descend into vulgar speech as an expression of
of a conductor. ([1980] 1990, 53)
their complete control of the dominant vernacular. It is acknowl-
Both habitus and ield are homologous in terms of structures that edged as such by those around them as a sign of distinction in
are both structured and structuring. Linguistic habitus concerns the way someone plays with the popular form. hese strategies
the language element of any individuals or group of individuals are not open to those less linguistically secure. In fact, some of
habitus. them may even have recourse to the opposite strategy hyper-
Linguistic habitus is a central theme in Bourdieus attack on correction where anxiety to be linguistically correct is merely
orthodox linguistics. In Language and Symbolic Power ([1982] interpreted as evidence of linguistic insecurity and, therefore, an
1991), he argues that conventional linguistic studies are based inferior position in the ield.
on a fundamental misunderstanding. In linguistics, language is Bourdieu uses linguistic habitus in a number of ways and
studied very much as an object of contemplation. his tradition ield contexts. However, it has particular signiicance in the areas
began with Ferdinand de Saussure and treats the social world as of culture and education. In culture, linguistic habitus is the
a series of phenomena of which language is one that can be base generator of a certain style and way of being in the world.
decoded or deciphered according to a particular established the- Whether afected, direct, stylized, abrupt, or prompt, linguistic
oretical code. Bourdieu refers to this tradition as an intellectual- manners can be understood in terms of the social conditions that
ist philosophy against which he wishes to pose his own theory produced them and the social diferentiating purposes for which
of practice. For Bourdieu, it is not enough to treat language as they were created. In fact, at one point, he even argues that our
symbolic interactions; it is also necessary to see them as expres- entire language and its classiicatory systems can be understood
sions of symbolic power (see inequality, linguistic and as the expression of opposing (antagonistic) adjectives for
353
Habitus, Linguistic Head-Driven Phrase Structure Grammar
example, high/low, ine/coarse, light/heavy, broad/narrow, Pierre Bourdieu: Language, Culture and Education, ed. M. Grenfell and
common/unique, brilliant/dull and that these adjectives have M. Kelly, 7384. Bern: Peter Lang.
as their social derivation the structure of society (and its domi-
nant ields to be found in social classes; see Bourdieu [1979]
HEAD-DRIVEN PHRASE STRUCTURE GRAMMAR
1984, 468).
Language, of course, is the medium of education, and it is HPSG: A Preview
here that the convergence or divergence between a particu- The core idea of head-driven phrase structure grammar
lar linguistic habitus and the ield that surrounds it is most (HPSG), as a theory of grammatical representations, is
apparent. In publications such as he Inheritors ([1964] 1979), that local syntactic dependencies patterns of covariation
Reproduction ([1970] 1977), and Academic Discourse ([1965] accounted for in earlier versions of generative grammar via
1994), Bourdieu shows the link between academic language and transformations can be reduced to epiphenomena of selec-
individual habitus, expressed in cognitive and mental struc- tional specifications on heads. In order to implement this
tures ways of thinking and the very language of such expres- explanatory strategy, HPSG models linguistic expressions as
sion. here are power relations between teachers and students complex symbols, which interact with a small network of very
played out when one linguistic habitus (that of the teacher) faces general constraints to admit a subset of possible representa-
another (that of the student) (see Grenfell 1998, 2004). he fact tions. The constraints that restrict structural admissibility are
that some students come from the same social origins as the cul- defined on local phrase structure representations phrasal
ture represented in education and others do not is the basis of categories and their daughters and take the form of feature
matches and mismatches that impact academic achievement. (in)equality requirements, where feature/value pairs are used
Put succinctly, the linguistic habitus of some students results in to encode separately transmissable properties of linguistic
their feeling like a ish in water during their schooling while oth- expressions. These constraints apply at a single level of rep-
ers are most certainly out of the water and left somewhat high resentation and jointly determine the status of complex lin-
and dry. Parents own linguistic habitus even complements their guistic signs. Roughly speaking, sentences of the language
own explicit collusion in this process of hidden social selection, correspond one-to-one with these admissible signs.
as some students pass through to the upper echelons of acade- A true theory of grammar is a completely explicit set of rep-
mia while others drop out. Linguistic habitus, in this sense, is resentations, a subtheory of models for the representations the
their very being. theory sponsors, and fully interpreted constraints on those rep-
Ultimately, Bourdieu is seeking to transform empirical think- resentations. One objective of such a theory is to make it clear
ing (everyday/common sense) into scientiic thinking (in his why sentences such as Terry put that book on the table, hat
case, sociological) by altering the linguistic habitus of research- book, Terry put on the table, On that table Terry put the book, and
ers in its empirical forms in such a way that everyday language so on are well formed, whereas not only *Table put the on the that
is replaced, at least in part or partially relected upon, by such Terry book but also *Terry put the book, *Terry put the book on,
analytical thinking tools as habitus, ield, and so on. *Terry put on the table, and so on, are all bad. he irst of these
bad examples has all the words of the well-formed versions cited
Michael Grenfell
earlier but hopelessly scrambled together; the latter examples
have the right order, but all appear to be missing crucial parts.
he problem is to account for these facts and for the similarities
Bourdieu, Pierre. [1964] 1979. he Inheritors: French Students and heir and diferences in the seemingly related sentences exhibited.
Relation to Culture. Trans. R. Nice. Chicago: University of Chicago he theory of syntax embodied in HPSG appeals to principles of
Press. valence satisfaction how lexical items combine with other ele-
, with Jean-Claude Passeron and Monique De Saint Martin. [1965]
ments that they require as parts of their idiosyncratic individual
1994. Academic Discourse. Oxford: Polity.
, with Jean-Claude Passeron. [1970] 1977. Reproduction in
Education, Society and Culture. Trans. R. Nice. London: Sage. (1)
. [1979] 1984. Distinction. Trans. R. Nice. Oxford: Polity. PHON criticized
. [1980] 1990. he Logic of Practice. Trans. R. Nice. Oxford: Polity.
. [1982] 1991. Language and Symbolic Power. Trans. G. Raymond verb
and M. Adamson. Oxford: Polity.
HEA D VFORM fin
, with Loc Wacquant. 1989. Towards a relexive sociology: A work-
shop with Pierre Bourdieu. Sociological heory 7.1: 2663. A UX
LOC
NP
Encrev, Pierre. 1983. La liaison sans enchainement. Actes de la recher-
ch en science sociales 46: 3966. SYNSEM LOC CA T SUBJ normi
COMS NP
Grenfell, Michael. 1998. Language and the classroom. In Bourdieu
and Education: Acts of Practical heory, ed. M. Grenfell and D. James, normj
7288. London: Falmer. CONT w(j )(i )

. 2004. Bourdieu in the classroom. In Culture and Learning: Access
and Opportunity in the Curriculum, ed. M. Olssen, 4972. Westport, CONX ...
CT: Greenwood.
MONLOC ...
Vann, Robert. 2004. An empirical perspective on prac-
tice: Operationalising Bourdieus notions of linguistic habitus. In Figure 1.
354
Head-Driven Phrase Structure Grammar
(2)
(a) PHON criticized (b) VP
verb v NPj
HEA D VFORM fin
A UX criticized Leslie
LOC
SYNSEM LOC CA T SUBJ
NPnormi
COMPS
CONT w(j)(i) Figure 2.
PHON criticized NPnormi

verb
HEA D VFORM fin Leslie
A UX

LOC
LOC CA T SUBJ NP
normi
SYNSEM

COMPS NPnormj
CONT w(j)(i)
properties together with what are, in efect, lexical redundancy he key valence features comp(plement)s and subj(ect)
rules, to account for all facts of the kind noted in the previous encode the combinatorial requirements of particular lexical ele-
paragraph and all syntactic dependencies, including parallelisms ments, making it possible to eliminate speciic phrase structure
between seemingly related construction types. he following rules in favor of broad schemata that correctly project phrases
discussion presents a more technically leshed-out instantiation from lexica regardless of the latters valence peculiarities. Each
of this general approach to characterizing grammatical well- such schema identiies a certain very general kind of structure,
formedness. which will be leshed out in detail, depending, in many cases,
on the particular lexical item that heads the structure. Two of the
An Illustration of the System: Constituency and most important conditions interacting with these schemata are
Dependencies the valance principle and the head feature principle, which can
As noted, local dependencies are uniformly accounted for in be stated roughly as follows:
HPSG as instances of nothing more elaborate than valence sat-
Valence Principle: For any valence feature f, the value on the
isfaction, or systematic relationships between valence speci-
mother is the list containing the value of f, on the head daugh-
ications of related classes of lexica. his approach is suicient
ter minus the values corresponding to the head daughters
to account for even quite distant linkages, such as It continues
sisters.
to appear to have been raining during the night, by means of a
Head Feature Principle (HFP): For any phrasal category,
feature subj encoding subject-selection possibilities on lexical
the value of the head feature is identical to the value of the
heads and their phrasal projections.
categorys head daughters head feature.
LOCAL DEPENDENCIES. he best way to exhibit the nature of he valence principle speciies that the appearance of any
HPSGs approach to an account of the patterns relected in natu- required valent in a local structural relationship to a select-
ral language is to take an example of a single sentence and show ing head removes the corresponding element from the must-
how it is licensed by the interaction of HPSG constraints. Assume, have list of the mother. his principle does not, of course, limit
for example, that the lexical entry for criticized comprises a num- the number of valents that may appear as the heads sisters.
ber of separate (partial) descriptions that includes a description But the schematic possibilities of English require a phrase of
roughly along the lines of (1) in Figure 1. this type to be lexically headed, and in order to satisfy both this
As this (partial) lexical entry illustrates, signs in HPSG com- requirement and the valence principle, exactly the number
prise a ramiied feature geometry that simultaneously speciies and type of complement sisters that the verb identiies on its
phon(ology) and synt(tax)/sem(antics), the latter a complex comps list (i.e., the list of descriptions that must be satisied by
information package revealing loc(al) and nonloc(al) prop- the heads selected sisters) must appear in the structure so as
erties, itemizing, respectively, the inherent morphosyntactic to yield the empty complements lists on the mother. We thus
and semantic properties of the category, on the one hand, and license the structure in (2)a, which can be abbreviated as (2)b
the presence of elements within the sign with nonlocal link- in Figure 2. he head-subject schema allows a completely satu-
ages (the presence of a gap linked to a iller arbitrarily far away, rated verb (V) to have a phrasal head daughter with a subj list
wh properties, etc.), on the other hand. Further subspeciica- of length one. Again, in conjunction with the valence principle,
tions identify inherent morphosyntactic properties of the sign, this schema allows a verb phrase (VP) combining with a con-
including head those necessarily shared with the heads stituent that exactly meets the description indicated in its subj
phrasal mother by virtue of the mother/head daughter rela- to appear as a structure of type head-subject under an S (i.e.,
tionship along with semantic and contextual information. clausal) node, that is, a V with both an empty comps and an
355
Head-Driven Phrase Structure Grammar

(b)
(3) (a) S PHON
NP S LOC 1
LOC 1 SLA SH 1 SYNSEM NONLOC|SLA SH 1
Terry
NP VP SLA SH
1
I
V NP
SLA SH 1
know
N PP
SLA SH 1
stories
P NP
LOC 1
about
SLA SH 1
e
Figure 3.
empty subj list. hus, Robin criticized Leslie will be straightfor- the percolation of wh properties (as in so-called pied-piping phe-
wardly licensed. nomena), throughout arbitrarily large structures exist in natural
Note further that *Robin criticized and *Robin criticized Leslie language and must be accounted for.
certain books will both be ruled out. Neither of them will satisfy the HPSG follows the central innovation introduced by Gerald
head-complement schema under the constraint imposed by the Gazdar a quarter of a century ago in treating iller/gap linkages
valence principle. In the irst case, the lack of a complement daugh- as a by-product of the difusion of a speciic feature slash, whose
ter will yield a result with a nonempty complements list, violating value is correlated with that of the iller and which is driven
the schema requirement. In the second case, the valence require- from mother to (at least one) daughter by the nonlocal feature
ments on criticize will not equal the sum of those on the mother plus principle:
the set of daughters that appear in the structure, as the full form of
Nonlocal Feature Principle (NFP, simpliied): For any nonlo-
the valence principle requires. Hence, both fail to be licensed.
cal feature f the value of f on a phrase at any point below the
he valence feature lists do not in fact specify information
place in the structure where f was introduced is the union of
about the entire sister sign selected by the head. Lexical selection
the values of f on the set of daughters.
is universally blind to phonological form and also to the descend-
ing constituent structure of any selected phrasal constituent; he NFL states, in essence, that below the point where the non-
therefore, it makes sense to restrict valence speciications to syn- local feature is introduced, at least one daughter in each two-
sem values. But the latter contain enough information to imple- generation tree must bear the value of f speciied. In the case of
ment virtually all local dependencies as simple expressions of iller/gap constructions, the feature slash is identiied at the top
selection. hus, it is possible to select not only for coarse-grained of the dependency with the relevant part (called the loc value,
information, such as the number of valents and their respective illustrated in Figure 2) of the iller. he descendents of the high-
lexical category types, but also for information such as their case, est category with a nonempty slash feature preserve this value
or the inlectional properties of their heads (on the assumption in their own speciications, until at the bottom of the depen-
that the latter are head features and will therefore be shared dency, the slash value is cashed out as an empty category. An
between a phrasal valent and its lexical head daughter). he mor- example is illustrated in (3)a, with the relevant lexical entry for a
phosyntactic dependency in English between the identity of aux- slash terminal category listed in (3)b in Figure 3. his approach
iliary elements, on the one hand, and the inlectional form of the to iller/gap dependencies comports well with familiar phenom-
verb that immediately follows them, on the other hand, follows ena in languages that mark such dependencies by local lagging
simply and directly by specifying that, for example, each auxiliary of extraction pathways; it also has considerable advantages over
have selects as its complement a VP, preserving, in its own feature movement-based approaches in the analysis of multiple gap
speciications, the inlectional class of its head daughter (specii- linkages to a single iller, such as parasitic and across-the-board
cally, the value psp, encoding past participial status). extractions in coordinate structures.
Nonlocal Dependencies Remaining Issues

Nonlocal dependencies, in contrast, do not depend on the here are, of course, a number of other important aspects to
selectional possibilities of particular (classes of) lexical items. the grammar architecture incorporated in HPSG, some of them
Unbounded dependency constructions involving extraction, or involving major points of debate within the framework.
356
Hippocampus
he syntax/semantics interface issue is not fully resolved in Because of the unique and circumscribed nature of his
HPSG, in the sense that there are a number of competing 1953 surgery, H. M. is probably the most studied patient in the
approaches to a variety of issues, ranging from the nature of history of neuropsychology (Ogden and Corkin 1991): A neu-
the objects in HPSG representations that map to semantic rosurgeon inserted thin metal tubes above the eyes, and via
representations to the mapping rules themselves. Diferent suction, removed parts of H. M.s hippocampus and directly
answers to these questions typically correspond to major dif- linked MTL structures. his operation greatly ameliorated H.
ferences in syntactic analyses. M.s life-threatening epilepsy, left H. M.s neocortex virtually
Probably the most fundamental point of contention is the undamaged, and spared all neocortex with known links to lan-
disagreement within HPSG about the degree to which pat- guage comprehension. However, the operation caused a selec-
terns in natural languages relect, on the one hand, lexi- tive memory deicit, with normal recall of information familiar
cal properties and systematic mappings between lexical to H. M. before his operation and used frequently since then,
descriptions, or restrictions on speciic constructions, as but impaired recall of information newly encountered after his
encoded by constraints imposed on types belonging to very operation and not massively repeated since then (see MacKay
elaborate ontologies, on the other hand. At one extreme, et al. 2007).
HPSG includes a set of lexical heads with very abstract H. M. has sentence-level language deicits that precisely
properties and possibly no phonological realization, as in mirror his memory deicits. D. G. MacKay et al. (2007) tested
earlier treatments of relative clauses. At the other extreme H. M.s sentence-level comprehension in six tasks. In one task,
are intricate multi-inheritance hierarchies in which prop- participants identiied the grammatical versus ungrammati-
erties of constructions are essentially posited as underived cal status of never previously encountered sentences that were
primitives, or are derived by combining a number of such either grammatical or ungrammatical (see grammaticality).
underived primitives. Here, H. M. responded with the correctly answer reliably less
often than controls matched for age, IQ, and education. his
hese and a number of other foundational issues are currently
comprehension deicit impaired a wide variety of syntactic struc-
under intense discussion and debate within the HPSG research
tures, including ones that memory-normal participants ind easy
community, and there is no reason to expect a consensus on any
to recall: H. M. exhibited equivalent comprehension deicits for
of these questions in the near future.
easy- and diicult-to-recall sentences.
On the basis of the relatively brief history of the framework
In a second task, H. M. again performed reliably worse than
so far, it seems very possible that divergences in approaches
controls in identifying grammatical sentences as grammatical
to these fundamental matters will in time yield major schisms
and in detecting, identifying, and repairing errors in sentences
within the theory, leading in the end to two or more major ver-
containing incorrect and misordered words. A third task required
sions of the theory, in much the way that categorial grammar has
multiple-choice identiication of who-did-what-to-whom in
split into combinatory categorial grammar, on the one hand,
novel sentences. Here, H. M. identiied the correct thematic
and the type-logical version based on the Lambek calculus, on
role of sentence constituents reliably less often than controls. A
the other. It remains to be seen whether the theory will develop
fourth task required multiple-choice recognition of the appropri-
along these lines.
ate interpretation for sentences containing novel metaphors.
Robert D. Levine Here, H. M. chose the correct interpretation reliably less often
than controls, and his errors indicated failure to recognize that
WORKS CITED AND SUGGESTIONS FOR FURTHER READING the sentences were metaphoric. A ifth task required yesno
Borsley, Robert. 1996. Modern Phrase Structure Grammar.
recognition of the appropriate interpretation for ambiguous
London: Blackwells. sentences. Here, H. M. responded correctly less often than con-
Levine, Robert, and homas Hukari. 2006. he Unity of Unbounded trols and sometimes responded yes-and-no despite repeated
Dependency Constructions. Stanford, CA: CSLI requests to respond yes-or-no.
Pollard, Carl, and Ivan Sag. 1994. Head-Driven Phrase Structure Grammar. Consistent with several earlier results discussed next,
Chicago: University of Chicago Press. H. M.s ambiguity comprehension deicits were not due to
memory overload associated with multiple meanings: In the
ambiguity detection and description task of MacKay, Stewart,
HIPPOCAMPUS
and Burke (1998), H. M. took much longer than controls to
he hippocampus is a bilaterally symmetric subcortical structure begin to describe the irst of two meanings in ambiguous sen-
adjacent to the lateral ventricle in the medial temporal lobe tences, even when he never discovered the second meaning.
(mtl). Researchers in 1968 reported dramatic memory phe- H. M. also discovered both meanings without experimenter
nomena associated with hippocampal-MTL damage, and data help less often than controls and often failed to understand
reported from 1998 to 2006 have indicated parallel phenomena meanings that the experimenter had just explained. Research
for many other aspects of cognition, including language. I irst isolated seven deicits in how H. M. described the sentence
discuss patient H. M., the initial source of data for the hippocam- meanings: Grammatically impossible interpretations, misread-
pal-memory and hippocampal-language links. I then discuss ings relecting failure to comprehend sentence-level mean-
related patient groups and the theoretical signiicance of the ing, errors in pronoun use (anaphora), error correction
hippocampal-language link. failures, free associative responses, self-miscomprehensions,
357
Hippocampus
and failures to follow experimenter requests for clariication. in reading novel sentences aloud (Friedman 1996; MacKay and
Research also indicated comprehension failure involving an James 2001).
initial meaning for sentences, ambiguous or not. Second, these parallels are diicult to explain in current sys-
To summarize, in a wide range of tasks involving many fun- tems theories, in which independent systems process memory,
damental aspects of sentence comprehension, H. M. exhibited language comprehension, language production, and visual cog-
deicits not caused by his memory problems (for corroborating nition, and the hippocampus subserves only the memory sys-
evidence on H. M.s comprehension deicits, see Corkin 1984; tem (see, e.g., Schmolck, Stefanacci, and Squire 2000). Under
Lackner 1974; and Schmolck, Stefanacci, and Squire 2000). systems theories, hippocampal-MTL damage should yield
However, H. M.s comprehension deicits were selective rather memory deicits without deicits in other cognitive systems, and
than across the board: Experiment six in MacKay et al. (2007) certainly without parallel deicits and parallel sparing across
demonstrated that H. M. comprehended familiar words and supposedly independent systems for sentence comprehension,
phrases in isolation without deicit despite large deicits in com- sentence production, visual cognition, and episodic memory.
prehending these same stimuli when embedded within sen- hese predictions have failed, and major attempts to rescue
tences. Besides demonstrating selectivity, these results indicated current systems theories from these failed predictions have like-
that H. M.s deicits were not attributable to low motivation, to wise failed (see MacKay 2001, 2006; and MacKay, James, and
failure as a child to learn the meaning of the critical words and Hadley 2008).
phrases, or to failure to understand and follow instructions for Third, a new theoretical framework known as binding the-
the task. ory (not to be confused with the anaphoric binding theory
H. M. also exhibited signiicant production deicits when in linguistics; see Jackendoff 2003, 15) readily explains and,
describing the meanings of familiar words that he compre- indeed, originally predicted the links between hippocampal-
hended without deicit in MacKay et al. (2007): Judges blind to MTL damage and parallel deficits and sparing in memory,
speaker identity rated H. M.s meaning descriptions as reliably sentence-level language, and other aspects of cognition.
more redundant, less coherent, less grammatical, and less com- Under binding theory, hippocampal-MTL damage impairs
prehensible than those of controls. hese indings replicated binding mechanisms for forming new internal representa-
earlier results indicating deicits in H. M.s production of novel tions in the cortex but does not affect mechanisms for activat-
or non-clich sentences (see MacKay, Stewart, and Burke 1998). ing already existing cortical representations (see, e.g., MacKay
Again, however, H. M. exhibited selective production deicits that et al. 2007 and James and MacKay 2001 for important theoret-
mirrored his memory deicits, for example, spontaneously pro- ical details regarding forgetting, frequency of use, and aging
ducing clich phrases such as in a way (familiar from before his and language ).
surgery) without errors (ibid.). To illustrate in detail how binding theory explains his selec-
H. M. also exhibited similar deicits and sparing in the seem- tive deicits, consider H. M.s sentence production in a standard
ingly simple task of reading sentences aloud (MacKay and James picture-description task requiring the incorporation of prespeci-
2001): He produced abnormal pauses at major syntactic bound- ied target words (MacKay et al. 2007): H. M. described the word-
aries unmarked by commas in the sentences, but normal pauses picture stimuli signiicantly less accurately and completely than
at syntactic boundaries marked with commas, a prosodic marker eight controls, included fewer target words, and produced more
that H. M. had learned prior to his operation. H. M. also produced incomplete sentences (e.g., lacking a subject or verb), viola-
abnormal pauses within unfamiliar phrases in the sentences, but tions of agreement rules, non sequiturs, and run-on sentences
normal pauses within frequently used phrases. hese and other than the controls. Descriptions by H. M. (1a2a) versus controls
selective deicits indicated that he has diiculty with the process (1b2b) for the same word-picture stimuli illustrate some of these
of reconstructing novel aspects of sentence structure when read- diferences.
ing aloud.
(1a) H. M. description: Because its wrong for her to be
H. M. also exhibited similar deicits and sparing in visual
and hes dressed just as this that hes dressed and the same
cognition: When detecting target igures hidden in concealing
way.
arrays, he performed reliably worse than controls for unfamiliar
targets but not for familiar targets (MacKay and James 2000). In (1b) Control description: Well, I think Ill take that one
short, H. M. exhibits similar selective deicits in visual cognition, although it looks wrong.
episodic memory, sentence-level comprehension, and sentence (2a) H. M. description: I want some of that pie either some
production when speaking and reading aloud; impaired pro- pie and Ill have some.
cessing of never previously encountered events, visual igures, (2b) Control description: Uh, there are two people getting
phrases, and propositions; but spared processing of informa- pie, but theres only one piece of blueberry pie left, and so,
tion familiar to him before his lesion and used frequently since either one of them will have to have it.
then.
Why are these parallels important? One reason is that H. M. Note that H. M.s picture-description problems in 1a and
is not unique: Other patients with hippocampal-MTL damage 2a were selective: Unlike agrammatic aphasics, H. M . did
exhibit identical parallels, reinforcing the links among hippocam- not produce morphemes and nonsense words jumbled
pus-MTL, language, and memory. For example, other amnesiacs together into morphological salads (Jackendoff 2003,
exhibit deicits in detecting the two meanings in ambiguous sen- 264). Moreover, he produced frequently used units, such as
tences (Zaidel et al. 1995) and make errors resembling H. M.s its wrong, to be, the same way (1a), some of that, and
358
Hippocampus Historical Linguistics
Ill have some (2a), without errors. Under binding theory, memory and sentence production. Journal of Experimental and
separately stored syntactic units and rules serve to activate Clinical Neuropsychology 30.3: 280300.
already formed internal representations so that words and MacKay, D. G., L. E. James, J. K. Taylor, and D. E. Marian. 2007.
phrases become produced in the appropriate order. Because Amnesic H. M. exhibits parallel deicits and sparing in language and
memory: Systems versus binding theory accounts. Language and
H. M.s syntax-based activation mechanisms are intact and
Cognitive Processes 22.3: 377452.
frequently used since his lesion, H. M. therefore produces
MacKay, D. G., R. Stewart, and D. M. Burke. 1998. H. M. revis-
familiar words, phrases, and propositions such as its wrong ited: Relations between language comprehension, memory, and the
and Ill have some, without errors. However, he lacks already hippocampal system. Journal of Cognitive Neuroscience 10: 37794.
formed internal representations for propositions that he Ogden, J. A., and S. Corkin. 1991. Memories of H. M. In Memory
has used repeatedly before and after his lesion to describe the Mechanisms: A Tribute to G. V. Goddard, ed. W. C. Abraham, M.
MacKay et al. (2007) word-picture stimuli. The word-picture Corballis, and K. G. White, 195215. Hillsdale, NJ: Erlbaum.
stimuli, therefore, triggered familiar units that H. M. simply Schmolck, H., L. Stefanacci, and L. R. Squire. 2000. Detection and expla-
concatenated without forming complete, appropriate, and nation of sentence ambiguity are unafected by hippocampal lesions
coherent utterances (see 1b, 2b). but are impaired by larger temporal lobe lesions. Hippocampus
In conclusion, the pressing problem for future research is to 10: 75970.
Zaidel, D. W., E. Zaidel, S. M. Oxbury, and J. M. Oxbury. 1995. he inter-
test new binding theory predictions for relations among brain,
pretation of sentence ambiguity in patients with unilateral focal brain
language, memory, and other aspects of cognition (see MacKay
surgery. Brain and Language 51: 45868.
et al. 2007).
Donald G. MacKay
HISTORICAL LINGUISTICS
WORKS CITED AND SUGGESTIONS FOR FURTHER READING Historical linguistics is the study of how languages change over
Corkin, S. 1984. Lasting consequences of bilateral medial tempo- time. We can approach the study of change in various ways. One
ral lobectomy: Clinical course and experimental indings in H. M. is by studying the histories of individual languages. An example
Seminars in Neurology 4: 24959. would be analyzing the changes that have taken place in English
Friedman, R. B. 1996. Phonological text alexia: Poor pseudo-word over the last thousand years. A second approach involves com-
reading plus diiculty reading functors and aixes in text Cognitive paring various related languages in order to draw inferences
Neuropsychology 13: 86985. about the types of changes that have occurred since the time
Jackendof, R. 2003. Foundations of Grammar: Brain, Meaning, Grammar, they split from their common ancestor. We can further study the
Evolution. Oxford: Oxford University Press. frequency and naturalness of the changes that the languages are
James, L. E., and D. G. MacKay. 2001. H. M., word knowledge and
hypothesized to have undergone and the efects of change in
aging: Support for a newtheory of long-term retrograde amnesia.
one area on other parts of the language. Finally, historical lin-
Psychological Science 12: 48592.
Lackner, J. R. 1974. Observations on the speech processing capabilities
guistics is also concerned with inding explanations for change,
of an amnesic patient: Several aspects of H. M.s language function. including why languages change and how a particular change
Neuropsychologia 12: 199207. was actuated in a particular circumstance. Historical linguistics
MacKay, D. G. 2001. A tale of two paradigms or metatheoreti- also intersects with other ields. For example, a linguist trying
cal approaches to cognitive neuropsychology: Did Schmolck, to reconstruct the geographic extent of a proto-language may
Stefanacci, and Squire demonstrate that Detection and explanation also make use of data from both archaeology and historical
of sentence ambiguity are unaffected by hippocampal lesions but anthropology.
are impaired by larger temporal lobe lesions? Brain and Language he following discussion begins with an abbreviated history
78: 26572.
of the ield, from classical times to the early twentieth century.
. 2006. Aging, memory and language in amnesic H. M.
From there, we turn to contemporary research and conclude
Hippocampus 16: 4914.
with a look at possible future directions for the ield.
MacKay, D.G., D. M. Burke, and R. Stewart. 1998. H. M.s language pro-
duction deicits: Implications for relations between memory, seman-
tic binding, and the hippocampal system. Journal of Memory and History of the Field
Language 38: 2869. Modern historical linguistics was developed in the nineteenth
MacKay, D. G., and L. E. James. 2000. Binding processes for visual cog- century, although there were both eighteenth- and seventeenth-
nition: A hippocampal amnesic (H. M.) exhibits selective deicits in century scholars who practiced something that todays linguists
detecting hidden igures and errors in visual scenes. Poster presented would recognize. In contrast, Classical Greek and Roman lin-
to the Cognitive Neuroscience Society, San Francisco.
guists had little to say about language history despite consider-
. 2001. he binding problem for syntax, semantics, and
able sophistication in their synchronic descriptive techniques
prosody: H. M.s selective sentence-reading deicits under the
(see synchrony and diachrony). he study of language
theoretical-syndrome approach. Language and Cognitive Processes
16: 41960. change in the Graeco-Roman world was largely conined to
. 2002. Aging, retrograde amnesia, and the binding problem for etymology that is, to claims about the origin of individual lexi-
phonology and orthography: A longitudinal study of hippocampal cal items. For example, in Platos Cratylus, the word anthropos is
amnesic H. M. Aging, Neuropsychology, and Cognition 9: 298333. said to derive from the contraction of the phrase anathro:n h
MacKay, D. G., L. E. James, and C. Hadley. 2008. Amnesic H. M.s per- po:pen (looking up at the things hes seen). Similar methods
formance on the Language Competence Test: Parallel deicits in were employed by Latin linguists such as Varro, who claimed that
359
Historical Linguistics
anas duck is related to the verb nre to swim. Furthermore, William Jones to the Royal Asiatic Society in 1786, in which he
although numerous similarities between Latin and Greek were noted similarities between Sanskrit and the languages of Europe
noted, it was assumed that all such words were direct borrow- and hypothesized that they may come from a common ances-
ings into Latin from Greek. Shared common ancestry from a lan- tor: [N]o philologer could examine the Sanskrit, Greek, and
guage no longer spoken was never considered (see extinction Latin, without believing them to have sprung from some com-
of languages). Such an assumption was in keeping with the mon source, which, perhaps, no longer exists. here is a similar
strong cultural debt that the Roman world owed to the Greek reason, though not quite so forcible, for supposing that both the
(see Law 2003). Gothic and the Celtic had the same origin with the Sanskrit.
he classical etymological method continued to be employed (he speech is quoted in almost all introductory textbooks for
throughout the Middle Ages, where it was joined by theories of historical linguistics; see, for example, Campbell 2004 and Trask
language change and diversity built on the biblical story of the 1996.) However, as we have seen, elements of comparative and
destruction of the Tower of Babel (Gen. 11). A summary of the historical linguistic methods predate Jones by several hundred
theory can be found in Dantes de Vulgari Eloquentia (for one years.
English translation, see Shapiro 1990). Such a theory of diversity he nineteenth century saw an explosion of work on his-
speciies both a cause of language change and a partial model torical linguistics and the reconstruction of language history;
of the origin of modern linguistic diversity. Work within a model in fact, many methods developed during this period are still in
of change laid the foundation for much later linguistic scholar- use. It is this period that gives us the idea of the correspondence
ship, for it led to questions about the language that was spoken set: a set of words in related languages that are descended from
by those erecting the Tower of Babel (and, therefore, what the a common proto-form and which exhibit regular phonologi-
irst human language was) and exactly how modern attested lan- cal correspondences. For example, English three, Latin trs,
guages related to one another. Greek tris, Sanskrit tryas, and Gothic rs all relect proto-
he de Vulgari Eloquentia is also a founding discussion of Indo-European *treyes. Furthermore, the correspondences
relationships among the vernacular languages of Europe. he among phonemes in these languages are regular. For example,
rise of the study of Romance vernaculars led to an examination English th corresponds to Latin t in cognate words (cf. father
of systematic diferences among those languages, as well as com- and pater, brother and frater, among others; for the method and
parison with Latin (for example, why Latin de is a preposition further reconstructions, see Trask 1996). he irst detailed dis-
meaning from, but in French and Italian it marks possession). cussion of such correspondence sets dates to A. Turgot ([1756]
As R. H. Robins (1968, 100 f) notes, it was this examination that 1961), and the method was systematized by Rasmus Rask
allowed the development of an adequate framework for dia- (1818). We also, shortly afterwards, ind the irst reconstructions
chronic linguistics because of a chance to study change where of Indo-European. hese are in the work of August Schleicher
the parent language was already well understood. (1848), who not only reconstructed lexical items but also con-
In the early Middle Ages, there was also a highly sophisticated structed an Indo-European fable. Schleicher also introduced
Arabic comparative linguistic tradition. Ibn Hazm (9941064) the Stammbaum, or family tree, model of linguistic relationship
noted regular correspondences among Hebrew, Arabic, and (see language families ).
Syraic, and in the Ihkam Ibn Hazm further identiied changing Neither Schleicher nor his contemporaries placed much
pronunciation and language contact as driving forces in the cre- weight on the importance of regularity in correspondences,
ation of linguistic diversity (see contact, language). however. Scholars of the following generation, including Karl
In the sixteenth to eighteenth centuries, we begin to see the Brugmann and Berthold Delbrck (the Neogrammarians or
study of language change linked to other branches of linguistics, Junggrammatiker), were the irst to recognize the importance
such as typology. Scholars such as Konrad Gessner ([1555] of regularity in sound change for the comparative method
1974), Joseph Justice Scaliger ([1599] 1610), and later Andreas and to use it as a tool for discriminating between inheritance
Jger (1686) and Peter Simon Pallas (1786) collected and com- and analogy. he recognition of regularity in sound change
pared vocabularies of the languages available to them and made is the pillar of historical reconstruction and forms the
hypotheses about linguistic relationships on this basis. However, basis of much modern work on historical linguistics. Without
the comparisons are unsystematic and based mostly on very few a conception of sounds changing regularly in particular pho-
features. For example, Scaliger divides the languages of Europe netic environments, it is impossible to identify irregularities,
into four major classes, depending on whether their word for to reconstruct proto-forms, and thereby to form a reliable idea
god is based on deus, theos, gott, or bog (roughly correspond- of linguistic relationships. However, arguments about the uni-
ing to Latin/Romance, Greek, Germanic, and Slavic, respec- versal regularity of sound change continue. One area involves
tively). Gessner ([1555] 1974, 110) deduces that Armenian is the paradox between apparent regularity at the macro level
closely related to Hebrew because of the similarity of words such and irregularity when a language at a particular stage in time
as lezu tongue (Hebrew laschon in Gessner) and hhatz cross is examined. hat is, a language is not homogeneous at any
(Hebrew etz or hetz). Moreover, until G. W. Leibniz, there is no stage because of the amount of dialectal variation among
conception that languages could have been descended from a speakers. A second point of debate is the applicability of mod-
language that is no longer attested. els relying on regularity of sound change to languages outside
he beginning of modern historical linguistics and the com- of Europe, for instance, those spoken by huntergatherer com-
parative method is often said to date from a speech given by Sir munities in Australia.
360
Historical Linguistics
A further methodological coup is due to Ferdinand de work on historical syntax concerns word order change in the syn-
Saussure ([1915] 1972). Saussure hypothesized, on the basis of chronic analysis of ancient languages or the causes of syntactic
both internal evidence of root structure and varied correspon- change, rather than reconstruction per se. Inluential here has
dences among vowels in Indo-European languages such as Greek been the work of David Lightfoot (e.g., 1999), who has developed
and Sanskrit, that there had once been a further set of laryngeal a theory of change that assigns the primary cause of change to
consonants (possibly /h/, // and /w/) that had disappeared in a childs acquisition of the language. Since children are exposed
all environments in attested languages. Saussures contribution to linguistic data that is slightly diferent from what their parents
is very important to historical linguistics for two reasons: First, it were exposed to, they therefore draw slightly diferent conclu-
is the foundation of internal reconstruction, that is, the sions about the syntactic structure of their language. We see this
hypothesis of reconstructions based on synchronic patterns in relected in the historical record as a syntactic change. Others are
one language, rather than a direct comparison of forms between less comfortable in ascribing change solely to grammar change at
languages. Second, it demonstrates very clearly the power of the acquisition and argue that syntactic changes also occur in adult
comparative method and the importance of regularity in corre- speakers as a result of exposure to new languages and dialects,
spondence sets. he subsequent decipherment of inscriptions in changing prestige, and other factors. here is ongoing debate
Hittite and Luwian conirmed Saussures hypothesis since two of over the extent to which a person may spread change as an adult,
the laryngeals are preserved in these languages precisely where since there are also clearly generational diferences in linguistic
we should expect to ind them. production (see age groups).
Historical linguists in the twentieth century made progress in he study of reconstructions within historical phonology,
the reconstruction of families outside Europe, in the growing use morphology, and syntax may also be used to classify languages
of quantitative methods in modeling and describing language into genetic families. Language classiication must also take ideas
change, and in historical syntax (see syntactic change). Most of language contact into account. Extensive contact between two
of the methods used today were developed through the recon- unrelated languages may over time lead to enough similarities
struction of proto-Indo-European; however, the methods have that it is diicult to tell whether they are related or not. Several
also been successfully applied to other families. Much early his- languages have been misclassiied on this basis. For example,
torical work was done on Finno-Ugric languages (e.g., Sajnovics Armenian was originally classiied as an Indo-Iranian language
[1770] 1968; Gyarmathi 1799), and more recently, there has been rather than as its own branch of Indo-European because of the
considerable progress in the reconstruction of Austronesian number of loans it exhibits.
(Pawley and Ross 1993), Niger-Congo (cf. Hombert and Hyman here are other less widely accepted methods of investigat-
1999), and numerous families in North America (see, for exam- ing linguistic prehistory. One is lexicostatistics, which involves
ple, Campbell 1997). estimating genetic relatedness by comparing the percentage of
While the family tree model has been very inluential in his- vocabulary common to pairs of languages. Underlying the method
torical linguistics, other models of language change are also is the assumption that languages that share more common mate-
used. Perhaps the most commonly cited is the wave theory of rial are likely to be more closely related. Glottochronology uses
J. Schmidt and Jules Guilliron (see Guilliron 1921), who pro- the estimations from lexicostatistics to estimate the time depth
posed that sound changes difuse through the lexicon, gradually of a particular family. Mass comparison (e.g., Greenberg 1987)
afecting more and more instances of phonemes in a given envi- involves using large-scale word lists to reconstruct further back
ronment. Regularity in correspondences is thus epiphenomenal than the strict application of the comparative method allows, by
and only appears once a change is complete. Another common granting more exceptions to the regularity of sound change and
appeal to wave theory is in subgrouping, where it is argued that greater latitude in semantics. In each case, the methods are not
the family tree is not an accurate representation of language split- widely accepted. For example, gross similarity between two
ting. Rather, linguistic diferentiation occurs through the gradual languages may be caused by several factors apart from common
building up of isoglosses and changes afecting individual lexical genetic inheritance, including chance and borrowings, and only
items. detailed reconstruction of correspondences by the comparative
method allows us to choose between them.
Current State of Research
he methods of historical linguistics can be applied to all areas Future Prospects
of language study. Within historical phonology, there has been Currently, our knowledge of the history of diferent language
a great deal of work on types of sound change, the plausibility families in the world is very uneven. Some families includ-
of diferent changes, and the mechanisms by which change is ing Indo-European, Finno-Ugric, and Algonquian have been
spread throughout a speech community. (For discussion and reconstructed in detail. In other cases, we are not even sure
diferent approaches, see Ohala 1993, Blevins 2004, and Labov which languages belong to the family, let alone what the proto-
2001.) he study of morphological change was particu- language looked like.
larly important in the nineteenth century, and the reconstruc- here are still many pressing concerns and active areas of
tion of morphemes and paradigms is still an important area of research. he irst is in the reconstruction of various language
research, as is grammaticalization theory. families. Basic original reconstruction research is needed for
Historical syntax has received rather less attention than histor- much of the world. Secondly, there is an ever-increasing concern
ical morphology or phonology mostly because of the diiculties of with questions about how and why languages change. We have
applying the comparative method to syntactic constructions. Most long moved away from arguments of language change involving
361
Historical Linguistics Historical Reconstruction
sloppy speech or linguistic degeneration by ignorant speakers. Labov, William. 2001. Principles of Linguistic Change.
Instead, research has focused on the relative importance of lan- Oxford: Blackwell.
guage acquisition in language change versus social factors, such Law, Vivian. 2003. he History of Linguistics in Europe: From Plato to
as peer pressure, prestige, and diffusion. Linguistic research is 1600. Cambridge: Cambridge University Press.
Lightfoot, David. 1999. he Development of Language: Acquisition,
also important for the study of prehistory and ancient population
Change and Evolution. Oxford: Blackwell.
movement.
McMahon, April, and Robert McMahon. 2006. Language Classiication by
Not everyone is convinced that the methods discussed in Numbers. Oxford: Oxford University Press.
this entry are generally applicable to all languages and lan- Ohala, John. 1993. he phonetics of sound change. Historical
guage families in the world. As already noted, the family tree Linguistics: Problems and Perspectives, ed. Charles Jones, 23778.
has been the predominant model for 150 years. However, some London: Longman.
have pointed out its reliance on transmission from parents Pallas, Peter Simon. 1786. Linguarum totius orbis vocabularia compara-
to children, that ignores other types of transmission which tiva. St. Petersburg.
can lead to rapid language change, such as creolization (see Pawley, Andrew, and Malcolm Ross. 1993. Austronesian histori-
creoles) and the formation of mixed languages (homason cal linguistics and culture history. Annual Review of Anthropology
and Kaufman 1988). 22: 42559.
Rask, R. 1818. Undersgelse om det gamle nordiske eller islandske sprogs
Finally, historical work increasingly involves computational
oprindelse. Copenhagen.
modeling and the integration of techniques used in computa-
Robins, R. H. 1968. A Short History of Linguistics. Bloomington: Indiana
tional biology. he last 10 years have seen an increasing amount University Press.
of sophisticated statistical analysis and computational modeling Sajnovics, Jnas. [1770] 1968. Demonstratio idioma Ungarorum et
in research (for an overview, see McMahon and McMahon 2006). Lapponum idem esse, ed. homas Sebeok. Bloomington: Indiana
We also see work that aims at estimating time depth and rates of University Press.
language change. It remains to be seen, however, how successful Saussure, Ferdnand de. [1915] 1972. Course in General Linguistics
this work will be. No matter how sophisticated the techniques for [Cours de linguistique general]. Trans. Roy Harris. Peru, IL: Open Court
statistical analysis, any estimates of time depth need also to take Classics.
into account sophisticated theories of language change. At this Scaliger, Joseph Justus. [1599] 1610. Diatriba de Europaeorum linguis. In
Opuscula varia antehac non edita. Paris.
point, we have no idea why languages change and split at difer-
Schleicher, August. 1848. Sprachvergleichende Untersuchungen. / Zur
ent rates, although such difering rates are clearly observable in
vergleichenden Sprachgeschichte. 2 vols. Bonn: H. B. Koenig.
the historical record.
Shapiro, Marianne. 1990. De Vulgari Eloquentia, Dantes Book of Exile.
Claire Bowern Lincoln and London: University of Nebraska Press.
homason, Sarah Grey, and Terrence Kaufman. 1988. Language Contact,
Creolization, and Genetic Linguistics. Berkeley: University of California
WORKS CITED AND SUGGESTIONS FOR FURTHER READING Press.
Blevins, Juliette. 2004. Evolutionary Phonology. Cambridge: Cambridge Trask, R. L. 1996. Historical Linguistics. London: Arnold.
University Press. Turgot, A. [1756] 1961. tymologie. Brugge: De Tempel.
Campbell, Lyle. 1997. American Indian Languages: he Historical
Linguistics of North America. Oxford: Oxford University Press.
HISTORICAL RECONSTRUCTION
. 2004. Historical Linguistics: An Introduction. Cambridge, MA: MIT
Press. Like all complex things in nature, languages change over
Gessner, Konrad. [1555] 1974. Mithridates: de Diferentiis Linguarum time. Historical reconstruction is a process of inference by
tum Veterum tum quae Hodie apud Diversas Nationes in Toto Orbe which changes are undone so as to recover certain aspects
Terrarum in Usu Sunt. Neudruck der Ausgabe Zrich, Aalen: Scientia of historically nonattested linguistic structure and content in
Verlag.
hypothetical form. Although it can be applied to languages
Greenberg, Joseph. 1987. Language in the Americas. Stanford,
for which written documentation is available (e.g., the recon-
CA: Stanford University Press.
struction of proto-Romance next to the extant records of Latin)
Guilliron, Jules. 1921. Pathologie et thrapeutique verbales.
Paris: Champion. or to single languages by means of internal reconstruction, his-
Gyarmathi, Smuel. 1799. Affinitas linguae Hungaricae cum linguis torical reconstruction typically is directed to prehistoric lan-
Fennicae originis gram matice demonstrata. Vocabularia dialec- guages and requires the simultaneous comparison of multiple
torum Tataricarum et Slavicarum cum Hungarica comparata. witnesses.
Gttingen. he set of procedures used in historical reconstruction is
Harris, Alice, and Lyle Campbell. 1995. Historical Syntax in Cross- called the comparative method. After the elimination of
Linguistic Perspective. Cambridge: Cambridge University Press. chance, borrowing, and universals as plausible causes of cross-
Hock, Hans H. 1991. Principles of Historical Linguistics. Berlin: linguistic similarity, it can be shown that the resemblances
Mouton.
between two or more languages must result from a common
Hombert, Jean-Marie, and Larry Hyman. 1999. Bantu Historical
origin (usually called genetic relationship), coupled with diver-
Linguistics: heoretical and Empirical Perspectives. Stanford,
gent descent. In this respect, historical reconstruction in lin-
CA: CSLI.
Jger, Andreas. 1686. De Lingua Vetustissima Europae, Scytho-Celtica et guistics shows striking parallels to the study of the evolutionary
Gothica. Wittinberg. history of natural species, and much of its terminology is con-
ceptually modeled on that of biological taxonomy. Once it has
362
Historical Reconstruction
been determined that languages are genetically related, a more dispersal, or homeland, of a language family (or subgroup
exact picture of their historical connection can be achieved by ancestor) and, hence, gives rise to hypotheses about direction
the reconstruction of a proto-language or hypothetical common of migration that can be tested against the evidence of other
ancestor. scholarly disciplines, such as archaeology or population genet-
Although some progress has been made with other aspects ics. However, it has been recognized since at least the 1870s
of historical reconstruction, it is widely agreed that the that not all processes of linguistic diferentiation are treelike,
comparative method has been successfully applied only in and it is widely accepted that both family tree and wave mod-
phonology . For reconstruction to begin, it is necessary to els accurately describe the process of language split, the former
identify a corpus of cognate morphemes , that is, morphemes under conditions of sharp social separation and the latter under
that have a common historical origin, identified inductively conditions of gradual diferentiation of independent languages
as forms of similar meaning that exhibit recurrent sound from a dialect complex.
correspondences. Distinct sound correspondences that are he reconstruction of the proto-Indo-European case system
found in the same or closely similar environments normally in the irst half of the nineteenth century marked the beginning
must be attributed to different proto-phonemes , and the of work on comparative morphosyntax, but many would argue
inventory of proto-phonemes so inferred forms a hypothesis that since this involves the identiication of cognate aixes, it is
about the sound system of the proto-language of a language a variant of lexical reconstruction. In recent years, greater atten-
family . Since proto-phonemes can only be reconstructed in tion has been paid to problems of reconstruction in other areas
lexical forms, phonological and lexical reconstructions of syntax, such as word order, and in semantics. It is note-
are inextricably bound together. Reconstructed phonemes worthy that the models for such work almost invariably derive
and reconstructed words (proto-forms) are preceded by an from typological approaches to synchonic linguistic structure,
asterisk to indicate their hypothetical status. Some linguists rather than from formal theories of syntax (see synchrony and
take the position that the phonetic substance of such sym- diachrony).
bols is beyond recovery and that proto-phonemes are, there- A proto-language inevitably presents a very incomplete pic-
fore, little more than abstract formulas used to summarize ture of the language that must actually have existed. Nonetheless,
sound correspondences (the formulaic position). The major- the comparative method, which is generally thought to permit
ity view (the realist position) is more sanguine; although the reconstruction of languages up to about 6,000 years old, is a
phonetic nature of some proto-phonemes clearly is contro- powerful tool that allows a variety of inferences about prehis-
versial, many others permit little latitude in interpretation. toric language communities and their cultures. he potential
A related, though distinct, issue concerns the structure of use of linguistic reconstruction for inferences about culture his-
reconstructed phonological systems, since some of these tory was recognized in the second half of the nineteenth century
have violated implicational universals in typology . and labeled linguistic palaeontoloy by Ferdinand de Saussure.
Where this occurs, most historical linguists today would However, little use was made of this potential until relatively
question the validity of the reconstruction, using the typo- recent times. Since roughly the 1970s there has been increasingly
logical generalizations of the present as a guide to inferences fruitful interdisciplinary cooperation, especially between histor-
about the past. ical linguists and archaeologists, in exploring Holocene human
Once lexical reconstructions are available, it becomes pos- prehistory. his has led to a renewed inquiry into the antiquity of
sible to determine sound changes in a large number of daugh- the Indo-European settlement of Europe and has been a power-
ter languages. No topic in linguistics has a longer history than ful force in understanding the prehistoric human settlement of
the study of sound change, which commenced during the irst the Paciic.
quarter of the nineteenth century with the pioneering studies of
Robert Blust
Rasmus Christian Rask and Jacob Grimm. A major point of con-
troversy in the study of sound change is the issue of regularity.
It is now generally agreed that the strict Neogrammarian posi- WORKS CITED AND SUGGESTIONS FOR FURTHER READING
tion, which ruled out the possibility of unconditioned phone- Blust, Robert. 1987. Lexical reconstruction and semantic recon-
mic splits, is overly restrictive. A second issue, which is yet to be struction: he case of Austronesian house words. Diachronica
resolved, is whether all sound change is phonetically (or phono- 4: 79106.
logically) motivated. Campbell, Lyle. 1999. Historical Linguistics: An Introduction. Cambridge,
An examination of sound changes leads not only to theoret- MA: MIT Press.
ical models of how (and why) this process occurs but also to Harris, Alice C., and Lyle Campbell. 1995. Historical Syntax in Cross-
evidence for subrelationship within a language family. Sound Linguistic Perspective. Cambridge: Cambridge University Press.
Kiparsky, Paul. 1995. he phonological basis of sound change. In he
changes that are exclusively shared (exclusively shared inno-
Handbook of Phonological heory, ed. John A. Goldsmith, 64070.
vations) form the basis for linguistic subgroups. Subgrouping
Oxford: Blackwell.
allows linguists to move beyond the mere recognition of a
Renfrew, Colin. 1998. Archaeology and Language: he Puzzle of Indo-
language family as an internally undiferentiated collection European Origins. London: Cape.
of related languages to the reconstruction of a family tree that Ross, Malcolm, Andrew Pawley, and Meredith Osmond, eds. 1998.
deines the historical order of splits of major and minor groups The Lexicon of Proto Oceanic: The Culture and Environment
of languages within the family. he structure of a family tree, of Ancestral ceanic Society. Vol. 1. Material Culture. Pacific
in turn, supports inferences about the most likely center of Linguistics C-152. Canberra: Department of Linguistics, Research
363
Holophrastic Stage, The
School of Pacific and Asian Studies, The Australian National Theoretical Approaches
University. At the onset, it is important to understand the possible
. 2003. he Lexicon of Proto Oceanic: he Culture and Environment
claims about grammatical knowledge during the holophras-
of Ancestral Oceanic Society. Vol. 2. he Physical Environment. Paciic
tic period. The most conservative claim would be, basically,
Linguistics 545. Canberra: Department of Linguistics, Research
what you see is what you get, that is, that children only pro-
School of Paciic and Asian Studies, he Australian National
University. duce single words because that is all they know at that point.
This approach has come to be known as a lean interpretation
of childrens knowledge. At the other extreme, one can claim
HOLOPHRASTIC STAGE, THE that childrens grammatical knowledge is greater than what is
reflected in single-word productions. This rich interpretation
Despite extensive individual variation, children on average is based on the fact that language acquisition is rapid and that
begin to produce words around the end of the irst year of life. children must have innate language learning mechanisms
Initially, there is period of slow word learning of several months, that enable them to determine the grammatical character-
when the rate of learning is usually not much more than eight or istics of their language at a very early age (see innateness
so words a month. Toward the end of this period of slow word and innatism ).
learning, two changes take place. First, word learning begins to hese two positions can be demonstrated by again looking at
increase, such that the rate eventually increases to between one a childs production of the words doggie and eat. A lean interpre-
and two new words a day. Second, children begin to combine tation would be that the child has some initial semantic cat-
words to form their irst sentences (see two-word stage ). egorization of word meaning, such that doggie represents
his period of word acquisition, demarked at the start by the an emerging category of animate objects, and eat represents
irst words and at the end by the onset of word combinations, an emerging category of actions. A rich interpretation would
is commonly referred to as the holophrastic stage of language propose that innate principles enable the child to establish
acquisition. explicit grammatical knowledge from these semantic categories.
An alternative would be to refer to this period as the one- One principle would be that languages universally categorize
word stage. his option would be descriptively adequate, in things as nouns and actions as verbs. Another principle would
that it captures the fact that childrens productions are limited be that categories like noun and verb are heads of larger units
to a single word. Lois Bloom (1973), in fact, in an extensive or phrases, that is, noun phrases (NPs) and verb phrases (VPs).
study of her daughters irst words, made this choice, entitling Another principle would be that NPs and VPs are semantically
her book One Word at a Time. he term holophrastic, how- connected through semantic relations such as agent and action.
ever, is making a somewhat diferent claim about childrens Seeing the doggie eating would lead to the establishment of the
language acquisition when they are producing single words. sentence category. his process of building grammatical knowl-
Holophrastic can be deined as a single word expressing the edge from semantics has been called semantic bootstrapping
ideas of a phrase or sentence. he term more explicitly states (Pinker 1984).
that something more than single word production is taking How, then, does one decide between these two positions? he
place. When adults produce single words, such as doggie or answer depends on the importance placed on supporting chil-
eat, it is usually with the intent to express the meaning of the drens grammatical advances with observable changes in their
individual word. When a child around age one says doggie linguistic behavior. he rich interpretation deals with grammati-
(or probably a little later eat), however, he or she combines cal development as a logical problem. Since the complexities of
the meaning of the individual words with the communica- language are acquired so rapidly, it seems reasonable to assume
tive intention of the utterance, intending a broader mean- that linguistic principles must be at work at the very onset of
ing, such as there is a doggie, and I can see it or there is an language acquisition. Researchers who study childrens actual
apple, and I want to eat it. comprehension and production of language, however, set an
he decision to call the period of slow word learning the one- additional requirement. Claims about childrens abilities must
word stage versus the holophrastic stage is more than that of be supported in the way children are comprehending and pro-
personal taste. he former choice relects a more conservative ducing language. he remainder of the present entry discusses
view of the childs grammatical knowledge at this point is lan- the evidence for grammatical knowledge during the holophras-
guage acquisition. he latter choice, on the other hand, repre- tic stage, based on studies of childrens comprehension and
sents the position that childrens knowledge of grammar may production.
be greater than that directly seen in single-word productions.
he present entry provides an overview of these two viewpoints Language Comprehension
and discusses the nature of childrens emerging grammatical Studying the language comprehension of one-year-olds can be
knowledge during the time in which they produce one-word diicult since they are not capable of responding to the tasks typ-
utterances. It suggests that grammatical development begins ically used for older children and adults. Researchers, however,
during this period, as relected in childrens understanding of have come up with several clever ways to get at least a general
sentences and in some of the patterns of their single-word pro- idea of childrens understanding at this age. If childrens knowl-
ductions, particularly as they near the point of making word edge of language is limited to single words, then one would pre-
combinations. dict that they should do as well or better responding to utterances
364
Holophrastic Stage, The
of one word than to those of multiple words. he results of sev- understand the nature of multiword utterances. hey can pro-
eral studies, however, have shown that children can respond to cess relations between at least two words in some sentences and
multiword utterances and that they have an awareness that the are aware that sentences contain both stressed lexical words
words are related in a way suggestive of the relations between and unstressed words. hey may not be aware of the nature of
words in a sentence. the latter words, but they know that sentences require them to
E. Shipley, C. Smith, and L. Gleitman (1969) examined be well formed.
young childrens responses to commands directed toward
them by their mothers while the children played. he com- Language Production
mands were a single word, for example, ball!, two words, he claim that preliminary knowledge of grammar takes place
for example, throw ball!, and well-formed commands, for during the holophrastic stage would be strengthened if evidence
example, throw me the ball! he results showed that children could also be found in childrens spoken language. At irst glance,
around the end of the holophrastic period and beginning of this would seem impossible since holophrastic children are only
word combinations actually responded most often when they producing a single word at a time. here are, however, aspects
heard the well-formed commands. While not directly show- of childrens spoken language at this stage that, taken together,
ing that the children understood the well-formed commands, indicate emerging grammatical knowledge as well.
the results indicated that children were aware that such com- It is well known that holophrastic children are not very intel-
mands met the characteristics of English sentences, while the ligible, the result of the fact that they are limited in their pho-
other two do not. netic skills and are, in most cases, mixing their single-word
Other studies have examined more directly whether very productions with babbling . A. Peters (1983) also pointed out
young children can diferentiate multiword utterances on the that some children are not exclusively single-word producers.
basis of the speciic words in them. One potential problem in He identiied children who do not just attempt single words
testing children on this aspect is that they may give the appear- but attempt to imitate and repeat longer sentences. hese lon-
ance of understanding a multiword utterance when they are only ger utterances are often hard to interpret and may be identi-
doing what they typically would do. For example, a child who ied in many cases as some form of jargon, that is, attempts
throws a ball when told throw the ball may do so just because to produce sentence-length productions without meaning.
that is what children typically do with a ball. To avoid this prob- Peters found, however, that some of these jargon productions
lem, J. Sachs and L. Truswell (1978) used novel combinations may be meaningful, though the meaning may be missed by
that were not likely to be part of childrens experiences. Test sen- their parents. hese productions do not represent evidence
tences included unusual commands, such as smell truck and that holophrastic children know grammar, but they support
kiss truck. he results indicated that young children were able the previously stated results on well-formed commands, that
to respond correctly to such novel commands, suggesting that is, that the children know that sentences consist of more than
they were aware of at least two-word relations in the sentences single words.
they heard. Other research by J. Miller and colleagues (1980) has Peters (1983) and others also drew attention to the fact that
examined the range of sentence types that are understood by even single-word productions are not exclusively single words.
children during the holophrastic stage. he results indicate that With the advent of advanced tape recorders, researchers found
this range is limited. Children did best on responding to sen- in their phonetic transcriptions that words often were pre-
tences that communicated an actionobject relation (e.g., kiss ceded by brief phonetic material that was often diicult to hear
the shoe) and to those that communicated a possessor to pos- or interpret. For example, a child who was saying book, was
sessed relation (e.g., mamas shoe). hey did less well on other actually saying something like uh book or uhm book. hese
relations, such as agent action (e.g., make the horsey kiss). brief phonetic instances have been called several terms, such as
hese studies examined relations between lexical words in iller syllables, phonetically consistent forms, and presyntac-
sentences. A further question would be whether or not holo- tic devices. he last term relects the opinion of many research-
phrastic children are also becoming aware that sentences ers that these iller syllables are not yet syntactic units, such
contain smaller functional words as well, such as articles and as articles or auxiliaries, but they are evidence that children
auxiliaries. he fact that young children preferred the well- are taking notice that they exist and noting their distributional
formed commands suggests that this may be the case. N. Katz, characteristics.
E. Baker, and J. Macnamara (1974) explored this issue by show- A further characteristic of holophrastic speech is that sin-
ing children pictures that contained either a single instance of gle words are produced in sequences. Bloom (1973) exam-
a nonsense igure (e.g., an odd-shaped form called zav) or a ined the successive single-word utterances of her daughter to
picture of more than one instance. he children were then asked see if these sequences relected later word combinations. For
either show me zav or show me the zav. It was found that the example, does a child who says eat cookie when she begins
children tended to indicate the picture of the single instance in word combinations show earlier single-word sequences, such
the former case and the picture of the multiple instances in the as eat, cookie, during the holophrastic stage? Such cases
latter case. he children in this study were a bit older than holo- would provide evidence of grammatical knowledge during
phrastic children, but they were not yet using articles in their the holophrastic stage. Bloom found that her daughters early
spontaneous speech. sequences did not show these relations, in that each word had
In summary, a variety of studies on children in or around its own context, that is, distinctly associated action. Later, how-
the holophrastic stage indicate that they are beginning to ever, toward the end of the holophrastic period, sequences on
365
Homologies and Transformation Sets
single words with a shared event began to emerge. For exam- only by locating them within complexes of relations or structures.
ple, one sequence of words involving up, neck, zip was One simple and inluential technique of structuralist analysis,
all produced in the context of her daughter wanting her mother developed by Claude Lvi-Strauss, is the isolation of homologies.
to zip up her coat. A homology is a simple binary opposition mapped onto another
he last piece of evidence to suggest that early words are holo- binary opposition as its structural equivalent. One model here is
phrastic comes from returning to the original sources of the term. phonology, where, for example, voiced/unvoiced pairs (e.g.,
Early diary studies many years ago by parents on their childrens b/p and d/t) may be understood as structurally equivalent with
language learning recorded observations that their children respect to voicing. Lvi-Strauss takes homologies to manifest
used single words with difering communicative contexts. One of important structural relations across a range of higher linguistic
the most famous was the study by W. Leopold (193949) on his levels, including, for example, narrative. hus, we might say that
daughter, Hildegard. Leopold noted that Hildegard used her sin- in Shakespeares Hamlet, Hamlet is to Laertes as Claudius is to
gle words to express distinct communicative intents. Some utter- Hamlet.
ances were intended to show her noticing something, some to In a way, this particular homology is self-evident. But why
make a request for the parent to perform some action, and others does it work? Hamlet killed Laertes father, just as Claudius killed
to demand something such as a toy out of reach. hese diferent Hamlets father. Moreover, Hamlet was in love with Laertes sis-
functions were distinguished by variations in the prosody of the ter, just as Claudius was in love with Hamlets mother. Finally,
word and by the childs gestures. Hamlet was partially responsible for Laertes losing his sister,
as Claudius was responsible for Hamlet being, to some degree,
Summary separated from his mother.
Children go through a period of language acquisition of six As this suggests, homologies operate through larger com-
months or so during the second year of life when they produce plexes of relations. In his four-volume Introduction to the Science
single words at a time. his time of acquisition has been referred of Mythology (see, for example, Lvi-Strauss 1969), Lvi-Strauss
to as the holophrastic stage. he reason is that the single words systematically explored structural analysis beyond homologies
used often communicate the idea of a sentence, that is, the through the concept of transformation sets. A transformation set
meaning of the word expressed and the communicative intent is a series of multiplace structures that map onto one another.
of the utterance. he term is suggestive that preliminary gram- he mapping is deined by transformation rules, which are trig-
matical acquisition is taking place during this period. Research gered by speciiable conditions, often cultural conditions. For
on childrens comprehension and production during this stage example, there may be a transformational relation between the
suggests that this may be the case. myths of two groups say, agriculturalists and ishers such that
when the myths of one group refer to earth, the parallel myths of
David Ingram
the other group refer to water.
WORKS CITED AND SUGGESTIONS FOR FURTHER READING One model here is morphology. For instance, in English,
the plural morpheme is pronounced s after unvoiced nonsi-
Bloom, L. 1973. One Word at a Time. he Hague: Mouton. bilants (as in cats), z after voiced nonsibilants (as in dogs),
Ingram, D. 1989. First Language Acquisition. Cambridge: Cambridge
and z after sibilants (as in bushes). S, z, and z form a trans-
University Press.
formation set, and the contextual trigger deining the transfor-
Katz, N., E. Baker, and J. Macnamara. 1974. Whats in a name? A study of
how children learn common and proper names. Child Development
mation is phonological.
45: 46973. Clearly, the situation is more complex and the analysis less
Leopold, W. 193949. Speech Development of a Bilingual Child: A straightforward with higher-level structures, such as literary
Linguists Record. 4 vols. Evanston, IL: Northwestern University works. Consider Hamlet. In addition to the structures already iso-
Press. lated, Hamlet and Ophelia may be mapped onto each other in
Miller, J., R. Chapman, M. Bronston, and J. Reichle. 1980. Language losing their fathers due to an older relative/older lover, in feigning
comprehension in sensorimotor stages V and VI. Journal of Speech madness/going mad, in contemplating suicide/committing sui-
and Hearing Research 23: 284311. cide, and so on. he establishment of such a transformation set
Peters, A. 1983. he Units of Language Acquisition. Cambridge: Cambridge raises intriguing questions. For example, are the Hamlet/Ophelia
University Press.
structures diferentiated by a simple gender context (i.e., male
Pinker, S. 1984. Language Learnability and Language Acquisition.
vs. female)? If so, how is this related to the mapping of Hamlets
Sachs, J., and L. Truswell. 1978. Comprehension of two-word instruc-
mother onto Laertes sister in the Hamlet/Laertes transformation
tions by children in the one-word stage. Journal of Child Language set? Lvi-Straussian analysis allows us to recognize such complex
5: 1724. patterns and, perhaps, begin to understand them as well.
Shipley, E., C. Smith, and L. Gleitman. 1969. A study in the acquisition of
Patrick Colm Hogan
language: Free responses to commands. Language 45: 32242.
WORK CITED
HOMOLOGIES AND TRANSFORMATION SETS
Lvi-Strauss, Claude. 1969. he Raw and the Cooked. Trans. John and
A fundamental principle of structuralism is that one cannot Doreen Weightman. New York: Harper. his is the irst volume of
understand elements in isolation. One can understand elements Introduction to the Science of Mythology.
366
Icon, Index, and Symbol
I association of general ideas, which operates to cause the sym-

bol to be interpreted as referring to its object (1935a, 249). here
are numerous examples of symbols, from the various words and
utterances in human languages to such things as road signs.
ICON, INDEX, AND SYMBOL What is crucial in the case of symbols is that there exists some
he nineteenth-century American philosopher C. S. Peirce devel- underlying convention, agreement, habit, or law that means
oped extensive sign theories in order to explain reference, that invoking some symbol invokes its associated object. For
meaning, communication, and cognition. One of the central instance, a red traic lights being symbolic of a lack of priority
and most innovative features of his theories was the icon, index, at a road junction works because we have all agreed (by habit, by
symbol classiication of signs. convention, and by imposing traic regulations) to use red traic
A crucial aspect of understanding Peirces icon, index, sym- lights this way.
bol division is his account of sign structure. According to Peirce, hroughout his life, Peirce made numerous alterations to his
any instance of signiication consists of three interrelated parts: a account of signs (see, for instance, Short 2004), but the broad
sign, an object, and an interpretant. For the sake of simplicity, division among icons, indices, and symbols tends to ind a place
we can think of the sign as the signiier, for example, a written throughout. here are, of course, some subtleties to Peirces
word or an animals footprint. he object, on the other hand, account. For instance, it is not clear that there are very many
is whatever is signiied, for example, the object denoted by the examples of signs that are purely iconic, indexical, or symbolic
written word or the animal that left the print. he interpretant is that is, which do not overlap with one or both of the other ele-
the understanding or interpretation that the sign/object relation ments of the trichotomy. As an example, take a painted portrait
generates, for example, that the word or utterance is meant to as a sign of the person it depicts. his sign is an icon in that it sig-
refer to its object or that the animal track signiies the presence niies its object in virtue of the qualities it shares with that object
of the animal that made it. he importance of the interpretant the skin and hair color of the depicted person are replicated in
for Peirce is that signiication is not a simple dyadic relationship the painting. But, of course, many of the things that make a por-
between sign and object: A sign signiies an object only if it can trait a successful depiction of its sitter are due to particular con-
be interpreted as such. ventions governing paintings and how particular blocks of color
With this structure in mind, Peirce was interested in classi- in two dimensions can stand for some subject. his seems to
fying the various ways in which the sign/object relation might make the painting look as though it has symbolic elements, too.
generate an interpretant. In particular, he thought that a sign Similar considerations hold for indices such as barometers
might come to signify its object, and so generate an interpretant, although such signs indicate their objects in virtue of a causal
in three possible ways. First, a sign may be understood as signify- and physical connection with their object; conventions about
ing in virtue of similarities or shared qualities between it and its how we should interpret this physical connection also seem to
object. As Peirce says, I call a sign which stands for something play a part in signiication. Whats more, there are clear instances
merely because it resembles it an icon (1935b, 362). His own of symbols that have some iconic element. Obvious examples
preferred examples of icons are portraits or mathematical dia- might include forms of writing, such as Chinese, that involve
grams indeed, he thought icons were especially important to pictograms, at least partially. Even onomatopoeic words such as
mathematical thought. However, we can also include examples cuckoo present clear cases of symbols with a strong iconic ele-
such as color swatches, sculptures, and so on. What is central to ment the phonic qualities of the object are aped by the phonic
iconic signiication is that the qualities of the sign are also quali- qualities of the word.
ties of the signiied object and that this sharing of qualities is cru- Peirce was aware of the various overlaps among icons, indi-
cial in enabling the sign to signify. ces, and symbols, and at some point proposed to call icons and
he second way in which a sign might be understood as signi- indices with symbolic elements hypo-icons and subindices as
fying is in virtue of some physical or causal connection between a way of acknowledging this. However, in any case where more
it and its object. Such a sign is an index. Peirces own description than one of the three elements is present, one will be most
of an index is as a sign which refers to the object that it denotes prominent. Consequently, we can think of Peirces trichotomy
by virtue of being really efected by that object (1935a, 248). as dividing signs according to whether they are predominantly
Again, there are numerous and wide-ranging examples, includ- iconic, indexical, or symbolic.
ing demonstratives and indexical expressions, weather vanes, he main inluence of Peirces division is in semiotics,
barometers, fever as a sign of an underlying illness, or smoke where his work is considered foundational. However, the icon,
as a sign of ire. What is crucial to indices is that the object has index, symbol distinction has had some inluence in philoso-
a causal efect upon the sign (as in the case of ire causing the phy, particularly through the work of Arthur Burks (1949), and
smoke that indicates it) or has some spatio-temporal proximity has even been used in such diverse areas as literary theory (see,
to its sign, which can be used to aid an interpreter of the sign for example, Sherif 1989), ilm theory (see, for example, Wollen
to grasp that object (as in the case of pointing to some nearby 1969; see also film and language), and musicology (see Turino
object). 1999; see also music, language and). he use and relevance of
he third way in which a sign might be understood as signify- this distinction to linguistics are similarly diverse, but it features
ing is in virtue of some convention or law that connects it to its most prominently in analyses of the relation between animal
object. Peirces own description of a symbol is as a sign which communication and human language and in some expla-
refers to the object that it denotes by virtue of a law, usually an nations of the evolution of language.
367
Icon, Index, and Symbol Ideal Speech Situation
In explaining animal communication, the distinction is espe- allows for the symbolic communication typical of human lan-
cially useful since it allows us to classify various cases of ani- guage is never attained, and vervet monkey calls and chimpan-
mal language without treating all such instances as uniform. zee symbol manipulation never rise above the level of indexical
Consequently, a diverse range of animal camoulage or cases communication.
of mimicry can be classiied as iconic instances of communica-
Albert Atkin
tion. For example, the harmless milk snakes mimicking of the
poisonous coral snakes red, black, and yellow coloring in order
to avoid predation is easily explained as an instance of iconic
communication these colors mean poisonous! As for indexical Brinck, Ingar, and Peter Grdenfors. 2003. Co-operation and communi-
communication, a well-discussed case is vervet monkey warning cation in apes and humans. Mind and Language 18: 484501.
calls (see Seyfarth, Cheney, and Marler 1980; see also primate Burks, Arthur. 1949. Icon, index and symbol. Philosophy and
Phenomenological Research 9: 67389.
vocalizations). In such an example, the calls are classiiable
Deacon, Terence. 1997. he Symbolic Species: he Co-evolution of
as indexical since they rely upon a causal and physical connec-
Language and the Human Brain. New York: Norton.
tion with particular predators in order to refer the calls are made Liszka, James Jacob. 1996. A General Introduction to the Semiotic of
in response to the snakes, eagles, or leopards whose presence is Charles S. Peirce. Bloomington: Indiana University Press.
perceived. And this is all in contrast to human language, which Peirce, C. S. 1935a. he Collected Papers of Charles S. Peirce. Vol 2.
is predominantly symbolic and can enable communication even Cambridge: Harvard University Press.
if the objects referred to are not present. Ingar Brinck and Peter . 1935b. he Collected Papers of Charles S. Peirce. Vol. 3.
Grdenfors (2003) make compelling use of the icon, index, sym- Cambridge: Harvard University Press.
bol trichotomy in explaining animal communication where they Seyfarth, R. M., D. L. Cheney, and P. Marler. 1980. Monkey responses
discuss the role of such communication in cooperation. to three diferent alarm calls: Evidence for predator classiication and
he most prominent use and interesting development of semantic communication. Science 210: 8013.
Sherif, John K. 1989. he Fate of Meaning: Charles Peirce Structuralism
Peirces icon, index, symbol trichotomy is Terence Deacons
and Literature. Princeton, NJ: Princeton University Press.
(1997) account of the coevolution of human language and
Short, homas. 2004. he development of Peirces theory of signs.
brains. According to that account, language evolution is to be In he Cambridge Companion to Peirce, ed. Cheryl Misak, 21440.
explained by seeing iconic, indexical, and symbolic communi- Cambridge: Cambridge University Press.
cation and reference as related to one another in a hierarchy. Turino, homas. 1999. Signs of imagination, identity, and experience: A
What this means is that in order to master symbolic communi- Peircian semiotic theory for music. Ethnomusicology 43: 22155.
cation, we must irst master indexical communication. And in Wollen, Peter. 1969. Signs and Meaning in he Cinema. London: Secker
order to master indexical communication, we must irst master and Warburg/British Film Institute.
iconic communication. For instance, a predators inability to
distinguish the milk snakes coloring from that of a coral snake
IDEAL SPEECH SITUATION
is suggestive of iconic reference it is manifest in the preda-
tors inability to distinguish one type of snake from the other. his term was coined by the German social theorist and philoso-
However, this iconic communication needs to be in place in pher Jrgen Habermas to refer to the conditions necessary for free
order for the predator to take the coloring of those snakes as and transparent communication and discussion. he concept of
an indexical signiier of the poisonous status of the snake red, ideal speech situation plays a key part in his early formulations of
yellow, and black banding are an index of a venomous snake. a theory of communicative action and of universal prag-
Other instances of indexical reference work in just this way. It is matics (Habermas 1979, 168; 1984; 1987). In his later writings,
because the vervet monkey sees the eagle above as being quali- the term has tended to be replaced by Karl-Otto Apels notion
tatively similar to previously experienced eagles (that is, as an of an unrestricted communication community (Apel 1980;
icon of a recognized predator) that it is able to produce a warn- Habermas 1990, 88).
ing cry (an indexical reference) when that predator is present. An ideal speech situation may be understood as the condi-
Symbolic reference requires the presence of indexicals but also tions that would allow for open discussion between free and
requires that the indexical relationship between words/sounds equal participants, who strive to come to an agreement upon
and their objects has become ingrained, habitual, and appropri- any topic purely through the force of better argument. hus, the
ately interconnected with other symbols so that reference and participants enter a discussion assuming that their ideas may be
communication are maintained even if the stimulus to indexical challenged by any other participant, but that only those ideas
reference is lost or removed. and arguments that are rationally formulated and supported
Once this symbolic threshold is achieved, complex relation- by relevant and persuasive evidence will survive interrogation.
ships between words develop, allowing words to signify other he personality, status, power, or rhetorical abilities of the per-
words and explain the relationships that exist among them. Such son holding the idea will be rendered irrelevant in the course of
a model is useful for explaining various diferences between debate.
cases like vervet monkey warning calls, captive chimpanzee he idea of an ideal speech situation has its origins in the work
symbol manipulation, and human language learning in the of the American pragmatist philosopher Charles Sanders Peirce.
two former cases, the connection between sign and object is lost In his philosophy of science, Peirce proposed the notion of an ideal
when the object is absent for sustained periods. Consequently, community of scientists. He recognized that scientiic research is
the habituation and interconnectedness of indexical signs that a necessarily communal enterprise. Typically, scientists work in
368
Ideal Speech Situation Identity, Language and
teams, but even if they work in individual isolation, they will still manipulation of one another, Habermas argues, they all presup-
be required to submit their research results to a process of peer pose, with a rather studied navet, that the other participants
review. Scientiic hypotheses are thus formulated, reined, and are telling the truth and being sincere in their participation in
inally accepted as (provisionally) true only through a process a the conversation, and that they have the right to speak and act
collective debate, criticism, and defense. Peirce was aware that as they do. In practice, these assumptions can quickly be over-
in practice, scientiic debate falls well short of any ideal process turned. However, Habermass point is that a person could not
of rational scrutiny. Imperfections will occur in part due to prac- enter into a conversation assuming that the other participants
tical limitations. Certain evidence may be unavailable due, for were lying or systematically trying to deceive. Every utterance
example, to lack of suiciently reined experimental technology would be treated with suspicion, and ultimately no ixed mean-
to test a hypothesis rigorously. More importantly for Habermass ing could be attributed to it. While some form of social interac-
use of Peirce, however, is the distorting role that hierarchies of tion might continue, it would not be a true conversation or what
status and power may play within the scientiic community. he Habermas understands as communicative action. hat is to say,
opinions of certain igures within the scientiic community will the participants would not be seeking to reach a mutual agree-
carry more weight than those of others. he opinions of a senior ment (constrained only by the force of better argument). Rather,
researcher trump those of a laboratory assistant. Peirces con- each would be trying to manipulate the other (in what Habermas
cern is that open and rational debate is then being compromised calls strategic action; 1982, 266).
by hierarchies of power and status within the scientiic commu-
Andrew Edgar
nity. he senior researchers opinions count not because of his or
her greater rationality or insight but simply because he or she is
in a position of power. Junior scientists may feel unable to raise
their criticisms in debate or may believe that their opinions have Apel, Karl-Otto. 1980. Towards a Transformation of Philosophy.
no place in the debate. Peirces notion of an ideal community of London: Routledge and Kegan Paul.
scientists can then be understood, in potential at least, as a crit- Habermas, Jrgen. 1970. On systematically distorted communication.
Inquiry 13: 36075.
ical tool that draws attention to the deiciencies of real scientiic
. 1979. Communication and the Evolution of Society. Trans. homas
communication.
McCarthy. Boston: Beacon.
Habermas may, therefore, be seen to use the notion of the . 1982. A reply to my critics. In Habermas: Critical Debates, ed.
ideal speech situation, particularly in his early formulations of John B. homas and David Held, 21983. London: Macmillan.
it, in a similarly critical manner. It encapsulates a perhaps unre- . 1984. he heory of Communicative Action. Vol 1. Reason
alizable standard against which actual communication and dis- and the Rationalisation of Society. Trans. homas McCarthy.
cussion can be measured. Real communication will be distorted, Cambridge: Polity.
perhaps because of a lack of relevant information, perhaps . 1987. he heory of Communicative Action. Vol. 2. Life World and
because of deiciencies in the participants ability to recognize System: A Critique of Functionalist Reason. Trans. homas McCarthy.
good argument and evidence, but also, crucially, because some Cambridge: Polity.
will exercise power over the discussion. Power can be used to . 1990. Moral Consciousness and Communicative Action. Trans.
Christian Lenhardt and Shierry Weber Nicholsen. Cambridge, MA: MIT
introduce topics, to silence certain forms of criticism or sup-
Press.
press evidence, and to silence certain potential contributors.
Power can be exercised openly through, for example, threats and
intimidation. More subtly, it may be exercised through rhetorical
IDENTITY, LANGUAGE AND
means so that the other participants fail to recognize that weak
arguments have been given or that relevant evidence has not Linguistic Identity
been presented. Perhaps most signiicantly, Habermas argues Our identities who we are are bound up with how we speak,
that power diferentials may be so ingrained in a culture that write, and sign. Whether or not we intend it or are even aware of
participants take their inferiority or superiority for granted and, it, other people interpret clues from our use of language in order
as such, do not notice the inluence of power on discussion. For to assign to us identity categories of all sorts, including gender,
example, in a patriarchal society, women will typically have less race, ethnicity, nationality, the region, or even the precise local-
opportunity to raise topics in conversation or to challenge and ity we come from, age or generation, sexual orientation, religion,
interrupt other (male) speakers (see gender and language). level of education, and that vague complex of factors that bundle
Such implicit and unacknowledged power is characterized as together as class. All this is in addition to and part and parcel
systematically distorted communication (Habermas 1970). with the decisions they make about our intelligence, likeability,
he images of an ideal community of scientists and an ideal and trustworthiness and whether or not to believe what we are
speech situation have a utopian ring about them, suggesting per- telling them.
fect societies at the end of human history. Habermas is keen to Although self-identity has long been given a privileged role,
reject such utopian interpretations of the ideal speech situation the identities we construct for ourselves and others are not difer-
(Habermas 1982, 261f). his becomes clearer in his later formu- ent in kind, only in the status we accord to them. he gap between
lations of the argument. he ideal speech situation is under- the identity of an individual and of a group seems most like a true
stood as a counterfactual assumption made by all participants diference of kind, with group identities more abstract than indi-
in conversation and discussion. Upon entering a conversation vidual ones. Brazilianness, after all, does not exist separately
where the participants strive for mutual agreement rather than from the Brazilians who possess it, except as an abstract concept.
369
Identity, Language and
Yet combinations of such abstractions are what our individual statistically based examination of social networks to more inter-
identities are made up of, and group identity frequently inds pretative examination of communities of practice, deined
its most concrete manifestation in a single, symbolic individual. as an aggregate of people who come together around mutual
Group identities nurture our individual sense of who we are but engagement in an endeavor (Eckert and McConnell-Ginet
can also smother it. 1992, 464). In the course of this endeavor, shared beliefs, norms,
Recent work on the evolution of language has suggested that it and ideologies emerge, including, though not limited to, linguis-
came about to fulill something more than the two purposes tra- tic and communicative behavior. his line of research is thus
ditionally ascribed to it, communication and representation. continuous with another one that has focused more directly
Language also exists for the purpose of reading other people, in on the normative beliefs or ideologies by which national and
order to discriminate useful allies from potential competitors. other group identities are maintained (see Verschueren 1999;
sociolinguistic inquiry into identity and language is con- Blommaert 1999; Kroskrity 2000).
cerned with the way people read each other, in two senses. First, Other features of recent work on language and identity
how are the meanings of utterances interpreted, not just follow- include the view that identity is something constructed, rather
ing idealized word senses and rules of syntax as recorded in than essential, and performed, rather than possessed features
dictionaries and grammars, but in the context of who is address- that the term identity itself tends to mask, suggesting as it does
ing whom in what situation? Secondly, how are speakers them- something singular, objective, and reiied. Each of us performs
selves read, in the sense of the social and personal identities their a repertoire of identities that are constantly shifting and that we
listeners construct for them based on what they say and how they negotiate and renegotiate according to the circumstances.
say it? his is a complex process because speakers output is usu-
ally shaped in part by how they have read their listeners and their
Co-constructing National Identity and Language
expectations. Every day, each of us repeatedly undertakes this
Within these repertoires, any particular identity can become the
process of constructing our reading of the people we encounter,
salient one in a given context. None inherently matters more
in person, on the telephone, on the radio or the screen, or in writ-
than the rest. However, national identity requires a separate
ing, including on the Internet, on the basis of their language, at
discussion because of its unique impact on views about what
least in part and in some of the media just mentioned, on the
a language is. Modern nationalism has been grounded in a
basis of that alone.
belief that the best proof of a peoples historical authenticity and
right to self-determination is the possession of a language that
Targeting Identity in the Analysis of Language is uniquely theirs. Hence, one of the irst obstacles to be over-
Modern linguistics has moved slowly but steadily toward embrac-
come in establishing a national identity is the nonexistence of a
ing the identity function as central to language. he impediment
national language.
has been the dominance of the traditional outlook that takes
he nation-state myth that basic view of the world as con-
representation alone to be essential, with even communication
sisting naturally of autonomous states, each corresponding to an
relegated to a secondary place. Although signiicant develop-
ethnically uniied nation assumes that national languages are
ments within linguistics (surveyed in Joseph 2004) pushed it in
a primordial reality. Dantes treatise De vulgari eloquentia (ca.
the direction of attending to identity over the course of the twen-
1306) lays out the process by which he claimed to discover, not
tieth century, a crucial prompting came from social psychology,
invent, the national language of a nation, Italy, that would take
where one approach in particular needs to be singled out: social
ive and a half centuries to emerge politically. his all seems a
identity theory, developed in the early 1970s by Henri Tajfel (see
iction, a pretense of discovery in what will actually be Dantes
ethnolinguistic identity). his approach was novel in not
invention of an illustrious vernacular which will, in turn, cam-
being concerned with power but in the status we give ourselves
oulage how much of it is actually based on his native Tuscan.
as members of in-groups and out-groups. his would come
But Dantes volgare illustre became the template upon which
into even greater prominence in the self-categorization theory
other modern European standard languages were modeled.
that developed as an extension of the original model, notably in
Once the national languages existed, their invention was
the work of Tajfels collaborator J. C. Turner (see Turner et al.
promptly forgotten. he people for whom they represented
1987).
national unity inevitably came to imagine that the language had
Partly under the inluence of such work, many sociolin-
always been there and that such dialectal diference that existed
guists reoriented their object of investigation. L. Milroy (1980)
within it was the product of recent fragmentation when, in fact, it
reported data from studies she conducted in Belfast showing
had preceded the uniication by which the national language was
that the social class of an individual did not appear to be the
forged. By the early nineteenth century, this nationalist mythol-
key variable allowing one to make predictions about the forms
ogy would lead to Romantic theorizations of national political
that the person would use. Rather, the key variable was the per-
identities being grounded in a primordial sharing of language.
sons social network, a concept borrowed from sociology, and
One of the strongest expressions was that of Johann Gottlieb
deined as the informal social relationships contracted by an
Fichte ([1808] 1968, 1901):
individual (Milroy 1980, 174). Where close-knit localized net-
work structures existed, there was a strong tendency to maintain he irst, original, and truly natural boundaries of states are
nonstandard vernacular forms of speech. beyond doubt their internal boundaries. hose who speak the
Over the next two decades, sociolinguistic investigation same language are joined to each other by a multitude of invis-
of groups ideologically bound to one another shifted from ible bonds by nature herself, long before any human art begins;
370
Identity, Language and Ideology and Language
they understand each other and have the power of continuing to Fichte, J. G. [1808] 1968. Addresses to the German Nation. Trans.
make themselves understood more and more clearly; they belong R. F. Jones and G. H. Turnbull, ed. G. A. Kelly. New York: Harper Torch
together and are by nature one and an inseparable whole. Books.
Fishman, J. A., ed. 1999. Handbook of Language and Ethnic Identity.
Fichte was writing in order to rouse the German nation Oxford: Oxford University Press.
to repel the advance of Napoleon. However, in 1870, the shoe Hobsbawm, E. J. 1990. Nations and Nationalism since 1780: Programmes,
was on the other foot when the Franco-Prussian War led to the Myth, Reality. Cambridge: Cambridge University Press.
German annexation of Alsace, a German-speaking province that Holmes, J., and M. Meyerhof, eds. 2003. he Handbook of Language and
had been part of France for more than two centuries and whose Gender. Malden, MA, and Oxford: Blackwell.
population was mainly loyal to France in spite of their linguistic Joseph, J. E. 2004. Language and Identity: National, Ethnic, Religious.
Houndmills: Basingstoke; New York: Palgrave Macmillan.
diference. his provoked a sharp turn away from the Fichtean
Kroskrity, P. V., ed. 2000. Regimes of Language: Ideologies, Polities, and
view on the part of French linguists, such as Ernest Renan (1882),
Identities. Santa Fe, NM: School of American Research Press.
who formulated a new view of national identity as based not in
Milroy, L. 1980. Language and Social Networks. Oxford and New
any primordially determining characteristic such as language York: Blackwell.
but on a shared will to be part of the same nation, together with Renan, E. 1882. Quest-ce quune nation? Confrence faite en Sorbonne,
shared memories. le 11 mars 1882. Paris: Calmann Lvy.
he nation, in other words, exists in the minds of the people Smith, A. D. 1998. Nationalism and Modernism: A Critical Survey
who make it up. his is the conception that B. Anderson ([1983] of Recent heories of Nations and Nationalism. London and New
1991, 6) would return to in deining the nation as an imagined York: Routledge.
political community. he legacy of memories to which Renan Tajfel, H. 1978. Social categorization, social identity and social com-
pointed would dominate future philosophical and academic parison. In Diferentiation between Social Groups: Studies in the
Social Psychology of Intergroup Relations, ed. H. Tajfel, 6176.
attempts to analyze national identity. M. Billig, a colleague and
London: Academic Press.
collaborator of Tajfel, has explored how the continual acts of
Turner, J. C., M. A. Hogg, P. J. Oakes, S. D. Reicher, and M. J. Wetherell.
imagination on which the nation depends for its existence are 1987. Rediscovering the Social Group: A Self-Categorization heory.
reproduced (1995, 70), sometimes through purposeful deploy- Oxford: Blackwell.
ment of national symbols but mostly through daily habits of Verschueren, J., ed. 1999. Language and Ideology: Selected Papers from
which we are only dimly aware. Examples include the national the 6th International Pragmatics Conference. Antwerp: International
lag hanging in front of the post oice and the national symbols Pragmatics Association.
on the coins and banknotes we use each day. Billig introduced
the term banal nationalism to cover the ideological habits that
enable the established nations of the West to be reproduced. In IDEOLOGY AND LANGUAGE
Billigs view, an identity is to be found in the embodied habits of
social life (1995, 8), including language. A. D. Smith (e.g., 1998, he concept of ideology refers broadly to the ways in which a
Chapter 8) has emphasized how much of the efort of nation- persons beliefs, opinions, and value systems intersect with the
alism construction is aimed at reaching back to the past in the wider social and political structures of the society in which he
interest of ethno-symbolism, and this can be seen particularly or she lives (cf. politics of language). Many linguists, espe-
in the strong investment made by modern cultures in maintain- cially those working in the traditions of critical linguistics (e.g.,
ing the standard language, by which is meant a form resistant to Fowler et al. 1979) and, more recently, critical discourse
change, hence, harking backward (see Hobsbawm 1990). Every analysis, take the view that language or more exactly, a range
time we attend to the fact that someone has spoken or written of language practices are inluenced by ideology. From this
in a standard or nonstandard way, we take part, usually without perspective, all texts, whether spoken or written, are seen as
realizing it, in both the national construction of our language and being inexorably shaped and determined by a mosaic of political
the linguistic construction of our nation. beliefs and sociocultural activities. he critical linguistic position
on language is, therefore, one that challenges directly the liberal
John E. Joseph construal of texts as natural products of the free communicative
interplay between individuals in society. For critical linguists,
WORKS CITED AND SUGGESTIONS FOR FURTHER READING texts are anything but neutral or disinterested, and so it falls to
Anderson, B. [1983] 1991. Imagined Communities: Relections on the Origin close linguistic analysis to help us understand how ideology is
and Spread of Nationalism. 2d ed. London and New York: Verso. embedded in language and, consequently, to become aware of
Billig, M. 1995. Banal Nationalism. London: Sage. how the relexes of dominant or mainstream ideologies are
Blommaert, J., ed. 1999. Language Ideological Debates. Berlin: Mouton de sustained through textual practices.
Gruyter. Although coined in the early 1800s by the French philoso-
Bourdieu, P. [1982] 1991. Language and Symbolic Power: he Economy
pher Destutt de Tracy, the term ideology is normally associated
of Linguistic Exchanges. Trans. G. Raymond and M. Adamson, ed.
with Karl Marx, particularly with his treatise on he German
J. B. hompson. Cambridge: Polity, in association with Basil Blackwell.
Ideology, a project developed in 18456, but published, in
Cameron, D., and D. Kulick. 2003. Language and Sexuality.
Cambridge: Cambridge University Press. various languages and installments, from the 1930s onward (see
Eckert, P., and S. McConnell-Ginet. 1992. hink practically and look Marx [1933] 1965). Over the intervening years, the concept has
locally: Language and gender as community-based practice. Annual been adopted more widely (and without any necessary adher-
Review of Anthropology 21: 46190. ence to Marxist doctrine) to refer to the belief systems that are
371
Ideology and Language
held either individually or collectively by groups of people and these ideological assumptions are transmitted surreptitiously and
to the social conditions that frame these systems. In Marxs are mediated through forms of language that present as natural
original conception, ideology is seen as an important means by or common sense certain beliefs and values that may prove to
which dominant forces in society, such as royalty, the aristoc- be highly contestable or dubious in their own terms.
racy, or the bourgeoisie, can exercise power over subordinated Take as an example the following discourse event that
or subjugated groups, such as the industrial and rural proletar- unfolded over a few months in the British tabloid he Sun. his
iat. His famous axiom that the ideas of the ruling class are in popular daily voiced vehement opposition to the British gov-
every epoch the ruling ideas ([1933] 1965, 61), along with his ernments plans to celebrate the advent of the millennium by
observation that the ruling material force is at the same time the the construction, at taxpayers expense, of a festival dome in
ruling intellectual force, has had a profound impact on the way Greenwich, London. Notice how in these extracts the paper
contemporary linguistic research has understood discourse sometimes uses italicization to enforce the common-sense sta-
in the public sphere. Ideology, and its expression in the textual tus of its position on the Millennium Experience:
practices that shape our everyday lives, is not something that
he Sun Speaks Its Mind: DUMP THE DOME, TONY! (June 17,
exists in isolation as a product of free will but is instead partial
1997; original emphasis)
and contingent. It is something whereby, as Louis Althusser
suggests, ideas are inserted into the hierarchical arrangement of MPS, businessmen and charities yesterday backed our see-sense
socially and politically determined practices and rituals, which campaign to axe the 800 million Millennium Exhibition planned
are themselves deined by material ideological state apparatuses for Greenwich. (June 18, 1997; italics in original)
(Althusser 1971, 158). In short, in the Marxist tradition, ideology hat dammed Dome has disaster written all over it. he creative
is, most importantly, a system of beliefs that fosters consent to director accuses the Dome secretary of acting like a dictator
social hierarchy, particularly class hierarchy. Subsequent writ- who is too easily swayed by public opinion. If only he was. Maybe
ers inluenced by this tradition have expanded the notion from then this waste of public money would be axed. For thats what
class hierarchy (as in class or, more speciically, capitalist ideol- public opinion wants. (Jan. 1, 1998; italics in original)
ogy; see marxism and language ), to sex (patriarchal ideol-
ogy; see gender and language ), to colonialism (colonial An appeal to commonsense values of the sort displayed here
ideology), and so on. allows the paper to present its objection to the dome as a posi-
Against this theoretical backdrop, scholars researching the tion with which any sensible member of society could concur.
interconnections between language and ideology build from the Among other things, the papers tactic is a good example of natu-
premise that patterns of discourse are framed in a web of beliefs ralization (Fairclough 1992, 678), which is the process whereby
and interests. A texts linguistic makeup functions to privilege cer- an ideological position is presented as if it were simply part of
tain ideological positions while downplaying others such that the natural order of things. Naturalization encourages us to align
the linguistic choices encoded in this or that text can be shown ourselves with mainstream or dominant thinking, even when
to correlate with the ideological orientation of the text. Even the that thinking is itself partisan, self-serving, and driven by eco-
minutiae of a texts construction can reveal an ideological stand- nomic and political interests. Indeed, to demur from he Suns
point, and productive comparisons can be drawn between the position would be to place oneself outside the community of
ways in which a particular linguistic feature is deployed across notional sensible subjects who share the same set of norma-
diferent texts. For instance, the following three simple examples tive values as the paper. Yet if proof were needed of the partisan
difer only in terms of the main verb used: and capricious nature of such naturalized ideological positions
in discourse, consider as a footnote the following breathtaking
he senator explained the cutbacks were necessary. U-turn that appeared in the same tabloid newspaper shortly
he senator claimed the cutbacks were necessary. after the publication of the previous diatribes:
he senator imagined the cutbacks were necessary. he plans for the Millennium Experience are dazzling. If it all
comes of, the Prime Ministers prediction will be correct: the
Whereas the irst example suggests that while the cutbacks were
Dome will be a great advert for Britain. (Feb. 24, 1998; italics in
unavoidable, the senators actions are helpfully explanatory,
original)
the more tenuous claimed of the second example renders the
senators attempt to justify an unpopular measure less convinc- It may have been entirely coincidental that this sudden change
ing. he third example is arguably more negative again, with the in direction occurred on the same day that the papers owner
nonfactive verb imagined suggesting that the obverse condi- pledged 12 million worth of sponsorship to the Millennium
tion applies in the embedded clause, namely, that the senator is Dome.
mistaken in his belief that the cutbacks were necessary. A range of linguistic models have been used over the last
Another important assumption in work on ideology and lan- quarter of a century to explore the interconnections between
guage is that the linguistic structure of a text often works silently language and ideology. Prominent among these has been
or invisibly to reproduce relationships of power and dominance. the concept of register, which is a variety of language that
In consequence, the processor of a text such as the reader of a is deined according to context and use. Linguists have
tabloid newspaper, for example is encouraged to see the world in noticed that in times of war and conlict, in particular, spuri-
particular ways and in ways that are often aligned with the domi- ous specialist registers of discourse are quietly disseminated
nant or mainstream ideology espoused by the paper. Crucially, through the public sphere by inluential social and political
372
Ideology and Language Idioms
groups. In the speciic context of the widespread proliferation of criticism is about the sorts of texts that analysts choose to
of nuclear arms in the 1970s, critical linguists adopted the term subject to ideological analysis. To be blunt, if we know a text to
Nukespeak, in an echo of George Orwells Newspeak, to refer be ideologically problematic at the outset, then any subsequent
to a (mis)use of register in order to mask what for the general linguistic analysis will only conirm what we already know, and
public were the unpalatable horrors of nuclear conlict (see any linguistic feature uncovered through the analysis can by
Chilton 1985). In fact, Nukespeak still reverberates in the con- imputation be passed of as ideologically insidious. his deter-
temporary discourses of war: Collateral damage refers to the ministic approach connects with the third major area of con-
unintentional killing of civilians and noncombatants, incon- cern, which is simply that studies of ideology and language tend
tinent ordnance to poorly aimed missiles, and friendly ire to to be elitist. If the main purpose of the analysis is to uncover and
the inlicting of injury or death to ones allies. While the phrase challenge the repressive discourse practices of powerful, inter-
human remains transportation pods is a heavily sanitized label ested groups, then what needs to be considered before anything
for body bags, the expression advanced marine biological sys- else are the efects of these practices on ordinary (nonacademic)
tems refers rather improbably to dolphins, which, when suitably people. Reactions of ordinary communities to what the analysts
armed, apparently make excellent seaborne weapons systems. deem ideologically insidious discourse are rarely considered;
In addition to the exploitation of register, the strategic use instead, the academic analyst comfortably assumes the perspec-
of metaphorical language has also been identiied as a mecha- tive of those for whom the text was intended, moving seamlessly
nism for sustaining and disseminating ideological dogma (see in and out of the multiple interpretative positions of specialist
Charteris-Black 2004). Paul Simpson (2004, 42) ofers the follow- and nonspecialist alike. It is still early to say how these serious
ing examples from print and broadcast coverage of the conlict and far-reaching criticisms will afect the ways in which scholars
in Iraq in 2003: investigate the widespread interconnections between ideology
and language.
he third mechanized infantry are currently clearing up parts
of the Al Mansur Saddam village area. Paul Simpson
he regime is inished, but there remains some tidying up
to do. WORKS CITED AND SUGGESTIONS FOR FURTHER READING
Oicial sources described it as a mopping up operation. Althusser, L. 1971. Lenin and Philosophy and Other Essays.
London: NLB.
hese examples rehearse the same basic conceptual meta- Blommaert, J. 2005. Discourse: A Critical Introduction. Cambridge:
phor through three diferent linguistic realizations. he experi- Cambridge University Press.
ence of war, the target domain of the metaphor, is relayed by the Charteris-Black, J. 2004. Corpus Approaches to Critical Metaphor Analysis.
idea of cleaning, the source domain (see source and target), Basingstoke and New York: Palgrave MacMillan.
such that the metaphorical formula might be rendered thus: WAR Chilton, P. 1985. Language and the Nuclear Arms Debate: Nukespeak
IS CLEANING. he ideological signiicance of this metaphor is Today. London: Pinter.
that it downplays the signiicance (and indeed risk) of the con- Fairclough, N. 1992. Discourse and Social Change. Cambridge: Polity
Press.
lict, implying that it is nothing more than a simple exercise in
. 2001 Language and Power. 2d ed. London: Longman.
sanitation, a perspective, it has to be said, that is unlikely to be
Foucault, M. 1984. Truth and power. In he Foucault Reader, ed.
shared by military personnel on the opposing side. P. Rabinow, 5175. Harmondsworth, UK: Penguin Books.
Ideological standpoint in language can also be productively Fowler, R. 1991. Language in the News: Discourse and Ideology in the
explored by means of comparisons between diferent texts, espe- Press. London: Routledge.
cially when the texts analyzed cover the same subject matter. Of Fowler, R., R. Hodge, G. Kress, and T. Trew. 1979. Language and Control.
the range of linguistic models that have been thus employed, London: Routledge.
those from functional linguistics have proved particularly Marx, K. [with Frederick Engels]. [1933] 1965. he German Ideology. Ed.
useful as an analytic tool for investigating ideological standpoint and trans. S. Ryazanskaya. London: Lawrence and Wishart.
across diferent portrayals of the same event or experience (see Simpson, P. 1993. Language, Ideology and Point of View.
Fowler 1991; Simpson 1993). London: Routledge.
. 2004. Stylistics. London: Routledge.
he investigation of ideology in language is an undeniably
Stubbs, M. 1997. Whorfs children: Critical comments on critical dis-
important focus for the language sciences. hat said, there have
course analysis (CDA). In Evolving Models of Language, ed. A. Ryan
been a number of stinging attacks on this area of study from and A. Wray, 10016. Swansea: BAAL.
respected authorities (e.g., Widdowson 1995, 1996; Stubbs 1997; Widdowson, H. 1995 Discourse analysis: A critical view. Language and
Blommaert 2005), which have called into question the valid- Literature 4.3: 15772.
ity of key parts of the whole endeavor. hese criticisms have . 1996. Reply to Fairclough: Discourse and interpreta-
tended to cluster around three main issues. he irst concerns tion: Conjectures and refutations. Language and Literature
the term ideology itself, which, even since its inception in the 5.1: 5770.
work of Marx and de Tracy, has proved too broad and too vague
a concept to slot comfortably into a formal analytic framework.
IDIOMS
Indeed, Michel Foucault has argued that the notion of ideol-
ogy is diicult to make use of because it always stands in vir- Idioms hold an important place in the class making up ixed, non-
tual opposition to something else (1984, 60). he second type literal expressions. hese have a perplexing characteristic: hey
373
Idioms Idle Talk and Authenticity
communicate something other than what the words usually Wray, Alison. 2002. Formulaic Language and the Lexicon.
mean. Idioms are the best and most well known representative Cambridge: Cambridge University Press.
of this class. Other examples are speech formulas (Nice to meet
you), proverbs (While the cats away, the mice will play), IDLE TALK AND AUTHENTICITY
clichs (Easy does it), and expletives (For heavens sake!). hese
difer from metaphors, which, though also nonliteral, are made up Martin Heideggers magnum opus, Being and Time, advances a
of novel word combinations. Terminology is inconsistent, but the hermeneutical phenomenology that is, an interpretive descrip-
term formulaic language has become standard (Wray 2002). tion of what it means to be Dasein (a German word for exis-
Deinitions are elusive, and lines between the categories are tence, which Heidegger uses to mean human being). Dasein
not always clear. In deining formulaic expressions, it is easier is not the disembodied rational soul, mind, or subject described
to say what they are not: hey are not newly created phrases or by traditional metaphysics and epistemology but an embodied,
sentences made up of lexical elements (words) according to worldly person. Having a world is constitutive of human exis-
grammatical rules. Instead, they are learned, stored, and pro- tence; as Heidegger says, Daseins being is essentially being-in-
cessed as unitary conigurations. he meaning of the idiom She the-world (In-der-Welt-sein).
has him eating out of her hand does not convey information To be in the world is to be thrown into an environment and a
about eating or hands but, instead, refers to a complex human tradition beyond your choosing and to take future-deining pos-
relationship whereby one person is submissive to the other. sibilities from it as your own or to fail to do so. he way in which
Idioms beneit from this ability to pack an aura of connotations people typically fail to make their possibilities, hence their being,
in a complex meaning. their own is by simply doing what one does or, as Heidegger
Idioms have two characteristic properties: stereotyped form says, being ruled by the one (das Man). Social conformity
and conventional meaning. Stereotyped form means that cer- cannot be avoided altogether, of course, but beyond a certain
tain words appear in a particular order with a particular speech point it amounts to what he calls inauthenticity, or disowned-
melody. he idiom I wouldnt want to be in his shoes, to be ness (Uneigentlichkeit). Authenticity, by contrast, he describes
well formed, must have precisely those words in that order, as forerunning resoluteness (vorlaufende Entschlossenheit),
with the accent on his. Changes may be introduced because which is to say, facing up (resolutely) to the concrete situation
idioms are decomposable. Linguists have attempted to charac- and embracing (running up into) your death. Forerunning res-
terize syntactic operations that may be performed on idioms, oluteness is nothing self-destructive or suicidal, though, since
and many psychological studies have pursued this question. by death, Heidegger means neither the biological end nor the
It appears that much potential variation exists, depending on biographical conclusion of a life but, rather, the constant closing
communicative need. he complex meanings associated with down of possibilities.
idioms contain emotional and attitudinal nuances. he idi- A crucial contributing factor to Daseins characteristic lapse
omatic expression Its a small world signals recognition of a into inauthenticity is its unavoidable involvement in a public
chance meeting in an unexpected place by two acquaintances, language governed by anonymous norms of correctness and
relecting surprise and serendipity. In contrast, the literal propriety. In Being and Time, Heidegger refers to language
statement Its a small tree merely conveys information about not as a formal syntactic or semantic system, but as the
relative tree size. he stereotyped forms and conventional concrete manifestation of discourse (Rede), or expressive-
meanings of idioms are known to the native speaker but dif- communicative behavior broadly construed. Most everyday
icult for the second language learner. Idioms are learned and discourse, he says, is idle talk (Gerede) or chitchat, generic
processed by cognitive and neurological mechanisms diferent conversation in which we merely pass the word along, as
from those underlying novel expressions (Van Lancker Sidtis opposed to speaking authentically. Although the public, generic
2006). heir number is often underestimated; in eforts at com- character of a shared language contributes to our lapse into idle
piling lists, no upper limit has yet been determined. Corpus talk, not all speech is inauthentic. Dasein, that is, can speak in
studies often utilize computer search techniques to quantify conformity with publicly recognized norms of correctness and
incidence, but a human interface is necessary for identifying yet still speak in its own voice.
idiomatic forms. Heideggers concept of idle talk owes a large but unacknowl-
edged debt to Sren Kierkegaards account of talkativeness, or
Diana Van Lancker Sidtis chat, in his critique of the present age in A Literary Review
([1846] 2001). Heidegger even follows Kierkegaard in describing
WORKS CITED AND SUGGESTIONS FOR FURTHER READING the banalizing, conformity-inducing efect of chatter as a kind of
Cacciari, C., and P. Tabossi, eds. 1993. Idioms: Processing, Structure, and leveling process. But whereas Kierkegaard saw such conform-
Interpretation. Hillsdale, NJ: Lawrence Erlbaum. ism and supericiality as characteristic of modern European
Nunberg, Geofrey, I. Sag, and T. Wasow. 1994. Idioms. Language culture, Heidegger regarded it is essential to our being-in-the-
70: 491538. world. For Heidegger, that is, there could be no shared public
Titone, Deborah, and C. Connine. 1999. On the compositional and non- world at all in the absence of a relatively bland background
compositional nature of idiomatic expressions. Journal of Pragmatics
of established ways of conducting oneself, linguistically and
31: 165574.
nonlinguistically.
Van Lancker Sidtis, Diana. 2006. Where in the brain is nonliteral lan-
guage? Metaphor and Symbol 21. 4: 21344. Taylor Carman
374
Ijtihd (Interpretive Effort)
WORKS CITED AND SUGGESTIONS FOR FURTHER READING is what writers in the European ethical tradition refer to as the
Blattner, William. 2006. Heideggers Being and Time. London:
problem of relevant act descriptions [see Chapter 2 of Nell
Continuum. 1975].)
Carman, Taylor. 2003. Heideggers Analytic: Interpretation, Discourse, and here are some general principles that may guide ijtihd. For
Authenticity in Being and Time. Cambridge: Cambridge University example, in cases that involve punishment of an ofender, one is
Press. to incline toward mercy (see Waines 1996, 81). More generally,
Dreyfus, Hubert L. 1991. Being-in-the-World: A Commentary on interpretation is to some degree constrained by two sorts of
Heideggers Being and Time, Division I. Cambridge, MA: MIT Press. coherence coherence with the body of the law and coherence
Heidegger, Martin. 1962. Being and Time. Trans. J. Macquarrie and with the views of jurists. On the other hand, jurists often disagree.
E. Robinson. New York: Harper and Row. Moreover, apparent contradictions with the body of law may be
Kierkegaard, Sren. [1846] 2001. A Literary Review. Trans. A. Hannay.
resolved by reinterpreting other parts of the law. hus, in debat-
New York: Penguin.
ing a passage P, one jurist may cite passage N against the inter-
Wrathall, Mark. 2005. How to Read Heidegger. New York: Norton.
pretation put forth by a second jurist. But this second jurist may
interpret the passage N in such a way that it its with his or her
IJTIHD (INTERPRETIVE EFFORT) initial interpretation of passage P. In this way, the criterion of
Ijtihd sometimes contrasted with taqld or uncritical accep- coherence is often not decisive.
tance of someone elses opinion (Chaumont 1997, 11) is a con- In many ways, ijtihd is as much an attitude as a method. he
cept from Muslim legal theory (see legal interpretation). It point is suggested by a hadth that contrasts Quranic verses that
refers to the interpretive efort involved in understanding a point anyone can understand with those that someone with a pure
of law. he various schools of Muslim legal thought difer on the mind can understand (Gleave 1997, 33). Ijtihd necessarily
degree to which they accept or advocate ijtihd. draws on techniques of inference (such as analogy) and princi-
here are two places in which ijtihd is likely to arise as an ples of selection across possible inferences (such as coherence).
issue. One is in the extension of particular judgments to new But it fundamentally involves sincere and open-minded work to
cases. When this occurs, the common method of inference is understand the purport of a legal or other passage.
qiys or analogy (see Jokisch 1997). Take a simple example In this respect, ijtihd may be seen as an instance of a gen-
using one hadth. (A hadth is a brief narrative of a decision or eral pattern in Islam. he word Islam means submission.
practice, most often from Muhammad, that has been passed Submission to Allah is paired with a struggle against anything
down by tradition and has roughly the status of a precedent in that would inhibit that submission. A possible inhibition may be
law.) Muhammad says that there is no tax payment required a itnah, or trial. When faced with such a possible inhibition, the
for fewer than ive camels (Ali n.d., 215). One might narrowly believer engages in a struggle, or jihad, in order to bear witness to
constrain the application of the hadth, not extending it beyond truth. hat inhibition may be ones own inclination to err. In that
camels. Alternatively, one might calculate the worth of the case, ones jihad is against that inclination. It may also be against
camels and use that as a basis for application. A third possi- some violence perpetrated upon oneself or others. In that case,
bility involves qiys. But this is not simple. For example, ive jihad may take the form of battle (hence, the popular conception
cars would probably not parallel ive camels. Perhaps the anal- of jihad as holy war).
ogy should be based on the importance of the objects for ordi- One may think of ijtihd as analogous to jihad, thus as a strug-
nary functioning in society. his is where interpretive efort is gle to establish and bear witness for truth, in this case through
required. interpretation. (he words are etymologically related.) In keep-
he second obvious place for ijtihd is in understanding the ing with this, the Quran opposes right interpretation to itnah
general meaning of texts. he Quran suggests this, explaining of connected with false interpretation (Ali 1995, 3.6).
itself that some of its verses are decisive and others are alle- hough linked in its origins to religious attitudes and faith in
gorical (Ali 1995, 3.6). he mere existence of the allegorical or, revelation, the general concept of ijtihd may not be irrelevant
in Dawoods translation, ambiguous verses indicates the to secular forms of interpretation within or outside of Muslim
necessity of interpretive efort in these cases. Indeed, the dif- hermeneutic traditions (see philology and hermeneutics).
ference in translations suggests the importance of interpretive Indeed, the concept seems germane to a range of legal, liter-
efort. Moreover, once meaning is determined, interpretative ary, conversational, and even scientiic discourses (on the
efort may be required for understanding the purpose of a pas- relevance of issues in interpretation, including scriptural inter-
sage. For example, many stories in the Quran are presented as pretation, to science, see Lecture 4 of van Fraassen). It suggests
literally true accounts of Allahs responses to human behavior. something about the ways in which we come to understand
hese frequently serve as warnings. But understanding these these discourses when our spontaneous or automatic response
warnings may require interpretive efort to discern the exact has been interrupted and we must engage in self-conscious
nature of the actions forbidden. When someone is punished, relection on meaning (cf. passing theories). Indeed, its
we must infer what construal to give that persons actions so as opposition to taqld suggests partially parallel oppositions
to understand the point of the punishment. For example, Lots found in inluential Western theories, such as those of Bakhtin
wife was punished because she remained behind when Lot (see dialogism) and Martin Heidegger (see, for example,
left (Dawood 1990, 7:83). What is her sin disbelief in a prophet, Heidegger 1962, 21114).
disobedience of her husband, association with the sinful? (his Patrick Colm Hogan
375
I-Language and E-Language Illocutionary Force and Sentence Types
WORKS CITED AND SUGGESTIONS FOR FURTHER READING A last inluential tradition of E-language is platonism,
Ali, Maulana Muhammad, ed. and trans. 1995. he Holy Qurn. revived by J. Katz and others. According to this view, language
Columbus, OH: Ahmadiyyah Anjuman Ishaat Islam. consists of abstract objects and properties that have an existence
. N.d. A Manual of Hadith. Lahore: Ahmadiyya Anjuman Ishaat independent of both our mind and cultural record.
Islam. As for I-language, Chomsky (1986, 21) refers to O. Jespersen,
Chaumont, ric. 1997. Ijtihd et histoire en islam sunnite classique selon who claimed that there is some notion of structure in the mind
quelques juristes et quelques thologiens. In Gleave and Kermeli of the speaker. According to Chomsky, it is a distinct system of
1997, 723. the mind/brain that grows in the individual from an initial state
Dawood, N. J., ed. and trans. 1990. he Koran. New York: Penguin. S0 to a stable state Ss. his growth, comparable to the growth of
Gleave, Robert. 1997. Akhbr Sh usl al-iqh and the Juristic heory of
an organ, involves only minimal external factors, such as those
Ysuf b. Ahmad al Bahrn. In Gleave and Kermeli 1997, 2445.
that help set the parameters that distinguish the grammars of dif-
Gleave, R., and E. Kermeli, eds. 1997. Islamic Law: heory and Practice.
ferent languages. Seen this way, the study of language is part of
London: Tauris.
Heidegger, Martin. 1962. Being and Time. Trans. John Macquarrie and individual cognitive psychology and, ultimately, part of human
Edward Robinson. New York: Harper and Row. biology. In order to counter the obvious objection that language
Jokisch, Benjamin. 1997. Ijtihd in Ibn Taymiyyas fatw. In Gleave also involves external elements, Chomsky makes a distinction
and Kermeli 1997, 19937. between the faculty of language in the narrow sense (FLN) and
Nell, Onora. 1975. Acting on Principle: An Essay on Kantian Ethics. the faculty of language in the broad sense (FLB). he notion of
New York: Columbia University Press. I-language particularly applies to FLN, which has recursion as its
van Fraassen, Bas C. 2002. he Empirical Stance. New Haven, CT: Yale core property (Hauser et al. 2002).
University Press. It is a matter of controversy whether the opposition of
Waines, David. 1996. An Introduction to Islam. Cambridge: Cambridge
E-language to I-language makes sense, since FLN, no matter how
University Press.
internal, involves words and therefore externally coded conven-
tions. Even in its narrowest sense, then, language seems to inte-
I-LANGUAGE AND E-LANGUAGE grate E- and I-elements.
In order to counter earlier misunderstandings, Noam Chomsky Jan Koster

(1986, 20f.) made a distinction between E-language and
I-language. E-language stands for externalized language and WORKS CITED AND SUGGESTIONS FOR FURTHER READING
I-language for internalized language. E-language is deined Chomsky, Noam. 1986. Knowledge of Language. New York: Praeger.
as language independent of the properties of the mind/brain. Hauser, Marc D., Noam Chomsky, et al. 2002. he faculty of lan-
I-language, in contrast, is language seen as a property of an guage: What is it, who has it, and how did it evolve? Science 298
individuals mind/brain. he neologism mind/brain relects (November): 156979.
Chomskys belief that theories of mental faculties, particularly
generative grammars, are ultimately about the brain at
ILLOCUTIONARY FORCE AND SENTENCE TYPES
some level of abstraction.
here are various ways of seeing language as external to the John L. Austin (1962) and John R. Searle (1969, 1975) distinguish
human mind. he immediate target of Chomsky (1986, 20) is the meaning (see sense and reference) of a sentence from
language as a collection (or system) of actions or behaviors of its illocutionary force. Austin and Searle conceive of the illo-
some sort. his is the behavioristic view of language that Chomsky cutionary force as the act performed by the speaker with his/
criticized in L. Bloomield and B. F. Skinner. A related notion, her utterance. Examples of illocutionary force are expressing a
defended by W. V. Quine and R. Montague, sees human languages belief, asking the addressee a question, and warning or advis-
as analogous to formal languages. According to this conception, ing someone. he illocutionary force must be intended by the
languages are extensionally deined as sets of sentences or well- speaker, and it must be possible for the addressee to recognize
formed formulas, and there is no empirical dimension to the ques- it (see also speech-acts , performative and constative ,
tion which grammar is the correct one: Grammars are equally valid perlocution).
variants if they generate extensionally equivalent languages. Illocutionary force interacts with sentence grammar. his
A very common tradition of E-language sees languages as short entry focuses on illocutionary consequences of the syn-
largely existing in some mind-external cultural record. his tra- tactic sentence types declarative and interrogative, with some
dition goes back to Aristotle, was selectively revived by J. G. von reference to imperatives. (See Knig and Siemund 2007 on
Herder and the German Romanticists, and deeply inluenced the cross-linguistic usefulness of this traditional three-way
the European structuralism of Ferdinand de Saussure and the distinction.)
American structuralism of F. Boas and E. Sapir. Ludwig
Wittgensteins idea that language consists of conventional rules THE MEANING OF DECLARATIVES. In philosophical logic and
constitutive of language games also its this tradition, as does linguistic semantics, propositions traditionally represent
Karl Poppers proposal that language is part of his world 3 (some the content of both that-clauses (that it is raining) and declara-
recent ideas related to E-language may be found in meaning tives (It is raining). Austin and Searle draw on this notion in dis-
externalism). Relativistically interpreted, this tradition is at tinguishing the meaning of a clause from its illocutionary force
variance with universal grammar. according to the schema Force(proposition). For example, the
376
Illocutionary Force and Sentence Types
two aspects of the statement It is raining can be rendered as he hus, the semantic distinction between declaratives and
speaker expresses that (s)he believes (that it is raining). interrogatives is useful for distinguishing typical and untypical
uses of declaratives and for distinguishing typical and untypical
THE MEANING OF INTERROGATIVES. Semantic theory (see, for uses of interrogatives. he lexibility for diferent uses for both of
example, Karttunen 1977; Groenendijk and Stokhof 1982) has these sentence types may suggest a purely pragmatic assignment
since developed meanings for syntactic interrogatives that cover of illocutionary force to propositions and partitions.
unembedded interrogatives like (1a), (2a) and embedded inter-
rogatives like (1b), (2b). hese meanings (difering in detail GRAMMAR AFFECTS ILLOCUTIONARY FORCE. Yet, there is evi-
between authors) may be paraphrased as in (1c), (2c). he mean- dence that grammar interferes with the assignment of illocution-
ings assigned by Jeroen Groenendijk and Martin Stokhof (1982) ary force:
are called partitions.
First, a purely pragmatic account of declarative questions leads
(1) a. Is it raining? (2) a. Whom did he invite?
to the expectation that a declarative questions like John is at
b. Mary wonders b. Mary wonders
home? shares the illocutionary force of a yes/no question like
[whether it is raining] [whom he invited]
Is John at home? In the declarative question, the speaker would
c. the true one of c. the true one(s) of
indicate by the intonation that he/she does not commit to p, and
{that it is raining, {that he invited Bill,
so the next likely interpretation would be that the speaker is won-
that it is not raining} that he invited Jane, }
dering whether p is true and seeks the help of the addressee in
Note that the partitions in (1c) and (2c) do not contain a compo- this regard. In fact, however, declarative questions strongly difer
nent of illocutionary force. hey are also the meanings of embed- from yes/no questions.
ded interrogatives like (1b) and (2b), and embedded clauses are
(3) [Telephone conversation] (4) [Steve calls Ann on the phone]
standardly assumed not to have illocutionary force.
Ann: Hang on, Ill ask John Ann (picks up phone): Hello?
hus, the meaning of a declarative is a proposition and the
Steve: John is at home? a. Steve: # John is at home?
meaning of an interrogative is a partition. Let us consider how
cf. b. Steve: Is John at home?
illocutionary force of two kinds may be added to these mean-
ings: i) illocutionary force as a statement: the speaker expresses he declarative question John is at home? can be asked only
that (s)he believes something to be true, and ii) illocutionary force where the speaker can assume that the addressee believes that
as a question: the speaker wants to learn from the addressee the John is at home (Gunlogson 2001). his is the case in (3) but
truth in an as-yet open issue. Let us begin by assuming that the not in (4a), where the declarative question is accordingly infe-
choice between statement and question is pragmatically inferred licitous (symbolized by #; see also felicity conditions).
rather than grammatically triggered. Interrogatives do not carry this strong requirement; compare
(4b). hus, even though declaratives cannot have a statement
THE ILLOCUTIONARY FORCE OF DECLARATIVES. he meaning of a interpretation hardwired into them, they seem to have some-
declarative (It is raining) as a proposition (that it is raining) makes thing more lexible hardwired into them that goes beyond the
it likely that the speaker intends a statement interpretation of his/ propositional content: A declarative sentence seems to either
her utterance (the speaker conveys that [s]he believes that it is rain- commit the speaker to p (statement) or assume a commitment of
ing), perhaps in connection with the maxim of quality in Grice the addressee to p (declarative question) (see Gunlogson 2001).
(1975), which requires such truthfulness (see also cooperative Second, root interrogatives also seem to have an illocutionary
principle). If we assume that this statement interpretation is not element hardwired into them. If Steve and Ann share ignorance
hardwired into declaratives, we correctly allow declarative ques- of things mechanic, Steve may say (5a) to Ann, but not (5b).
tions, as in It is raining? Here, a declarative sentence with ques-
(5) Steve to Ann (both ignorant about car problems):
tion intonation is used as a question. Unlike in statements, the
a. he carburetor of my car is broken. I wonder whether the
speaker does not convey that he/she believes that it is raining.
repairs will be expensive.
b. he carburetor of my car is broken. # Will the repairs be
THE ILLOCUTIONARY FORCE OF INTERROGATIVES. No single state
expensive?
of afairs (ordinary proposition) is ofered in the interrogative
meaning (the true one among , see [1c] and [2c]), and so his is unexpected on the view that a semantic partition is lexibly
there is no speciic content for an interpretation as a statement. augmented with illocutionary force by pragmatic inferences. In
he interrogative meaning makes a question interpretation (5a), the matrix predicate I wonder conveys that Steve is inter-
particularly likely, in which the speaker wants the addressee ested in the answer but does not have an expectation toward Ann
to pick the true one among the possibilities. If we assume that to provide it. he felicity of (5a) shows that this is not excluded
this illocutionary force is not hardwired into root interrogatives, for lack of conversational relevance or for other pragmatic rea-
we correctly allow the lexibility for untypical uses. In mono- sons. Why cannot the unembedded partition Will the repairs be
logical questions, such as in a lecture, the speaker later pro- expensive? in (5b) have a pragmatically inferred similar illocu-
vides the answer him-/herself: Who was behind these reforms? tionary interpretation? It seems that some hardwired element
Everything points towards Bismarck (Brandt et al. 1992). In rhe- of illocutionary interpretation here leads to an expectation of
torical questions like Am I your servant? the true one among an answer from the addressee and thus to the inappropriate-
the given possibilities is to be inferred and need not be stated. ness in (5b). A promising hypothesis about such a hardwired
377
Illocutionary Force and Sentence Types Image Schema
requirement is again that a commitment by a salient individual can be approximated in terms of a commitment by a salient indi-
is assumed or expressed. In the case of interrogatives, a commit- vidual that is expressed or assumed. Sentence mood (such as
ment to (1c)/(2c) is a commitment to the true element(s) of the imperative vs. indicative) plays a further crucial role in condi-
partition, that is, knowledge of the correct answer (whatever it tioning the illocutionary interpretation of a clause.
may be). he assumption of such a commitment in root interrog-
Hubert Truckenbrodt
atives is compatible with the examples discussed here. In rhetor-
ical questions, knowledge of the correct answer is assumed of the
speaker and perhaps of the addressee. In monological questions,
knowledge of the correct answer can be assumed of the speaker, Austin, John L. 1962. How To Do hings with Words. Oxford: Clarendon.
who will later provide this answer. In standard questions, Brandt, Margareta, Marga Reis, Inger Rosengren, and Ilse Zimmermann.
knowledge of the correct answer would then be assumed of the 1992. Satztyp, Satzmodus, und Illokution. In Satz und Illokution 1,
ed. Inger Rosengren, 190. Tbingen: Niemeyer.
addressee. hen (5b) would be infelicitous because knowledge of
Grice, H. P. 1975. Logic and conversation. In Syntax and Semantics
the correct answer could not be assumed of either the addressee
3: Speech Acts, ed. P. Cole and J. Morgan, 4158. New York: Academic
or the speaker (see Truckenbrodt 2004, 2006a, 2006b). Press.
hird, German declaratives (syntactically V2-clauses) can Groenendijk, Jeroen, and Martin Stokhof. 1982. Semantic analysis of
replace embedded dass-(that)-clauses under a range of atti- WH-complements. Linguistics and Philosophy 5: 175233.
tude verbs, including sagen (say) and glauben (believe) and Gunlogson, Christine. 2001. True to form: Rising and falling declaratives
excluding bezweifeln, (doubt) and leugnen (deny): Maria as questions in English. Ph.d. diss., University of California at Santa
glaubt/*bezweifelt, Hans ist zu Hause, Mary believes/*doubts, Cruz.
Hans is at home. A semantic restriction seems to be in efect that Heycock, Caroline 2006. Embedded root phenomena. In he Blackwell
relates declaratives to a salient individuals beliefs (or a similar Companion to Syntax. Vol. 2. Ed. M. Everaert and H. van Riemsdijk,
domain) Marias beliefs in the preceding example, the beliefs 174209. Oxford: Blackwell.
Karttunen, Lauri 1977. Syntax and semantics of questions. Linguistics
of the speaker or the addressee in unembedded use (statements
and Philosophy 1: 344.
and declarative questions) (Truckenbrodt 2006a, 2006b). More
Knig, Ekkehard, and Peter Siemund. 2007. Speech act distinctions in
generally, there is a class of syntactic phenomena (root phe- grammar. In Language Typology and Syntactic Description. Vol. 1. Ed.
nomena) that occur in unembedded clauses, as well as embed- T. Shopen, 276324. Cambridge: Cambridge University Press.
ded under the verbs believe, say, and other predicates that have Portner, Paul. 1997. he semantics of mood, complementation, and
been characterized as assertive (see discussion and references conversational force. Natural Language Semantics 5: 167212.
in Heycock 2006). It is possible that root phenomena are phe- Schwager, Johanna Magdalena. 2005. Interpreting imperatives. Ph.d.
nomena that trigger a semantic requirement of a commitment diss., University of Frankfurt am Main.
by a salient individual. Such a requirement might lead to the Searle, John R. 1969. Speech Acts. An Essay in the Philosophy of Language.
observed restrictions on embedded use. he requirement would Cambridge: Cambridge University Press.
crucially contribute to illocutionary force in unembedded use. . 1975. A taxonomy of illocutionary acts. In Language, Mind,
and Knowledge, ed. K. Gunderson, 34469. Minneapolis: University
Finally, verbal mood (indicative, subjunctive, imperative) is a
of Minnesota Press. Repr. John R. Searle, Expression and Meaning
grammatical category that interacts with illocutionary force. he
(Cambridge: Cambridge University Press, 1979), 129.
imperative seems to be interpreted deontically (Schwager 2005), Truckenbrodt, Hubert. 2004. Zur Strukturbedeutung von
that is, along the lines of You should with strength varying Interrogativstzen. Linguistische Berichte 199: 31350.
between a demand or request (Open the window!) to invitation . 2006a. On the semantic motivation of syntactic verb movement
(Have another piece of cake!) or even a wish (Have a good break!). to C in German. heoretical Linguistics 32: 257306.
A strong interaction of verbal mood with illocutionary force is . 2006b. Replies to the comments by Grtner, Plunze and
this: Have inished eating by 12:30! cannot be a statement to the Zimmermann, Portner, Potts, Reis, and Zaeferer. heoretical
efect that the addressee has inished eating by 12:30 (rather than Linguistics 32: 387410.
by 12:45, as the addressee may mistakenly believe). Nor can it
be a question that wants the addressee to clarify whether in fact
IMAGE SCHEMA
the addressee has inished eating by 12:30. More generally, the
deontic interpretation of the imperative cannot be replaced with A foundational concept in cognitive linguistics, image
an epistemic interpretation (i.e., one that negotiates knowledge), schemas are associated most closely with the work of Mark
while the indicative in declaratives and interrogatives typically Johnson (1987) and his collaborator George Lakof (1987). Image
leads to epistemic (statement and question) interpretations. (See schemas are thought to play a key role in the acquisition, struc-
Portner 1997 on indicative and subjunctive verbal mood.) ture, and use of language; a central tenet of cognitive linguistics
In sum, the grammatical distinction between declaratives is that the perceptual interactions and motor engagements with
and interrogatives leads to a semantic distinction between a the physical world shape the way we think. An image schema
proposition and a partition. his distinction is useful for a irst can be deined as a condensed representation of bodily motor
understanding of the typical and untypical illocutionary force of experience for purposes of mapping spatial structure onto con-
declaratives and of interrogatives. Illocutionary force may in part ceptual structure. According to Johnson, these patterns emerge
be assigned pragmatically. However, the assignment seems also as meaningful structures based on our bodily movements
to be subject to a grammatically triggered interpretation in root through space, our manipulations of objects, and our percep-
clauses. For declaratives and interrogatives, these interpretations tual interactions (1987, 29). Jean Mandler aptly likens image
378
Image Schema Implicational Universals
schemas to the representations one is left with when one has motion; and caused motion emerge as preverbal, perceptually
forgotten most of the details (2004, 7981). based conceptual primitives underlying cognitive development
Lack of speciicity and content makes image schemas highly and language acquisition. Chris Sinha and K. Jensen de Lopez
lexible preconceptual and primitive patterns used for reasoning take a slightly diferent tack in emphasizing the social-cultural
in an array of contexts (Johnson 1987, 30). A partial inventory of and artifact dimensions of the acquisition of locative preposi-
image schemas is as follows: container; balance; blockage; tions in and under in Danish-acquiring, English-acquiring, and
counterforce; restraint removal; source-path-goal; link; Zapotec-acquiring children. hey conclude that the sociocul-
center-periphery; verticality; scale; part-whole; support. tural environment plays a greater role in cognitive development
Since the publication of the inluential works of Johnson and than previously thought. Image schemas may in fact be distrib-
Lakof, the notion of image schemas has been theorized, inves- uted throughout the local environment, with considerable dif-
tigated, and applied in multiple domains of inquiry. Within ferences appearing among cultures with very diferent material
cognitive linguistics, the notion of an image schema has taken makeups.
center stage in research in semantic and grammatical analy- Finally, image schema theory is a central pillar of the neural
sis, psycholinguistics, cognitive development, and neuro- theory of language project initiated by Jerome Feldman (2006),
computational modeling, to name the most prominent areas of Lakof (1987), Srini Naryanan, and others. A crucial component
activity in the language sciences. Controversy has ensued over of this neural computational model is the execution or x-schema
the deinition of and criteria for positing something as an image protocol for representing human actions, which include models
schema, over its status as conscious or unconscious representa- for enacting drop-schema that simulate the neural computa-
tions, and even over its status vis--vis individual and social cul- tional activity involved. For instance, a computational represen-
tural cognition. tation for distinguishing between verbs such as push and shove
invokes the slide x-schema but difers in the microdetails of
Image Schemas Among Cognitive Linguists body-part movement and acceleration.
Image schemas play an important role in studies focusing on
Todd Oakley
polysemy of words and constructions, semantic change,
and text analysis. Studies of words and constructions include WORKS CITED AND SUGGESTION FOR FURTHER READING
Alan Cienkis (1998) comparison of the metaphoric projec-
tions of straight and prjamo in English and Russian. Likewise, Cienki, Alan. 1998. Straight: An image schema and its transformations.
Cognitive Linguistics 9.2: 10749.
image schemas igure prominently in arguments about seman-
Feldman, Jerome. 2006. From Molecule to Metaphor. Cambridge,
tic change. For instance, Marolijn Verspoor (1995) argues that
MA: MIT Press.
semantic change preserves image schematic structure, whereas Gibbs, Raymond W. 2005. he psychological status of image schemas.
Yo Matsumoto (1995) challenges a strong version of that hypoth- In Hampe 2005, 11335.
esis. Much text analysis and literary criticism inspired by cog- Hampe, Beate, ed. 2005. From Meaning to Perception: Image Schemas
nitive linguistics specify image schemas as that which make in Cognitive Linguistics. Berlin: Mouton De Gruyter. An edited collec-
linguistic innovation possible. Literary and textual criticism tion of essays presenting a wide range of views on the nature of image
within the image schema tradition is represented most promi- schemas.
nently by Mark Turner (1991). Johnson, Mark. 1987. he Body in the Mind. Chicago: University of
Raymond W. Gibbs (2005) has conducted extensive psycho- Chicago Press. his book is considered the locus classicus of image
linguistic experiments designed to demonstrate that image sche- schema theory.
Lakof, George. 1987. Women, Fire, and Dangerous hings.
mas organize not only experience but also semantic structure and
usage. His early experiments support the claim that image sche-
Mandler, Jean. 2004. he Foundations of Mind. New York: Oxford
mas are psychologically real and imply that they are enduring University Press.
mental representations, while his later experiments reine Matsumoto, Yo. 1995. From attribution/purpose to cause: Image
this view, suggesting that they are real but not representational schema and grammaticalization of some cause markers in Japanese.
structures per se. Rather, they are emergent structures con- In Lexical and Syntactic Constructions and the Construction of
tinuously created on the ly as part of human beings dynamic Meaning, ed. Marolijn Verspoor, K. Dong, and E. Sweetser, 287307.
simulations of actions and situations. In short, Gibbs argues that Amsterdam: John Benjamins.
the psychological reality of image schemas emerges from con- Sinha, Chris, and K. Jensen de Lopez. 2000. Language, culture, and the
tinuous interaction in a three-dimensional world and not from embodiment of spatial cognition. Cognitive Linguistics 11.1/2: 1741.
their being prestored representations in long-term memory Turner, Mark. 1991. Reading Minds. Princeton, NJ: Princeton University
Press.
(2005, 132).
Verspoor, Marolijn. 1995. Predicate adjuncts and subjectiication.
Image schemas igure prominently in some developmen-
In Lexical and Syntactic Constructions and the Construction of
tal accounts of the acquisition of concepts and language. Two Meaning, ed. Marolijn Verspoor, K. Dong, and E. Sweetser, 43349.
notable lines of research in this area include Mandler (2004) Amsterdam: John Benjamins.
and Sinha and Jensen de Lopez (2000). Mandler has argued that
image schemas arise from a process of perceptual meaning anal-
IMPLICATIONAL UNIVERSALS
ysis. Human neonates and infants appear to be engaging in forms
of perceptual analysis of space and motion, such that primitive typological universals seek to capture the limits of gram-
schemas for goal-path; linked path; self-motion; animate matical variation, that is, the extent to which particular traits
379
Implicational Universals
or features of languages may covary, both within languages also favor SOV order over non-SOV order. However, given the
and across languages. Typological universals may be classifed data in (2b), the strength of the implicational universal with SOV
along several dimensions. he most important distinction is as the antecedent and postpositions as the consequent is greater
between substantive and implicational universals. he for- (95%) than the converse universal with postpositions as the ante-
mer deal with a single property and relect what occurs in all cedent and SOV order as the consequent (91%).
languages (e.g., all languages have vowels). Since such unre- he implicational universals in (1) and (2) are simple ones,
stricted universals tell us little about variation, they are con- as they specify a dependency between only two traits. Relations
sidered to be of moderate interest. For the study of variation, between several traits are captured by means of complex univer-
implicational universals are of considerably greater interest sals that may involve conjunctions or disjunctions of traits within
since they relect not only the existence of variation but, cru- the antecedent and/or the consequent, as depicted in (3).
cially, the constraints imposed on it.
(3) a. X (Y & Z)
Implicational universals relate two (or more) logically inde-
b. (X or Y) Z
pendent features or properties and apply to some subset of
languages for which the given features or properties obtain. Individual implicational universals may be combined into
Implicational universals may be absolute or statistical. chains or hierarchies, such that the implicatum (or conclusion)
Absolute implicational universals hold for all languages; that is, of the irst universal is the implicans (or premise) of the second,
they specify that if a language has feature A, then it will have the implicatum of the second is the implicans of the third, and so
feature B. By contrast, statistical implicational universals spec- on. Since representing chains of implicational universals as such,
ify relationships that hold only at a certain level of probability. that is, as in (4a), is quite cumbersome, they tend to be depicted
Statistical implicational universals take the form, If a language in the form of a hierarchy, as in (4b).
has feature A, then with greater than chance frequency it will
(4) a. ((A B) & B C) & C D
have feature B. Examples of the two types of implicational uni-
b. D > C > B > A
versals are given in (1a) and (2a), respectively, and the distribu-
tion of features that each universal relects is presented in (1b) It is important to note that the distributions captured in the form
and (2b) (in the form of a tetrachoric table). (he data depicted of these typological hierarchies deine a frequency cline, such
in (2b), where the numerals relect the number of languages that the phenomenon in D is more frequent than that in C, which
with the speciied features, are taken from Greenberg (1963, in turn is more frequent than in B, and so on. his follows from
Appendix II.) the fact that if any term involved in the hierarchy is present, all
the terms to the left of it on the chain must also be present. And if
(1) a. If the demonstrative follows the head noun, then the rela-
any term involved in the hierarchy is absent, all the terms to the
tive clause also follows the head noun
right of it must also be absent.
b. DemN NDem
Typological hierarchies thus make very strong statements
RelN +
about the possible distribution of properties across languages
NRel + +
and their overall frequency of occurrence. Consequently, much
typological research has been aimed at elaborating such hier-
(2) a. If a language has basic subject-object-verb (SOV) order,
archies, be it with respect to segmental inventories (see, e.g.,
than with greater than chance frequency it will have
the sonority hierarchy of Hopper 1976, 196), morpho-syntactic
postpositions
encoding (see, e.g., the complement deranking-argument
b. SOV non-SOV
hierarchy of Cristofaro 2003, 131), or behavioral properties
prep 5 73
(see, e.g., Keenan and Comries 1977 noun phrase accessi-
post 97 10
bility hierarchy).
Note that the ive prepositional SOV languages are counterex- While statistical implicational universals are the dominant
amples to the universal. Although many absolute implicational means of expressing typological generalizations by typologists,
universals have been posited in the typological literature, most, it must be pointed out that there is a long-standing controversy
particularly of the simple kind (see the following), have turned over whether such universals do indeed capture signiicant
out to be in fact statistical.. Needless to say, these may be assessed relationships between aspects of linguistic structure or merely
from the point of view of their relative strength (the number of incidental relationships resulting from historical accident
overall languages considered, the number of languages display- (see, e.g., Maslova 2000; Bakker 2008). A technique of critically
ing the features in question, the number of exceptions to the evaluating such universals has been recently developed by
universals) and their validity in relation to criteria such as those Elena Maslova (2003), and an excellent account of typological
presented in Stassen (1985, 201). universals in general is provided by William Croft (2003, 4969,
Implicational universals may be monodirectional or bidirec- 1228).
tional. he absolute implicational universal in (1) is monodi-
Anna Siewierska
rectional since NRel order is compatible with both DemN and
NDem. Consequently, while NDem order entails NRel, NRel
order does not entail NDem. he statistical implicational univer- WORKS CITED AND SUGGESTIONS FOR FURTHER READING
sal in (2), on the other hand, is bidirectional; not only does SOV Bakker, Dik. 2008. LINFER: Inferring implications from the WALS data-
order favor postpositions over prepositions, but postpositions base. STUF 61.3: 18698.
380
Indeterminacy of Translation
Cristofaro, Sonia. 2003. Subordination. Oxford Studies in Typology and ind a native speaker apparently referring to brass as vagavai. I
Linguistic heory. Oxford: Oxford University Press. might conclude that vagavai does not mean gold. But I might also
Croft, William. 2003. Typology and Universals. 2d ed. conclude that the speaker does not know the diference between
Cambridge: Cambridge University Press. brass and gold, that the speaker (e.g., a young child) is mistaken
Greenberg, Joseph. 1963. Some universals of grammar with particu-
about the meaning of vagavai, and so on.
lar reference to the order of meaningful elements. In Universals of
Of course, Quines claims are more signiicant and less intui-
Human Language, ed. Joseph Greenberg, 73113. Cambridge, MA:
MIT Press.
tive than this example suggests. To get a better idea of what is at
Hopper, Joan Bybee. 1976. An Introduction to Natural-Generative stake in the indeterminacy of translation, we might distinguish
Phonology. New York: Academic Press. two levels or perhaps two poles of uncertainty and indetermi-
Keenan, Edward, and Bernard Comrie. 1977. Noun phrase accessibility nacy. he irst is global. his is the level at which we may say that
and universal grammar. Linguistic Inquiry 8: 6399. any meaning may be mapped onto any translation, given enough
Maslova, Elena. 2000. A dynamic approach to the veriication of distri- manipulation of the rest of the system. For example, if we are will-
butional universals. Linguistic Typology 3/4: 30733. ing to revise enough of our beliefs about the world, we can main-
. 2003. A case for implicational universals. Linguistic Typology tain a translation of the French word lapin (rabbit) as stone.
7.1: 1018. Of course, it wont be easy. But it is possible. We expect any word
Stassen, Leon 1985. Comparison and Universal Grammar. Oxford: Basil
to have a number of misuses even by luent speakers. Typically,
Blackwell.
however, the correct uses overwhelmingly outnumber the incor-
rect uses. We may have to change our assumed proportions in the
INDETERMINACY OF TRANSLATION case of lapin. For example, we might infer that French speakers
are correct in their use of lapin only when they point to rabbit
In Word and Object (1960), one of the most inluential books in sculptures or when, at a distance, they mistake a stone for a rab-
the history of American philosophy, W. V. O. Quine presents an bit to put the matter in our commonsense, lapin = rabbit idiom.
argument along the following lines. One may always formulate Chomskys inluential criticisms of Quines views on inde-
mutually exclusive hypotheses regarding the meaning of a lin- terminacy most obviously concern this pole. Chomsky rightly
guistic item such that there is no fact as to which hypothesis is points out that all theories are underdetermined by evidence
correct. Put diferently, there are always divergent ways of dein- (1980, 15). One consequence is that we rely on other criteria for
ing a given word such that no one of these ways is correct. his adjudication, such as simplicity. It is presumably simpler (or, in
highly inluential, and highly controversial, indeterminacy of another terminology, less ad hoc) to assume that luent speakers
translation thesis extends Quines argument against analyt- err at similar rates for all frequently used common nouns than to
icity (the view that some statements are true simply due to the assume that lapin is exceptional in this regard. hus, one might
meaning of the terms see Quine 1961; see also meaning and conclude that there is uncertainty here. However, the uncertainty
belief) and points toward his conception of ontological relativ- is adjudicable (at least within limits), and there is no reason to
ity (that reference is not absolute, but relative to a coordinate conclude that this uncertainty implies indeterminacy.
system [Quine 1969, 48], that Factuality is internal to our he other level or pole is highly local. It is limited to difer-
theory of nature [Quine 1981, 23]). ences that we cannot formulate or, more generally, that we do
To explore Quines idea, we might begin with a simple dis- not encode (i.e., roughly, make into information that may be
tinction between decidability criteria and demarcation crite- processed cognitively) in particular cases. For instance, one of
ria. A decidability criterion allows us to chose among diferent Quines central examples is that of a ield linguist encounter-
hypotheses regarding a words meaning (see word meaning). ing gavagai. he ield linguist takes the term to mean rab-
A demarcation criterion distinguishes in principle between the bit. However, the available evidence is equally consistent with
validity of two such hypotheses. A demarcation criterion presum- undetached rabbit part, rabbit stage, and so on. Here, one
ably isolates a factual diference; a decidability criterion, irst of might argue that this is really best understood as a case of global
all, isolates an evidential diference (or, more broadly, a distinc- uncertainty/indeterminacy, which we have already consid-
tion by the lights of scientiic method; thus, it may incorporate ered. Speciically, given the structure of human perception and
simplicity or other adjudicative criteria). Quine argues that there human memory, it seems very unlikely that speakers of other
is no strict decidability criterion for meaning. We may refer to languages would not encode part and stage, thus distinguishing
this as uncertainty. But he also maintains that for at least some rabbit, undetached rabbit part, and rabbit stage in some way. But
set of mutually exclusive hypotheses, there is no demarcation that is merely a problem with the particular example. here are
criterion either. he latter is indeterminacy proper. clear cases where one language involves distinctions that speak-
Quines arguments are bound up with his holism, the ers of another language do not encode. For instance, a particular
view that hypotheses of any sort, including hypotheses regard- group may not distinguish between gold and brass or, better
ing meaning, are part of larger complexes of belief and are not still, fools gold. Examples such as this may suggest that there is a
understandable outside those complexes. Crucially, this means sort of indeterminacy at the subencoding level across languages,
that apparent counterevidence regarding one part of the whole or even within languages. For example, when I use the word gold,
need not have simple, local consequences. Such counterevi- I have no sense of diferent types of gold. In this sense, relative to
dence may be accommodated by alterations elsewhere in the jewelers, my use of gold is indeterminate among those diferent
whole. Suppose I am a ield linguist investigating a new language. types. On the other hand, one might reasonably contend that this
I encounter the word vagavai, which I take to mean gold. I then is not really indeterminacy at all, since there is presumably some
381
Indexicals
psychological fact about the degree of vagueness or ambigu- deictic use (see deixis), but indexicals have many other uses.
ity of my usage at particular times and places. I is used demonstratively when (1) is written near a picture of
hese arguments, then, point toward global, though still adju- Truman with an arrow pointing at the picture. It is used ana-
dicable uncertainty along with local, subencoding indeterminacy phorically in (2) as opposed to (3):
in the limited sense of vagueness or ambiguity. Both are signii-
(2) Truman believes I am president.
cant and both are highlighted by Quines discussions. However,
neither is radical indeterminacy. (3) Truman believes that I am president.
On the other hand, to make these arguments is to go against a It refers to Truman in (2), the speaker in (3). Indexicals are often
range of Quines other views for example, his behaviorism and used pseudodeictically in novels. When Gore Vidal wrote (4) in
his commitment to ontological relativity. Like all theories, Quines Live From Golgotha,
own theories operate holistically. Even if he were to accept some
version of the preceding arguments, he could accommodate (4) I am Timothy,
these arguments by alterations elsewhere in the system. hus, he was not referring to himself. Finally, indexicals are often used
despite such arguments, radical indeterminacy remains at least non-referentially, as in
a continuing and important philosophical challenge.
(5) Je means I in French.
Patrick Colm Hogan
(6) Many a car is such that it needs gas.
WORKS CITED AND SUGGESTIONS FOR FURTHER READING In a semantic use like (5), it is crucial that I be used with the irst
Chomsky, Noam. 1980. Rules and Representations. New York: Columbia person meaning we have been focusing on; if it means iodine,
University Press. (5) is false. Sentence (6) illustrates the quantiicational use, with
Fllesdal, Daginn, ed. 2001. Philosophy of Quine. Vol. 3. Indeterminacy of pronouns bound by quantiiers like variables in quantifica-
Translation. New York: Garland. tion theory.
Glock, Hans-Johann. 2003. Quine and Davidson on Language, hought Indexicals present a problem for formal theorists because
and Reality. Cambridge: Cambridge University Press. assigning extensions and intensions (functions from possible
Quine, W. V. 1960. Word and Object. Cambridge, MA: MIT Press. worlds to extensions) to indexicals cannot adequately represent
. 1961. From a Logical Point of View: Logico-Philosophical Essays. their role in determining the truth conditions of sentences (see
2d ed. New York: Harper and Row.
truth conditional semantics). David Kaplans ([1977]
. 1969. Ontological Relativity and Other Essays. New York: Columbia
1989) seminal solution was to represent the meanings of indexi-
University Press.
. 1981. heories and hings. Cambridge: Harvard University Press.
cals by assigning them characters (functions from contexts to
intensions). For I, Kaplan had in mind the function satisfy-
ing the following condition: i(c) is the intension whose value in
INDEXICALS any world w is the speaker uttering I in c. hus, the value of
i(c) in any context in which heodore Roosevelt used I is the
An indexical is a linguistic expression whose reference (exten- constant function tr(w) whose value is heodore Roosevelt for
sion) may vary from one context of use to another even when every world. he value of i(c) in any context in which Franklin
used with the same meaning (sense) and with respect to the Roosevelt used I is the constant function fdr(w) whose value
same possible world or story. An example is the pronoun I, is Franklin Roosevelt for every world.
which I use to refer to myself and you to yourself. Sentence (1) he character theory of meaning has many of the diiculties
itself is indexical: true when used by Harry Truman in 1951 but of referential theories generally. For example, I and the per-
not when used by Churchill. son identical to me have the same character, but not the same
(1) I am president. meaning (Freges problem). he character function for he would
seem to be undeined when it is used in After Santa came down
Tensed verbs (am, was) are indexical, as are many adjectives the chimney, he left the presents; Santa does not exist (Russells
(foreign, local), common nouns (enemy, neighbor), and adverbs problem). Quantiicational and semantic uses are nonreferential
(yesterday, recently). for diferent reasons. Another diiculty is the Enterprise problem.
Indexicals are often deined simply as terms with diferent Suppose Mary points at the bow of a big ship and says hat is
extensions in diferent contexts. But plane has diferent exten- the Enterprise and then points at the stern and says hat is not
sions (airplanes versus wood planes) because it is ambiguous the Enterprise. he character function for that would assign it
rather than indexical. I too is ambiguous, meaning one (1) or the same intension in both these contexts the one whose value
iodine as well as me. But it is used with the very same mean- at every world is the Enterprise. Yet Mary is not contradicting
ing when you and I use it to refer to ourselves. Furthermore, while herself the way she would if the same pointing gesture accompa-
the 34th president refers to Dwight Eisenhower when describ- nied both her utterances. he essential indexical problem is that
ing the actual world, it refers to Adlai Stevenson when describ- sentences like (2) and (7) may difer in truth value, as in cases of
ing a hypothetical case in which he won in 1954. he referent of amnesia or delusion.
an indexical varies even when speakers are describing the same
possible world. (7) Truman believes that Truman is president.
It is often said that the meaning of I is given by the rule that On the standard analysis, S believes p is true if and only if S
its referent in any context of use is the speaker. his holds for the stands in the belief relation to the proposition expressed by
382
Inequality, Linguistic and Communicative
p. But if propositions are intensions, then the complement in varieties of language, varieties that did not ofer the same social,
(2) expresses the same proposition as the one in (7): the proposi- cultural, and political rewards as others.
tion true in any world if and only if Truman is president. he central insight here is indeed the parallelism between
Fregean theories solve the essential indexical problem by tak- linguistic variation and social diferentiation, between linguis-
ing indexicals to express a distinctive type of mode of presentation tic diferences and social hierarchies and forms of stratiication
(concept or mental representation) and taking proposi- often organized around the dynamics of prestige and stigma.
tions to be structured entities consisting of modes. hen, (2) his insight has been developed by a number of scholars in soci-
entails that Truman believes the proposition whose subject is his olinguistics and related ields of study (Irvine 1989). I irst give a
own self-concept, whereas (7) entails that Truman believes brief survey of the development of this topic in sociolinguistics
the proposition whose subject is the concept of Truman. Marys and then turn to a discussion of two authors: Pierre Bourdieu
two utterances are not contradictory because they express prop- and Dell Hymes.
ositions with diferent modes of presentation of the Enterprise. A
diiculty for Fregean theories is to explain why I is not ambigu- Inequality in Sociolinguistics
ous if it is used by diferent speakers to express diferent con- Bernsteins thesis about elaborated and restricted codes became
cepts. Another is to account for nonreferential uses, in which I is a deeply controversial one, though often for the wrong reasons. It
not used to express the speakers self-concept. coincided with the emergence of modern sociolinguistics, a dis-
cipline that saw itself initially as devoted to the study of linguistic
Wayne A. Davis variation and distribution as a horizontal exercise: an exercise
in mapping linguistic varieties over fragments of a population,
WORKS CITED AND SUGGESTIONS FOR FURTHER READING based on an assumption of the fundamental equivalence of every
Braun, D. 1996. Demonstratives and heir linguistic meanings. Nos linguistic variety. hus, Bernsteins vertical image of stratiied
30: 14573. variation variation that comes with diferential value attribu-
Burks, A. W. 1949. Icon, index, and symbol. Philosophy and tion was countered by William Labov (1972) and others, who
Phenomenological Research 9: 67389. argued that a linguistic variety such as black American English
Evans, G. 1980. Pronouns. Linguistic Inquiry 11: 33762. should not be seen as bad English (which Labov chose to equate
Forbes, G. 1987. Indexicals and intensionality: A Fregean perspective. with Bernsteins restricted code) but as a complex, sophisticated
Philosophical Review 96: 331. code used by virtuoso speakers. Labov, like Bernstein, started
Frege, G. [1918] 1977. houghts. In Logical Investigations, ed. from an awareness of real inequalities in language: he black
P. T. Geach, 130. Oxford: Basil Blackwell.
American English variety was stigmatized in U.S. education and
Gale, R. M. 1967. Indexical signs, egocentric particulars, and token-
its speakers were often negatively categorized. But Labovs eforts
relexive words. In Encyclopedia of Philosophy. Vol. 4. Ed. P. Edwards,
1514. New York: Macmillan.
were aimed at demonstrating the intrinsic linguistic equiva-
Kaplan, D. [1977] 1989. Demonstratives. In hemes from Kaplan, ed. lence of these diferent varieties, whereas Bernstein addressed
J. Almog, J. Perry, and H. Wettstein, 481563. Oxford: Oxford University the extrinsic attributional, ideological nonequivalence of
Press. the varieties. Seen from a historical distance, both eforts were
Knne, W. 1997. First person propositions: A Fregean account. connected by a joint concern for how particular language vari-
In Direct Reference, Indexicality, and Propositional Attitudes, ed. eties disenfranchized speakers, not because of their intrinsic
W. Knne, A. Newen, and M. Anduschus, 4968. Stanford, CA: CSLI inferiority but because of social and political perceptions of the
Publications. varieties and their speakers. In other words, both were addressing
Nunberg, G. 1993. Indexicality and deixis. Linguistics and Philosophy a language-ideological efect in which linguistically equivalent
16: 143
varieties indexed socially unequal features and categories.
Perry, J. 1979. he problem of the essential indexical. Nos 13: 321.
Similar ideas were to be found in the work of many of the early
sociolinguists. John Gumperz (1982) stressed the diferent ways
in which intercultural misunderstandings were not a result of
INEQUALITY, LINGUISTIC AND COMMUNICATIVE
participants intentions but were an efect of small diferences
he idea that linguistic variation goes hand in hand with linguis- in linguistic and communicative structures in a particular con-
tic inequality was brought on stage by Basil Bernstein and started text of diferent contextualization cues. In his work as well, we
inluencing the development of the new discipline of sociolin- encounter an awareness of relativity: Diferent language varieties
guistics, which became a discipline devoted to the study of the can fulill the same functions, but the perception of these func-
unequal and nonrandom distribution of linguistic resources in tions by people not sharing the contextual conventions of a par-
society. Bernstein (1971), in an attempt at understanding class ticular variety can difer. Here again, we encounter the tension
diferences in the education system, distinguished between two between intrinsic equivalence and extrinsic nonequivalence.
codes, one elaborate and characteristic of middle-class chil- From a diferent vantage point, sociolinguists concerned
dren, another restricted and characteristic of working-class with language policy stressed the fact that the political
and minority children. he elaborate code was the normative institutionalization of language in multilingual societies (see
one: It was privileged by the teachers and used as the yardstick, bilingualism and multilingualism) usually involved a
not only for good language but also for wider behavioral and stratiication based not on intrinsic superiority of one language
cognitive assessments of pupils. Bernstein demonstrated that but on prestige hierarchies and ideological images of society.
speaking diferent varieties of language meant speaking unequal J. Fishman (1974) presented studies in which former colonial
383
Inequality, Linguistic and Communicative
as well as local languages were lifted ideologically to the status a constant reenactment of communicative practices dependent
of prestige language by means of language policies favoring a on these exclusive resources (Bourdieus analyses of academic
standardized, tightly controlled, and preferably ethnically discourse are telling in this respect; see Bourdieu, Passeron, and
neutral language; C. Eastman (1983) described the diferent de Saint Martin 1994).
stages of language planning, demonstrating how language vari- It is at this point that the contribution of Hymes (1980, 1996)
eties could be turned into institutionalized, power-emanating comes into view. He draws on a long anthropological tradition of
elements of social structure. seeing language in context and use speech and when lan-
A central ingredient of every form of language planning is the guage is seen from this practical angle, diversity and inequality
construction of a written, orthographically standardized variety are the rule rather than the exception. Whereas Bernsteins and
of language. Scholars in the ield of literacy also devoted atten- Bourdieus relections on language remained largely abstract
tion to the ways in which literacy may introduce stratiication in and general, Hymess approach is soundly empirical. Hymes
languages and communities and often becomes an opportunity starts from registers, repertoires, and genres rather than
as well as an obstacle deining peoples social mobility trajec- languages, and such practical linguistic and communicative
tories (Street 1995; Kress 2000). he creation of a literate norm instruments are performed in contexts where they take actual
almost invariably means that access to the prestige variety of shapes: conversations, stories, lectures, and so on. Hymes
the language comes to be controlled by the education system, himself focused on narrative, and he observed that one
which functions as a very efective ilter on social mobility. form of inequality in our society has to do with rights to use
narrative, with whose narratives are admitted to have cog-
State of Affairs nitive function (1980, 126). he reason is that conventions for
In all of the work discussed so far, there is an awareness of producing narratives are culturally and socially sensitive, and
inequality, notwithstanding that the emphasis would be on dif- particular contexts in society require particular types of narra-
ference and variation, rather than on the power and inequality tive. People can be very good storytellers in their neighborhood
dimensions of diference and variation. Other scholars developed but inefective ones in front of a judge in court, because the
full-blown theories of linguistic and communicative inequality, stylistic and genre conventions for those particular events are
and two stand out: Bourdieu and Hymes. fundamentally diferent. hus, the capacity to be recognized as
Bourdieu (1991) emphasized the ways in which language is a competent speaker or an articulate storyteller, lecturer,
part of the superstructural apparatus of society; as a nonmate- or conversationalist is a socially sensitive phenomenon on
rial resource, it can nevertheless be imbued with an economic which social forces of diferential value attribution operate. For
value: social or cultural symbolic capital. As in hard economic Hymes, this dialectic between real capacity and expected per-
sectors, the value of diferent resources is diferent and can luc- formance deines voice, and one can be voiceless for many
tuate, and not every resource can be traded for another one; the reasons (Blommaert 2005).
terms of exchange are unpredictable. Bourdieus work on lan- he approaches of Bourdieu and Hymes both present a fully
guage itted into his larger program of analyzing the economies developed theory of inequality in the ield of language and com-
of taste, culture, and ideas from within a generalized materialism munication. From their work, we can see how the production of
and with the aim of redeining (and empirically substantiating) meaning is a regulated, regimented process in which the speak-
the notion of social class. In his view, the traditional distinc- ers choice is never unlimited and in which the efects of his/her
tion between material and immaterial resources in society was words are judged by others using social and ideological yard-
unjustiied since the same forces of diferential value allocation sticks. he advantage of Hymess approach is its strongly devel-
appeared to operate on both: Class is both a material and an oped empirical dimension, which ofers a range of opportunities
immaterial thing, and language diferences do play a role in this for applied research.
value allocation. Consequently, the stability of class as a material
complex is also there in the immaterial aspects: A lack of real capi- Evaluation
tal is often paralleled by a lack of symbolic capital, and social hier- Despite its implicit focus on inequality, sociolinguistics privi-
archies play as much into the symbolic diacritics as they do into leges diference and variation as its objects and avoids explicit
the material ones. (See also field and habitus, linguistic.) analyses of how such variation converts into social inequality.
It is not diicult to see the similarities between the project his is an efect of the descriptive bias in sociolinguistics, as well
developed by Bourdieu and the (more limited) one presented as of the hesitant and ambivalent relationship between socio-
by Bernstein. Both treat language not just as an opportunity a linguistics and social theory (Williams 1992). Many studies of
positive feature of humans but also as a constraint, as some- variation, consequently, consider diferences merely in terms of
thing inherently limited and limiting. And rather than enabling spread and distribution and see the power and inequality rela-
people to perform particular acts, language also restricts them tionships among language varieties in simpliied terms. he fact
and prevents them from performing other acts. Underlying this that speaking with a stigmatized accent is not merely a matter
idea of language as constraint is a model of society as nonegali- of distribution but a matter of social opportunities, determined
tarian and stratiied, with relatively stable strata, some of which by the way in which others rank such accents in an ideologically
are tightly controlled as to access while others are more demo- informed hierarchy, could be more central to sociolinguistic
cratically accessible. Particular language resources are required relections.
to move into the more controlled strata, and membership of his is a critical insight and eminently applicable to many
these strata for example, elite, professional milieux requires sociolinguistic and discursive phenomena in the contemporary
384
Infantile Responses to Language
world. Emergent applied studies have shown its relevance, recently, brain measures. he major behavioral measures rely on
for example, in the ield of asylum seekers narratives (Maryns infants ability to associate a behavioral response with stimuli.
2006), literacy in education (Collins and Blot 2003), and social Behavioral measures examine either their responses to new
work (Hall, Slembrouck, and Sarangi 2006). stimuli after habituation or their attention (e.g., eye gaze, head
turn) to certain types speech or language stimuli. For example,
Jan Blommaert
High-Amplitude Sucking and Conditioned Headturn are both
habituation paradigms. Researchers using these paradigms
habituate the infant to the stimulus, change the stimulus, and
Bernstein, B. 1971. Class, Codes and Control. Vol. 1. heoretical Studies then measure whether the infant responds to the change. Other
Towards a Sociology of Language. London: Routledge and Kegan Paul. procedures, namely, the Headturn Preference Procedure and the
Blommaert, J. 2005. Discourse: A Critical Introduction. Cambridge: Intermodal Preferential Looking Paradigm, measure how long
infants attend to stimuli by measuring either headturns toward
Bourdieu, P. 1991. Language and Symbolic Power. Cambridge,
attention-getting lights or gazes to pictures/video accompanying
UK: Polity.
Bourdieu, P. J. C. Passeron, and M. de Saint Martin. 1994. Academic
the stimuli. Biological techniques measure reactions to speech,
Discourse: Linguistic Misunderstanding and Professorial Power. most frequently by measuring infant heart rate. More recent
Stanford, CA: Stanford University Press. research has turned to brain measures, including electrophysi-
Collins, J., and R. Blot. 2003. Literacy and Literacies. Cambridge: cal responses such as event related-potentials (ERP), electroen-
Cambridge University Press. cephalogram (EEG), and neuroimaging techniques such as
Eastman, C. 1983. Language Planning: An Introduction. San Francisco: fMRI (functional magnetic resonance imaging) to examine early
Chandler and Sharp. speech and language perception. For an overview of methods,
Fishman, J., ed. 1974. Advances in Language Planning. he Hague: see Jusczyk (1997); Mills, Cofey-Corina, and Neville (1997).
Mouton. Using experimental methods, researchers have investigated
Gumperz, J. 1982. Discourse Strategies. Cambridge: Cambridge University
a wide range of issues concerning infants understanding of
Press.
speech and language, including responses to individual speech
Hall, C., S. Slembrouck, and S. Sarangi. 2006. Language Practices in Social
Work. London: Routledge.
contrasts, diferences in speakers, sensitivity to the prosody of
Hymes, D. 1980. Language in Education: Ethnolinguistic Essays. the language, lexical learning, and ability to detect gram-
Washington, DC: Center for Applied Linguistics. matical patterns. (See also speech perception in infants.)
. 1996. Ethnography, Linguistics, Narrative Inequality: Toward an his research has shown that fetuses can hear during the third
Understanding of Voice. London: Taylor and Francis. trimester of development (although the sound is low-pass il-
Irvine, J. 1989. When talk isnt cheap: Language and political economy. tered by the amniotic luid and the uterine walls) and will recog-
American Ethnologist 12: 73848. nize recurrent patterns (e.g., rhymes) from the mothers speech
Kress, G. 2000. Early Spelling: Between Convention and Creativity. (DeCasper et al. 1994). Neonates prefer their mothers voices
London: Routledge. to others (DeCasper and Fifer 1980). In addition, neonates and
Labov, W. 1972. Language in the Inner City. Philadelphia: University of
infants prefer to listen to infant-directed speech (speech with
Pennsylvania Press.
fewer words per utterance, more repetition, slower articula-
Maryns, K. 2006. he Asylum Speaker: Language in the Belgian Asylum
Procedure. Manchester: St Jerome.
tion, and greater prosodic swings) over adult-directed speech
Street, B. 1995. Social Literacies. London: Longman. (Fernald 1985; Cooper and Aslin 1990)
Williams, G. 1992. Sociolinguistics: A Sociological Critique. Two-month-old infants can distinguish their native language
London: Routledge. from other languages on the basis of prosodic characteristics; by
six months infants are sensitive to their native language stress
patterns and by nine months to native language phonotactics.
INFANTILE RESPONSES TO LANGUAGE
Infants use prosodic information to group words and are sen-
Infant responses to speech and language have been used to study sitive to prosodic units that mark both clauses (by 46 months)
both early communicative development and early language per- and phrases (by 9 months). As prosodic units often correspond to
ception in infants. Observations of interactions between infants grammatical units, this ability may give infants additional infor-
and caregivers have shown that infants under three months of mation about grammatical structure. here is also evidence that
age become quiet in response to speech. Infants as young as prosody can help infants remember information about speech
six weeks engage in interactions or protoconversations, or turn- (Jusczyk 1997).
taking in gaze and/or vocalizations with caregivers (Bruner 1975; Work on the perception of speech contrasts has shown that
Trevarthen 1974). Between four and six months, infants respond from birth, infants can discriminate both consonants and vowels
to speech with vocalizations (see babbling), and by six to nine that contrast in any of the worlds languages, even if the sounds
months they begin initiating interactions, including games such are not found in the infants native language (Werker and Curtin
as peek-a-boo. 2005; Kuhl 2000). his ability holds even if the sounds are pro-
Experimental investigations into infant responses to speech duced by diferent speakers of diferent ages. Work has also
have yielded information about infants abilities to decode lan- investigated the acoustic cues infants use to detect diferences,
guage well before they begin to produce language themselves. as well as the nature of categorical perception. Results suggest
Numerous experimental methods have been employed to exam- that infants can be sensitive to the same range of acoustic cues
ine their perception, including behavioral, biological, and, more as adults, including the efects of coarticulation (where sounds
385
Infantile Responses to Language
in a word afect the articulation of each other) and the multiple a compound noun (Oscar and Elmo are running) are matched
acoustic cues that can indicate the identity of a phonetic con- with intransitive actions. hus, infants are able to both track
trast. he ability to discriminate phonetic contrasts from native grammatical information and use word order to determine rela-
and unfamiliar languages is maintained until 6 months, but by tionships in English (Hirsh-Pasek and Golinkof 1996).
1012 months, infants only discriminate contrasts that are pho- More work is needed to examine how infants from diferent
nologically contrastive in the target language. his shift from backgrounds, in particular infants from non-Western cultures,
language-general to language-speciic discrimination occurs non-Indo-European languages, and bilingual infants, perceive
irst for vowels (by 68 months) and later for consonants (1012 speech and language. Existing work with bilingual infants sug-
months) (Werker and Curtin 2005). gest that exposure to more than one language helps infants
Infants perception of speech sounds shifts as they begin maintain categorical contrasts found in both languages (Werker
to learn words. Infants younger than 8 months are sensitive to and Tees 1984) and that lateralization can occur independently
phonetic diferences, while infants approaching one year ignore for each language to which an infant is exposed (Conboy and
allophonic variation and attend only to phonological diferences Mills 2006; see bilingualism, neurobiology of). Research
found in their native language(s). At the same time, young word is just beginning to explore the relationship between early speech
learners (14-month-olds) do not attend to some phonetic detail perception and later language development. Early results sug-
(e.g., dih vs. bih) when in a word-learning task (Stager and gest that some aspects of speech perception are related to later
Werker 1997), even though younger infants can discriminate vocabulary development (e.g., Newman et al. 2006).
these contrasts in perception tasks and older infants attend to One area of debate concerns protoconversations (turn-
these diferences in word-learning tasks. hus, the experience taking in gaze and/or vocalizations), whether they are driven
learning language inluences how and when contrasts are per- primarily by the caregivers or the infant takes an active role in
ceived (Kuhl 2000). shaping them. Research suggests an active role for infants in
Infants also attend to statistical regularities in the speech co-constructing interactions (e.g., Trevarthen, Kokkinaki, and
stream. hese abilities have been shown in the perception of Fiamenghi, Jr. 1999). Another long-standing debate resulting
phonetic segments, in word segmentation, and grammatical from experimental work concerns whether infants responses
patterns (Safran 2003). Attention to statistical regularities can result from a specialized speech-processing system or from
help infants extract familiar patterns and track relationships general auditory processing plus categorization abilities. Most
within a speech stream and, thus, may help infants uncover researchers currently argue that the ability to discriminate
basic structural relations in sentences as they move into the speech contrasts is domain-general with some innate perceptual
acquisition of grammar. Infants sensitivity to grammar has biases (Kuhl 2000). A further current debate parallels an ongo-
been demonstrated both behaviorally and with brain measures. ing debate in cognitive science concerning whether infants use
It begins with sensitivity to the phonetic form of function words statistical regularities alone or create abstract rules from statis-
that mark grammatical relations. English-learning infants of tical information (Safran 2003). Evidence from other domains
1011 months demonstrate sensitivity to the phonetic proper- (e.g., production) suggests that infants can abstract rules, and
ties of functor words and can use them to help segment nouns. most linguists would argue for rule creation rather than pure
French-learning and German-learning infants show even earlier statistical learning (Marcus et al. 1999), while many connec-
sensitivity to functor words, segmenting them by 78 months of tionists would argue that rule-like efects are artifacts of stable
age. By 16 months, infants can distinguish passages in which connections. hey argue that learning is based on the speciic
English functors are properly or improperly ordered (Shady, stimuli and is possible without the abstraction of rules (Conway
Jusczyk, and Gerken 1998). By 18 months, infants are beginning and Christiansen 2006).
to be able to track grammatical relations, such as that which
Lynn Santelmann
occurs between is and -ing.
Infants acquiring both English and German are able to per-
form these tasks, though their ability to do so is inluenced by WORKS CITED AND SUGGESTIONS FOR FURTHER READING
language-speciic factors (Santelmann and Jusczyk 1998; Hhle Bruner, Jerome S. 1975. From communication to language: A psycho-
et al. 2006). English infants are sensitive to the distance between logical perspective. Cognition 3: 25587.
the morphemes, whereas German infants are sensitive to the Conboy, Barbara T., and Debra L. Mills. 2006. Two languages, one devel-
type of constituent. In addition, by 17 to 18 months, infants are oping brain: Event-related potentials to words in bilingual toddlers.
able to use the presence of nouns in a sentence to help decode Developmental Science 9.1: F1F12.
verb meanings. his result is most likely due to the structure of Conway, Christopher M., and Morten H. Christiansen. 2006. Statistical
the two languages: English allows few elements between auxil- learning within and between modalities: Pitting abstract against stim-
ulus-speciic representations.. Psychological Science 17: 90512.
iary and main verbs (mostly adverbs, or in questions, subjects).
Cooper, R. P., and R. N. Aslin. 1990. Preference for infant-directed
he structure of German (with noninite verbs occurring at the
speech in the irst month after birth. Child Development 61: 158495.
end of the sentence) requires objects and adverbs to occur in
DeCasper, A. J., and W. P. Fifer. 1980. Of human bonding: Newborns
between the auxiliary and main verbs. It appears that infants prefer their mothers voices. Science 208: 11746.
in the two languages are sensitive to these distributional prop- DeCasper, Anthony J., Jean-Pierre Lecanuet, Marie-Claire Busnel,
erties in the syntax and are using these patterns to help orga- Carolyn Granier-Deferre, and Roselyne Maugeais. 1994. Fetal reac-
nize the speech stream. Sentences with two nouns (Oscar chases tions to recurrent maternal speech. Infant Behavior and Development
Elmo) are matched with transitive actions, while sentences with 17: 15964.
386
Information Structure in Discourse
Fernald, A. 1985. Four-month-old infants prefer to listen to motherese. or salience, will have certain repercussions on its linguistic
Infant Behavior and Development 8: 18195. realization; in particular, it will inluence grammatical choices
Hirsh-Pasek, Kathy, and Roberta M. Golinkof. 1996. he Origins of (most prominently word order patterns but also, for instance,
Grammar: Evidence from Early Language Comprehension. Cambridge, voice), prosodic choices (choice of intonation contours and,
MA: MIT Press. Overview of studies examining infants sensitivity to
especially, placement of sentential stress), and lexical choices
grammar, including description of methodologies.
(such as deiniteness, ellipsis, pronominalization, and use of
Hhle, B., M. Schmitz, L. M. Santelmann, and J. Weissenborn. 2006.
he recognition of discontinuous verbal dependencies by German
speciic particles).
19-month-olds: Evidence for lexical and structural inluences on Modern notions of information structure can be traced back
childrens early processing capacities. Language Learning and to the Prague School work on functional sentence perspective, as
Development 2.4: 277300. summarized most accessibly in Firbas (1992). J. Firbas developed
Jusczyk, P. W. 1997. he Development of Speech Perception. Cambridge, the idea that sentence elements have varying degrees of commu-
MA: Bradford Books. Overview of studies examining infants sensitivity nicative dynamism (CD), depending on the extent to which they
to grammar, including description of methodologies. carry the message forward. In the unmarked case (the basic
Kuhl, Patricia K. 2000. A new view of language acquisition. Proceedings distribution), degrees of CD will be relected in the linear order-
of the National Academy of Science 97: 118507. Overview of historical ing of elements, with sentences starting with the element carry-
positions and framework incorporating both innate biases and statisti-
ing the lowest CD (deined as the theme), followed by a gradual
cal learning.
rise in CD. It should be noted that there is a relationship between
Marcus, G. F., S. Vijayan, S. Bandi Rao, and P. M. Vishton. 1999. Rule
learning by seven-month-old infants. Science 283: 7780.
the degree of CD of an element and its status in terms of given-
Mills, Debra L., Sharon Cofey-Corina, and Helen J. Neville. 1997. new (see the following).
Language comprehension and cerebral specialization from 13 to 20
months. Developmental Neuropsychology 13: 397445. Given-New Structure versus Theme-Rheme Structure
Newman, Rochelle S., Nan Bernstein Ratner, Ann Marie Jusczyk, Peter W. Two dimensions, which may be to some extent correlated but
Jusczyk, and Kathy A. Dow. 2006. Infants early ability to segment the should be kept conceptually distinct, are important here. On
conversational speech signal predicts later language development: A the one hand, the degree of givenness of a piece of information
retrospective analysis. Developmental Psychology 42: 64355. relects the extent to which an element can be treated as in some
Safran, Jenny R. 2003. Statistical language learning: Mechanisms and
way recoverable from what precedes or can be assumed to be
constraints. Current Directions in Psychological Science 12.4: 11014.
present in the hearers consciousness; its thematicity, on the
Santelmann, Lynn M., and Peter W. Jusczyk. 1998. Sensitivity to discon-
other hand, relects the extent to which it represents what the
tinuous dependencies in language learners: Evidence for limitations in
processing. Cognition 69: 10534. message is about. As an illustration of how these dimensions
Shady, Michele, Peter W. Jusczyk, and LouAnn Gerken. 1998. Infants inluence linguistic choices, consider the following example:
sensitivity to function morphemes. Proceedings of the Annual Boston
(1) A: What happened to Mary? Is she still single?
University Conference on Language Development 19: 55363.
B: Well, Mary married John last year, but shes already
Stager, Christine, and Janet F. Werker. 1997. Infants listen for more pho-
divorced him.
netic detail in speech perception than in word learning tasks. Nature
388: 3812. In Bs response, the noun phrase (NP) Mary in the irst sen-
Trevarthen, Colwyn. 1974. Conversations with a two-month old. New tence (or, more accurately, the piece of referential informa-
Scientist 62: 2305.
tion represented by the NP Mary) is given information, as it
Trevarthen, Colwyn, heano Kokkinaki, and Geraldo A. Fiamenghi, Jr.
was mentioned in the immediately preceding turn; John, on
1999. What infants imitations communicate: With mothers, with
the other hand, is new information, as it cannot be assumed to
fathers and with peers. In Imitation in Infancy, ed. Jacqueline Nadel
and George Butterworth, 12785. New York: Cambridge University be in the hearers consciousness. In Bs second sentence, both
Press. Mary and John are now given information, which explains why
Werker, Janet F., and Suzanne Curtin. 2005. PRIMIR: A developmen- they can be pronominalized as she and him, respectively. At
tal framework of infant speech processing. Language Learning and the same time, it could be argued that both sentences are more
Development 1.2: 197254. Develops a model of early speech percep- about Mary than about John and that Mary is, therefore, the
tion/word learning. Concise overview of indings of speech perception theme of both sentences, the remaining information being rhe-
research. matic (i.e., providing additional information about the theme).
Werker, J. F., and R. C. Tees. 1984. Cross-language speech percep- Although both levels are functionally independent, they do
tion: Evidence for perceptual reorganization during the irst year of
appear to correlate strongly, in that themes tend to consist of
life. Infant Behavior and Development 7: 4963.
given information.
Example (1) also illustrates some tendencies with regard to
word order and prosody. First of all, new information tends to
INFORMATION STRUCTURE IN DISCOURSE
occur toward the right of the sentence, whereas given informa-
he phrase information structure is used to indicate the organization tends to be initial (in English, therefore, being an subject-
tion of elements within a sentence in terms of their pragmatic verb-object (SVO) language, there is a strong correlation between
contribution (in terms of givenness-newness, theme-rheme) givenness and grammatical subjecthood, as shown by Mary in
to a piece of discourse or text, as opposed to their syntactic Bs contribution). Secondly, thematic information tends to be
role (subject, object, etc.) or their semantic role (agent, goal, sentence-initial. In fact, some linguists argue that the theme
beneiciary, etc.). A sentence elements degree of importance, is initial by deinition (e.g., Halliday 1994). hirdly, the main
387
Information Structure in Discourse
sentence stress, or tonic nucleus (i.e., the syllable carrying the Previous Research on Given-New
stongest degree of prosodic prominence), normally falls some- In her inluential paper on the given-new distinction, E. F. Prince
where on the new information, namely, on John in sentence 1 of (1981) starts with an appealing classiication of the literature on
Bs turn and perhaps on the second syllable of divorced in sen- given-new according to three approaches to givenness: given-
tence 2. ness in terms of shared knowledge, in terms of cognitive salience,
he efects of givenness and/or thematicity on word order and, inally, in terms of recoverability or predictability. When
are acknowledged by most authors. However, in a substantial givenness in terms of shared knowledge is evaluated, the basic
proportion of the worlds languages, word order is partially, or criterion is the speakers assumption that the hearer knows,
even predominantly, determined by syntactic rather than infor- assumes, or can infer a particular thing (but is not necessarily
mation structure considerations. English is a case in point since thinking about it). his deinition is problematic in that it fails
it has a fairly strict SVO order from which it is hard to deviate. to distinguish systematically between knowledge that the hearer
Some languages, therefore, have developed other mechanisms can derive through contextual clues (such as is needed for the
for marking information structure, such as particular nonpro- decoding of an anaphoric element) and what he/she knows
totypical patternings resulting in diferent ordering of elements. as part of his or her background knowledge of the world.
Examples that are prominent in English are the passive voice Givenness in terms of cognitive salience can be deined as
(not, strictly speaking, a word order variation but nevertheless knowledge that the speaker assumes to be in the hearers con-
a syntactic operation that has a big impact on linear order); cleft sciousness at the time of utterance. One problem with this view
and pseudocleft constructions (e.g., It is a beer that I would like is that while it might well represent an accurate picture of the
or What I would like is a beer); the fronting of an element (some- decisions that the speaker has to make on a cognitive level, this
times referred to as topicalization, or Y-movement; e.g., A dimension is inherently inaccessible to the outside observer. A
beer I would like); extraposition of an element (e.g., It is enjoy- discourse analyst confronted with textual data can only assess
able to drink a beer on a warm evening); and left and right dis- what the speaker might have assumed as given by examining the
location (e.g., [As for] John, he loves beer and He loves beer, John linguistic choices the speaker has made (in terms of word order,
[does], respectively). Some languages may employ other formal intonation, pronominalization, deiniteness, and so forth).
resources for marking thematicity or givenness-newness, the he view of givenness in terms of recoverability, inally, is most
best known examples perhaps being particles (such as ga and often associated with Hallidays systemic-functional framework
wa in Japanese). (e.g., 1994), which was in turn inluenced by the Prague School.
Example (1) also reveals some potential problems for any Given information is deined as information that is predictable
deinitions of givenness or thematicity. First of all, it raises the or derivable from the preceding discourse context, new infor-
question whether the given-new distinction is a simple binary mation being deined as what the speaker presents as not being
distinction. he verb married in Bs irst sentence, for instance, recoverable. Important to note here is that, for Halliday, given-
is new in the sense that it has not been mentioned, but one could new information is marked almost exclusively through prosody,
also argue that previous mention by A of the element single has more particularly the placement of the tonic nucleus (or sentence
to some extent activated its antonym (see spreading activa- stress). Word order does play an important part in information
tion), so that married does not have quite as high a degree of structure, but is argued by Halliday to be an indicator of thema-
newness as, say, last year. Secondly, it is important here to distin- ticity rather than givenness-newness. Having said that, he does
guish between thematicity on a sentence level, which attempts to acknowledge that some noncanonical word order formats, such
provide an answer to the question what a sentence (or clause) is as cleft sentences, can be used as markers of given-new low.
about, and the notion of aboutness on the macro level of a longer Princes taxonomy of given-new information (1981) deines
stretch of discourse; in the latter case, the term topicality or topic- givenness (or assumed familiarity, as she calls it) in terms of
hood is often employed (see Brown and Yule 1983). In the previ- speaker assumptions regarding an elements cognitive salience
ous example, one could argue that the entire exchange is more but ofers an attempt at a more sophisticated classiication.
about Mary than it is about John, making Mary the discourse First of all, elements can be new, inferrable, or evoked (given).
topic for the whole exchange (but not necessarily the theme for New entities can be either brand new (i.e., not assumed to be in
all sentences comprising the exchange). any way known to the hearer) or unused (i.e., part of the hear-
It should be noted, incidentally, that conceptual vagueness ers background knowledge but not in his/her consciousness at
and terminological confusion abound in the literature: he the time of utterance). Inferrables can be retrieved from other
terms topic and theme are often used interchangeably, as are entities via inferential processes (as in he car was useless, as
terms such as given-new and background-focus. As has already the battery was lat, where the battery is inferrable from the car
been pointed out, Firbas deines the theme as the element with through the inference that cars have batteries). Evoked entities,
the lowest degree of CD. Others deine theme in terms of about- inally, can be either textually evoked (i.e., from the surround-
ness, which is reminiscent of the concept of psychological sub- ing text) or situationally evoked (i.e., through the physical con-
ject, or what the speaker is talking about. M. A. K. Halliday text). he familiarity status of an element will have an inluence
deines the theme as the point of departure for the message, on its potential linguistic realization: Brand-new entities, for
which always correlates with clause-initial position; the rheme instance, will be typically realized as indeinite full NPs (e.g.,
then represents what is said about the theme. He thus assumes a guy in I met a weird guy yesterday), whereas unused entities
a direct link between discourse function (theme) and linguistic will be deinite NPs (e.g., the game in I went to the Knicks game
form (word order). yesterday).
388
Information Theory
Challenges for Future Research through the channel (for instance, the speaker produces Paul
One important drawback of most research in this area of dis- but the hearer understands ball).
course analysis has been that quite often, a top-down analytical While mainstream linguistics is focused on human language,
apparatus is employed, whereby a classiication of givenness information theory has been applied to many other contexts,
types is proposed on the basis of constructed examples and such as the communication systems of other species (McCowan,
is then applied to actual texts (often focusing on narra- Hanser, and Doyle 1999; Suzuki, Buck, and Tyack 2006), genetic
tives, mostly written narrative texts). On the whole, informa- information storage in the DNA (Li and Kaneko 1992; Naranan
tion structure in conversation has not received the attention it and Balasubrahmanyan 2000), and artiicial systems such as
deserves; some so-called word order variations appear primar- computers and other electronic devices (Cover and homas
ily to be interactive phenomena, cases in point being left and 1991).
right dislocations (see Geluykens 1992, 1994). In particular, the Information theory has myriad applications even within the
dynamic, procedural nature of conversation as a collaborative domain of the language sciences. I give only a few examples. First,
enterprise and the efect that the turn-taking system may have it provides powerful metrics in psycholinguistics for mea-
on the givenness status of elements have not yet been exam- suring the cognitive cost of a) processing a word (McDonald and
ined systematically. In addition, there is a dearth of experimen- Shillcock 2001), b) an inlectional paradigm (Moscoso del Prado
tally based analyses trying to determine the exact efect of one Martn, Kostic, and Baayen 2004), or c) the whole mental lexi-
potential variable (such as the referential distance between an con (Ferrer i Cancho 2006). Second, information theory allows
element and its previous mention, for instance) on the given- one to explain certain actual properties of human language. For
ness status of an item. If one assumes that givenness erodes instance, the tendency of words to shorten as their frequency
over time due to the limits of short-term memory (a reasonable increases (Zipf 1935) can be interpreted as increasing the speed
enough assumption), then that efect should be measurable of the information transmitted (e.g., the number of messages
under controlled conditions. Experimental studies, however, per second) by assigning shorter codes to more frequent codes.
tend to be limited to the impact of givenness status on prosodic Another well-known property of human language is G. Zipfs
realization. law for word frequencies, one of the most famous laws of
language. It has been argued that this law could be an optimal
Ronald Geluykens
solution for for maximizing the information transmitted when the
mean length of words is constrained (Mandelbrot 1966) or maxi-
WORK CITED AND SUGGESTIONS FOR FURTHER READING mizing the success of communication while the cognitive cost of
Birner, B. J., and G. Ward. 1998. Information Status and Noncanonical using words is minimized (Ferrer i Cancho 2006). hird, informa-
Word Order in English. Amsterdam and Philadelphia: John tion theory has shed light on the evolution of language. It
Benjamins. has been hypothesized that the presence of noise in the commu-
Brown, Gillian, and George Yule. 1983. Discourse Analysis. nication channel could have favored the emergence of syntax
Cambridge: Cambridge University Press. in our ancestors (Nowak and Krakauer 1999), which turns out to
Firbas, J. 1992. Functional Sentences Perspective in Written and Spoken be a reformulation of fundamental results from standard infor-
Communication. Cambridge: Cambridge University Press.
mation theory (Plotkin and Nowak 2000). Finally, information
Geluykens, R. 1992. From Discourse Process to Grammatical
theory ofers an objective framework for studying the diferences
Construction: On Left-Dislocation in English. Amsterdam: Benjamins.
. 1994. he Pragmatics of Discourse Anaphora in English: Evidence
between animal communication and human language.
from Conversational Repair. Berlin: Mouton de Gruyter. It is well known that the occurrence of a certain word depends on
Halliday, M. A. K. 1994. An Introduction to Functional Grammar. 2d ed. distant words within the same sequence, for example, a text, in
London: Edward Arnold. human language (Montemurro and Pury 2002; Alvarez-Lacalle
Lambrecht, K. 1996. Information Structure and Sentence Form. et al. 2006), and information theory studies in other species
Cambridge: Cambridge University Press. provide evidence that long-distance dependences are not
Prince, E. F. 1981. Toward a taxonomy of given-new information. In uniquely human (Suzuki, Buck, and Tyack 2006; Ferrer i Cancho
Radical Pragmatics, ed. P. Cole, 22355. New York: Academic Press. and Lusseau 2006). Furthermore, research on humpback whale
songs (Suzuki, Buck, and Tyack 2006) questions the conjecture
of Marc Hauser, Noam Chomsky, and W. Tecumseh Fitch (2002)
INFORMATION THEORY that only humans employ recursion to structure sequences.
his is a general purpose and abstract theory for the study of Ramon Ferrer i Cancho
communication. Standard information theory was founded
by Claude Shannon (1948) and is based on a communication WORKS CITED AND SUGGESTIONS FOR FURTHER READING
framework in which a) the sender must transform a message
Alvarez-Lacalle, Enric, Beate Dorow, Jean Pierrer Eckmann, and Elisha
into a code and send it through a channel to the receiver, and
Moses. 2006. Hierarchical structures induce long-range dynamical
b) the receiver must obtain a message from the received code.
correlations in written texts. Proceedings of the National Academy of
Communication is successful if the message of the sender is the Sciences USA 103.21: 795661.
same as the message obtained by the receiver. For instance, a Cover, homas M., and Joy A. homas. 1991. Elements of Information
speaker utters a word for a certain meaning and then the hearer heory. New York: Wiley.
must infer the meaning that the speaker had in mind. Noise can Ferrer i Cancho, Ramon. 2006. On the universality of Zipfs law for word
alter the code when traveling from the sender to the receiver frequencies. In Exact Methods in the Study of Language and Text: To
389
Innateness and Innatism
Honor Gabriel Altmann, ed. Peter Grzybek and Reinhard Khler, mathematics. he issue in linguistics is empirical, though. he
13140. Berlin: Gruyter. It discusses the problems of classic models for question of whether some knowledge of language must be innate
explaining Zipfs law for word frequencies. has sparked a great amount of controversy over the last decades,
Ferrer i Cancho, Ramon, and David Lusseau. 2006. Long-term cor- during which philosophical positions (rationalism versus empir-
relations in the surface behavior of dolphins. Europhysics Letters
icism) were not always kept separate from the empirical issue.
14: 10951101.
Prominent protagonists include Noam Chomsky, Jerry Fodor,
Hauser, Marc, Noam Chomsky, and W. Tecumseh Fitch. 2002. he fac-
ulty of language: What is it, who has it and how did it evolve? Science
Ray Jackendof, Steven Pinker, Kenneth Wexler, Geofrey Pullum,
298: 156979. Terrence Deacon, Michael Tomasello, and earlier, from some-
Li, Wentian, and Kunihiko Kaneko. 1992. Long-range correlations and what diferent perspectives, W. V. O. Quine and Eric Lenneberg.
partial 1/f spectrum in a noncoding DNA sequence. Europhysics he debate arises in essentially two domains: concept learn-
Letters 17: 65560. ing and the acquisition of the structure of a language. Concept
Mandelbrot, Benoit. 1966. Information theory and psycholinguistics: A learning is associated with Quines thesis of the indetermi-
theory of word frequencies. In Readings in Mathematical Social nacy of translation (1960), illustrated by his gavagai
Sciences, ed. P. F. Lazarsield and N. W. Henry, 15168. Cambridge, example: Suppose one is trying to learn a hitherto unknown lan-
MA: MIT Press. guage. One is walking around accompanied by one of the speak-
McCowan, Benoit, Sean F. Hanser, and Laurance R. Doyle. 1999.
ers of the language. Suddenly a rabbit runs by and the speaker
Quantitative tools for comparing animal communication sys-
utters gavagai. As Quine shows, there are innumerable English
tems: Information theory applied to bottlenose dolphin whistle reper-
toires. Animal Behavior 57: 40919.
translations one could come up with for gavagai, among which
McDonald, Scott A., and Richard Shillcock. 2001. Rethinking the word it would be impossible to decide on the basis of the experience
frequency efect: he neglected role of distributional information in alone. To the extent to which in practice we succeed in doing
lexical processing. Language and Speech 44: 295323. so, this is something to be explained, for instance, by ascribing
Montemurro, Marcelo, and Pedro A. Pury. 2002. Long-range fractal cor- to us an innate conceptual structure, which is how Fodor (1983
relations in literary corpora. Fractals 10: 45161. and subsequent work) proposes that we should account for our
Moscoso del Prado Martn, Fermn, Alexander Kostic, and Harald R. convergence.
Baayen. 2004. Putting the bits together: An information theoretical he task of acquiring a language can be characterized as fol-
perspective on morphological processing. Cognition 94: 118.
lows (as in Chomsky 1986 and subsequent work). A human infant
Naranan, Sundaresan, and Vriddhachalam K. Balasubrahmanyan. 1998.
who is exposed to a language will in the course of roughly four to
Models for power law relations in linguistics and information sci-
six years (seven years if one counts full mastery of irregular mor-
ence. Journal of Quantitative Linguistics 5.3: 3561. A summary of
many optimization models of Zipfs law for word frequencies that are phology, or raising verbs; for the latter, see Hirsch and Wexler
based on information theory. 2007) acquire a knowledge of that language identical to that of
. 2000. Information theory and algorithmic complex- a human adult who knows the same language (with the excep-
ity: Applications to linguistic discourses and DNA sequences as com- tion of the lexicon, which keeps on growing in adulthood). In this
plex systems. Part I: Eiciency of the genetic code of DNA. Journal of respect, he or she is entirely unlike apes, dogs, bees, and so on,
Quantitative Linguistics 7.2: 12951. that, regardless of how much they are exposed to a language, will
Nowak, Martin A., and David Krakauer. 1999. he evolution of language. never reach anything like human adult competence. It is also
Proceedings of the National Academy of Sciences USA 96: 802833. uncontroversial that a child is not more predisposed to learn-
Plotkin, Joshua, and Martin A. Nowak. 2000. Language evolution and
ing one language than another. But given that a child exposed
information theory. Journal of heoretical Biology 205.1: 14759.
to Dutch ends up learning Dutch and not, for instance, Chinese,
Shannon, Claude E. 1948. A mathematical theory of communication.
and that the converse holds for a child exposed to Chinese, the
Bell Systems Technical Journal 27: 379423, 62356.
Suzuki, Ryuji, John R. Buck, and Peter L. Tyack. 2006. Information input must be crucial in determining which language ends up
entropy of humpback whale songs. Journal of the Acoustical Society being learned.
of America 119: 184966. Consequently, language acquisition can be schematically rep-
Zipf, George Kingsley. 1935. he Psycho-biology of Language. resented as in Figure 1, where the various stages that a child goes
Boston: Houghton Milin. through are represented by Si and the Di are the data to which
he or she is exposed, yielding a change in state (not prejudging
the question of whether these stages involve major qualitative
INNATENESS AND INNATISM
discontinuities). S0 is termed the initial state, before the child is
What does our knowledge come from? Innatism is the position exposed to language. (For the discussion, it is immaterial at what
that at least some of our knowledge is inborn rather than derived age we put S0. If exposure to language can already take place in
from experience. If so, the question naturally comes up con- some form in the womb, as there is evidence to believe, S0 can be
cerning what types of knowledge should be taken to be innate, put just before the irst exposure in the womb.) Sn is the steady
giving rise to speciic hypotheses about innateness. In linguis- adult state that does not change anymore if over time more data
tics, innateness became an important issue with the develop- are presented (with the exception of the lexicon).
ment of generative grammar, giving rise to a debate that his sketch as such should be uncontroversial, and even by
is inextricably linked to language acquisition, namely, whether a professed non-nativist such as Tomasello (2003), it can be
or not language acquisition is based on an innate universal rejected only at the price of incoherence. he real issue involves
grammar (UG). Innatism originated as a philosophical posi- the properties of S0 (abstracting away from the possible role of
tion, often focusing on concepts varying from god to justice to maturation of the brain). By deinition, S0 is the state of being
390
S0 S1 S2 Si Sn Sn Sn
Figure 1.
D1 D2 Di Dn Dn+1 Dn+2 ........
able to acquire language, distinguishing humans from apes, a reasonable length, the number of well-formed sentences in
dogs, and so on. hat is, it relects mans innate capacity for English is astronomical. It has been estimated that the number
language. By the deinition of the term universal grammar as it of grammatical English sentences of 20 words and less is 1020
is used in generative grammar, S0 coincides with UG. Whereas (Levelt 1967). (Note that this very normal sentence is exactly 21
a non-nativist could object to the term UG, nothing more is words and costs nine seconds to pronounce). At an average of six
involved than a terminological issue. What is really at stake are seconds per sentence, it will cost 19 trillion years to say (or hear)
questions such as the following: i) What properties must S0 have them all. In the case of nonstop listening, the percentage of these
in order to be able to account for the fact that language can be a child could have heard in six years time is 0.000000000031,
acquired, given what we know about time course and access clearly still a gross overestimation as compared to what the child
to data? ii) Which of these properties are speciic to man? And can actually be expected to hear. So on the basis of at most such
iii) What aspects of S0 are speciic to language, and how are the an extremely small percentage of the potential input, the child
other aspects related to other human cognitive capacities? he gets to know how language works.
irst question can only be successfully approached by carefully here are many further practical complications we ignored,
investigating necessary and sufficient conditions for such as lack of homogeneity and the presence of errors in the
learning and for learnability of (classes of) languages of the data. If we were to take these into account, the task would only
human type. he second and third questions require an under- become more formidable. his sets the stage for the logical
standing not only of the human language capacity but also of problem of language acquisition (for instance, Chomsky 1965,
those other cognitive capacities among which it is embedded. 1986), which can be formulated as the projection problem:
Logically, S0 could be empty. However, this would entail no
Consider a given inite set taken from some (ininite) superset.
diference between humans and animals, contrary to what we
Determine (the characteristic function of) this superset on the basis
know. So for this empirical reason alone, S0 cannot be empty.
of this subset.
But as is discussed in the next section, it would also entail that
humans cannot acquire language contrary to what we know. Like anyone can see, this task is in its generality impossible. For
any inite given set, the projection problem has ininitely many
The Logical Problem of Language Acquisition solutions.
A useful strategy for demonstrating the diiculty of a problem is For a concrete illustration, consider the following task involv-
to simplify it. If the simpliied problem is still hard, one knows ing the completion of a series of numbers:
that the original problem is at least as hard. So we take a highly
(1) 1,2,3,4,5,6,7, .
simpliied question as a starting point (Wexler and Culicover
1980): What does a person who knows a language minimally One might say that inding the next number is easy. It should
know? A reasonable answer is the following: A person who knows obviously be 8. But of course, this is not at all guaranteed. It is easy
a language knows at least which strings of words correspond to think of a perfectly well-behaved function that enumerates the
to well-formed sentences in that language and which strings irst seven natural numbers, followed by their doubles, triples,
dont. (his simpliication is valid irrespective of the changes in quadruples, and so on. his illustrates a very simple point: here
the signiicance attached to this particular aspect of linguistic is no general procedure to establish the correct completion of
knowledge, from Chomsky 1957 to Chomsky 1995.) some initial part of a sequence, whether a sequence of numbers
In this simpliied picture, we view a language as a subset as in (1) or a data sequence (D1, D2, Di, Dn) as in Figure 1.
of the set of all expressions one can form over a given vocabu- his fact relects a poverty of the stimulus in a fundamental
lary. hat is, assuming that the vocabulary of English contains sense, as a trivial logical truth.
the elements the, dog, bites, man, the set of English sentences he completion task may become possible, however, if it is
will contain the dog bites, the dog bites the man, and so on, but redeined as the task to ind a solution within a restricted space of
not the bites man, bites dog the, and so on. he task of the child possible solutions. In that case, certain instances of the projection
acquiring English, therefore, minimally includes determining problem become even trivial. For instance, (1) can be trivially
what the full set of English sentences is like on the basis of the completed if it is given that there is a constant diference between
sentences he/she is exposed to for some period of time, lets say each member of the series and its successor.
for six years. he question is then to get an impression of how As E. Gold (1967) showed, even highly restricted hypothesis
hard this task is. spaces may not ensure a solution of the projection problem as
Note that there is no upper bound to the length of individual deined. he task may become easier if the input does contain
sentences. his makes the set of sentences in a language efec- systematic evidence as to what is not in the target language.
tively ininite. However, even if restricted to sentences under Since, as is generally acknowledged, the input to the child does
391
not contain systematic negative evidence, it becomes of prime require a theory of how to acquire the ability to use the context of
importance to identify the types of hypothesis spaces that do utterance, or a formal proof that this structuring of the presenta-
allow learning of natural languages by presentation only. he tion suiciently aids the child in setting up correct hypotheses
absence of negative evidence, together with the fact that a sub- and rejecting incorrect ones but, crucially for the non-nativists,
stantial number of actual utterances a child may hear will be less without attributing to the child innate knowledge as to what evi-
than entirely well formed, are often referred to as poverty of the dence is to be absorbed and what evidence ignored. hese are all
stimulus as well. But this is, in fact, not the same notion as the equally susceptible to the poverty of the stimulus argument in
fundamental, logical one employed in the discussion of Figure 1 the logical sense.
and (1). For a rational debate it is crucial to keep the, irst, logical Clearly, it is important to have a conception of what it means
sense and the, second, narrower, empirical sense apart. for something to be a property of UG. For instance, Chomsky
Given the poverty of the stimulus in the logical sense, lan- (1980) suggests that the speciied subject condition (SSC) dein-
guage acquisition cannot be accounted for without the assump- ing the domain in which anaphors must be bound (condi-
tion of innate genetically determined restrictions. he poverty tion A) and pronominals must be free (condition B) is part of
of the stimulus in the empirical sense may help provide further our genetic endowment. he question is, then, what it means
evidence on what these are. In the generative literature, it is for UG. Must an SSC be hardwired as such? Or is it suicient
often claimed that these restrictions have the form of an inven- if the restrictions on binding descriptively captured by the SSC
tory of grammatical principles. It is presumably this reference follow from basic properties of our computational system, mod-
to grammatical principles that led to the poverty of the stimulus ulo properties of the mental space in which these computations
debate that is, for or against the existence of innate principles take place? For instance, Chomsky (1995) and subsequent work
of grammar as it is usually conducted with its emphasis on the argue that grammar is based on a very simple set of combina-
poverty of the stimulus in the narrower sense. But for a fruitful tory principles (essentially merge and Agree) and conditions
discussion, it is crucial to distinguish between the minimal prop- that follow from general properties of computation. If so, that
erties that S0 must have in order to explain language acquisition is what UG essentially amounts to. Reuland (2005) shows
and the further question of whether S0 has properties that are that the core of condition B the need to license relexivity of
speciic to language. a predicate follows from the fact that no computational sys-
tem can distinguish between indistinguishables, as in the case
Learnability and Complexity of the arguments of a relexive predicate. Elements like self or
Each restriction on the hypothesis space deines a class of gram- other morphological markers must be added for the system
mars and languages. As pointed out by Gold (1967) and Wexler to handle these arguments. To derive condition A, no more is
and Peter Culicover (1980), learnability does not depend on the needed than a general principle of economy of encoding, the
complexity of the individual language/grammar but only on the general combinatorics of the language system, and the lexical
structure of the class in which the selection must be carried out. semantics of self as an identity predicate. hus, prima facie
Many contributions to the debate center on speciic exam- substantive properties of language and UG reduce to the inter-
ples, such as the question of whether the child uses and under- action between general properties of mental computations and
stands utterances that are unexpected given the input up to a lexical representations. If so, there is indeed no sense in which
certain stage, whether or not the input is restricted (as in the case conditions A and B are acquired. hey relect basic properties
of motherese, a restricted register caretakers use in addressing of the system embedded in our wetware but it takes extensive
their children), or whether the input is richer and provides more linguistic research to show that this is so.
clues than one might have initially expected. As Pullum and
Barbara Scholz (2002) point out, one must distinguish between Language and Pattern Recognition
the general issue of a speciic genetic endowment and debates Non-nativists crucially invoke general learning strategies that
as to whether particular clues that have been argued to be non- originate in our general cognition. However, a statement that
existing can or cannot be found in natural language corpora. But language is an emergent property of our general cognitive sys-
important as issues of the latter type are, they are independent of tem requires a substantive theory of its workings, specifying how
the general problem. its operations account for language with the same amount of
Deacon (1997) attributes to Chomsky the position that natural precision as the rules of formal linguistics (none of the proper-
language is too complex to be learned without rich innate mech- ties in the explanans should invoke the explanandum). So far, no
anisms and then proceeds to argue that it is in fact not as com- precise proposals have been made available.
plex as is being claimed. Part of the argumentation in Tomasello he essence of Tomasellos claim is that there is no interest-
(2003) is based on the same premises. As demonstrated in the ing problem in language acquisition, since whatever is needed is
previous section, none of these issues bears on the poverty of the provided by our abilities for pattern recognition. Tomasello (and
stimulus in its logical sense. others) assume that there exists an ability for pattern recognition
he same applies to arguments stating that the input is much that provides the tool for inding the patterns in language. Of
richer than nativists presuppose. As Wexler and Culicover course, we humans have the ability to ind patterns. However, it
(1980) show, proposals involving an enriched input (a more is a fallacy to think that what we do is inding the patterns that
structured presentation in which pragmatic clues are provided are out there. he main message about concept learning to be
by the context of the utterance, as in the case of motherese ) mag- gleaned from Quines insights is that our mind must impose pat-
nify the logical problem instead of decreasing it. Such proposals terns. Elementary considerations from particle physics show us
392
Innateness and Innatism Integrational Linguistics
the same: Our common senses are blatantly incapable of seeing depth of misunderstanding involved: It is an empirical problem,
reality as it is. As our extended senses (in the form of experi- but analyzing the logical problem is essential for solving it.
mentation and model building) teach us, what is actually out All this does not demonstrate that the restrictions necessary
there bears minimal resemblance to what we can observe with for language acquisition are speciic to language. It does show
our common senses. If even individual events that we observe, that such restrictions are there and have to be studied if we are to
imprint in our memory, and store can only have a remote resem- understand language acquisition. No insight can be gained unless
blance to what is there, the more so for the patterns that we precise and substantive hypotheses are formulated and tested in
ind. a way that relects what we already know about language.
Any pattern involves extrapolation beyond what can be
Eric Reuland
observed; even the simplest of observations requires an active
mind shaping our internal representation of what is out there.
We can do no more than impose a pattern (one of the zillions of
possible patterns any piece of reality embodies), hopefully in Chomsky, Noam. 1957. Syntactic Structures. he Hague: Mouton.
a way compatible with our survival. he good thing is that evo- . 1965. Aspects of the heory of Syntax. Cambridge, MA: MIT Press.
lution resulted in our patterns being useful enough for every- . 1980. On cognitive structures and their development. In
Language and Learning, he Debate between Jean Piaget and Noam
day purposes since we managed to survive (for the moment).
Chomsky, ed. M. Piattelli-Palmarini, 3552. Cambridge: Harvard
he bad thing is that we can do so not because we are so smart
University Press.
but because we are so limited. We can learn only if we ignore . 1986. Knowledge of Language: Its Nature, Origin and Use. New
the many logically possible alternatives. Evolution keyed us York: Praeger.
to the universe. he key to understanding how we learn is to . 1995. he Minimalist Program. Cambridge, MA: MIT Press.
understand our limitations, the logically possible patterns we Deacon, Terrence. 1997. he Symbolic Species. New York: Norton.
ignore. Fodor, Jerry. 1983. he Modularity of Mind. Cambridge, MA: MIT Press.
In this respect, learning a language is like inding the pat- Gold, E. 1967. Language identiication in the limit. Information and
terns in the surrounding universe. In another respect, there is Control 16: 44774.
a diference, although it has been misconstrued. Deacon (1997) Hirsch, C., and K. Wexler. 2007. he late development of raising: What
proposes that our capacity to learn language is not surprising children seem to think about seem. In New Horizons in the Analysis
of Control and Raising, ed. W. Davies and S. Dubinsky, 3770.
since language is a human product and, therefore, is made to
Heidelberg: Springer.
be learned. However, unless we are careful, this leads us back
Kandel, Eric, James Schwartz, and homas Jessell. 2000. Principles of
to the whole range of issues discussed, framed slightly difer- Neural Science. 4th ed. New York: McGraw-Hill.
ently: How are the properties allowing language to be learned Lenneberg, Eric H. 1967. Biological Foundations of Language. New
relected in its structure, and what does the fact that we acquire York: Wiley.
language tell about our cognitive abilities? Levelt, Willem. 1967. Over het Waarnemen van Zinnen. Groningen,
Nevertheless, it also contains a relevant insight. Unlike the Germany: Wolters.
physical world, language is a product of the human mind. So we Pullum, Geofrey K., and Barbara C. Scholz. 2002. Empirical assessment
know that the childs mind is keyed to getting to know language of stimulus poverty arguments. Linguistic Review: 19.1/2: 950.
in a way he or she will never know the physical world: Complete Quine, Willard Van Orman. 1960. Word and Object. Cambridge, MA: MIT
knowledge is attainable. As in the case of the physical world, the Press.
Reuland, Eric. 2005. Binding conditions: How are they derived? In
input for learning language is external. However, unlike in the
Proceedings of the HPSG05 Conference Department of Informatics,
case of the physical world, we know the nature of the input to
University of Lisbon, ed. Stefan Mller. Stanford, CA: CSLI Publications.
an extent that is unique. So language acquisition relects labo- Available online at: http://csli-publications.stanford.edu/
ratory conditions for learning, facilitating an understanding of Tomasello, Michael. 2003. Constructing a Language. Cambridge: Harvard
learning per se. University Press.
It is surprising that so many researchers of human learning Wexler, Kenneth, and Peter Culicover. 1980. Formal Principles of
have such a hard time accepting the implications of the projec- Language Acquisition. Cambridge, MA: MIT Press.
tion problem, though the moral is so simple, like the irst law of
thermodynamics prohibiting the perpetuum mobile. At it reads
INTEGRATIONAL LINGUISTICS
in the well-known words of C. P. Snow:
his is the application of integrational semiology to the study
You cannot win (that is, you cannot get something for nothing)
of language. Integrational linguistics is based on the assump-
Applied to learning: tion that human communication, whether verbal or nonverbal,
involves the creation of signs in particular contexts whereby two
Learning a recursive step by presentation only, without restrictions
or more individuals engage in joint integration of their respec-
on the hypothesis space, is as impossible as creating the perpetuum
tive activities. hus, for example, speech communication would
mobile.
be impossible without integration of the biomechanically sepa-
he last sentence in Tomasello reads: How children become rate activities of vocalization and hearing. Written communica-
competent users of a natural language is not a logical problem tion requires the production of marks on a surface that can be
but an empirical problem (2003, 328). Paradoxically, I agree integrated with programs of optical scanning. he appropriate
and, at the same time, would like to say that it illustrates the integration of these and many other activities is one of the major
393
Integrational Linguistics
functions of the cerebral cortex, and failure to achieve the inte- The Principle of Noncompartmentalization
grational proiciency required for social luency in communi- It also follows from the semiological axioms that there is no strict
cation is commonly perceived as a defect or handicap of some or objective dividing line between linguistic knowledge and non-
kind (e.g. deafness, dyslexia, etc.) when due to physiological linguistic knowledge, or, as some theorists put it, between knowl-
factors. Integrationists diferentiate forms of communication edge of a language and knowledge of the world. Recognition of
according to the range of activities typically integrated by the this indivisibility is referred to by integrationists as the principle
participants, and the kinds of integrational proiciency typically of noncompartmentalization. In other words, human beings do
required for participation. A blind person, for obvious reasons, not live in a communicational environment where what pertains
lacks the integrational proiciency presupposed in various forms to language belongs to one compartment and the rest belongs to
of visual communication (see blindness and language). some other compartment (or compartments).
From this perspective, the term language does not correspond Non-compartmentalization is also heresy in orthodox lin-
to any one mode of communication but straddles or conlates guistics, since it implies that linguistics cannot be a science.
several. In this respect, integrational linguistics difers radically (A physicist who confessed inability to diferentiate between the
from mainstream schools of thought in linguistics and neigh- facts of the physical world and the nonphysical world would be
bouring disciplines, which tend to assume that language is a sin- confessing to a similar heresy.) In an academic milieu where
gle human faculty, common to all humanity, and that languages every inquiry aspires to be scientiic, this doctrine is not
(English, French, Latin, etc.) are diferent social codes enabling popular.
individual users to exercise this faculty. For integrationists, on he integrationist view can be illustrated by considering what
the other hand, individuals are not language users but language happens at a cocktail party. Physically, a certain level of audible
makers. hey make language by their creative integration of ver- vibration is generated (often said to be deafening, though very
bal signs into a myriad diverse activities, in both expected and small as compared with the energy required to light one electric
unexpected ways, with due regard for the circumstances, just as lamp). Physiologically, there is much expenditure of efort in
they make human relationships by the ways in which they inter- terms of the muscular action of vocal apparatus (but again very
act with others in particular cases. small by comparison with the efort required to walk across the
road). Mentally, there is doubtless engagement in interactions
Axioms of Integrational Semiology with others, but it cannot be quantiied. So where in all of this is
he axioms of integrational semiology are as follows: the language component? It seems to be in there somewhere,
1. What constitutes a sign is not given independently of the but exactly where deies exact location. To ask where is itself a
situation in which it occurs or of its material manifestation in nonsense question. And to grasp why it is a nonsense question is
that situation. already halfway to subscribing to the integrationist principle of
noncompartmentalization.
2. he value of a sign (i.e., its signiication) is a function of the
he orthodox linguistic answer is that the language compo-
integrational proiciency that its identiication and interpre-
nent resides somewhere in the heads of the talkers and listen-
tation presuppose.
ers. But so, presumably, does their knowledge of football, food,
As applied to language studies, this means that verbal com- local politics, and everything else; that is, all the things being
munication of whatever kind cannot be decontextualized. talked about at the cocktail party. So exactly the same compart-
Episodes of communication are episodes in the lives of partic- mentalization perplexity arises at one remove. Human beings
ular individuals at particular times and places. hese episodes cannot in their everyday lives distinguish between knowing
have to be studied as such. We learn nothing from an analysis something about X and being able to talk about it.
telling us, for instance, that someone uttered the sentence John
loves Mary, that John is the subject of the sentence, Mary is the The Principle of Cotemporality
direct object of the verb love, love is a transitive verb, and so on he integrationist principle of cotemporality complements the
nothing, that is, except information about the metalinguistic principle of noncompartmentalization. Everyday experience
assumptions of the analyst. (hese assumptions may be worth recognizes that an event occurring at time t may afect how we
studying in their own right, but that is not the same as studying interpret an event occurring earlier or later than t. Temporal
the facts pertaining to the utterance in question. On the contrary, sequence is an intrinsic aspect of contextualization.
the analysis is one that already embarks on a decontextualization Compare the situation in which (1) landlord says he water
of the episode allegedly described.) is turned of today and tenant says I must have a shower with
It is not simply a matter of knowing who said what to whom, the situation in which (2) tenant says I must have a shower and
where, and in what circumstances. Nor are the circumstances landlords response is he water is turned of today. Ostensibly,
what happened immediately before and after. Orthodox modern the same information has been exchanged and the same words
linguistics, like traditional grammar, routinely assumes the legit- used. But what emerges from the communicational episode is
imacy of abstracting from all these features of context. Its state- quite diferent in the two cases.
ments are supposedly generalizations across indeinitely many For integrationists, the question of temporal sequence
unidentiied episodes of language use. Integrational linguistics involves both verbal and nonverbal behavior. In brief, there is no
rejects on principle the legitimacy of such generalizations: Again, way that what is said can be set apart from the train of events in
they tell us nothing about linguistic facts only about the intel- which it occurs, whether these are verbal or nonverbal. We all
lectual preferences or prejudices of the analyst. know this. hat is why in legal disputes courts treat diferently
394
Integrational Linguistics
a case in which A insulted B, who then struck A, from a case in to agree about the meanings of words than they are obliged to
which A struck B, who then insulted A. As individuals, we are agree about the value of goods. In both cases, it usually suits their
time-bound agents in all our activities. Our linguistic acts do purposes to make some kind of compromise with those they are
not have some special time track of their own. here is no such dealing with. But the nature of this semantic compromise is
thing as a contextless linguistic sign. essentially ad hoc.
his does not mean that the context simply is a given
sequence of events. Integrationists take a very diferent view of Parameters of Communication
context from that usually found in orthodox linguistics. Context For integrationists, there are certain capacities required of an
is not to be equated with situation (which may be irrelevant) individual in order to participate in communication. hese
and even less with preceding or following speech-acts. It is capacities are of three kinds: biomechanical, macrosocial, and
not some kind of local backdrop against which communication circumstantial. he irst relates to the human organism and the
takes place. Context, for the integrationist, is always the product ability to integrate activities requiring a wide variety of physi-
of contextualization, and each of us contextualizes in our own ological and mental processing (e.g., the very diferent biome-
way. he individual participants in any communication situation chanical requirements involved in speech production and
will each contextualize what happens diferently, as a function hearing). he second relates to the human ability to integrate
of the integrational proiciency each exercises in that situation. particular activities into sets of assumptions provided by social
his does not mean that we can never reach communicational conventions of various kinds. he third relates to the ability to
agreement, but it explains why we often do not. It is not enough integrate ones activities into whatever else is going on at the
to say that every act of communication is unique. Each such act time.
is in principle subject to multiple contextualizations and recon- A simple illustration of all three is provided by what happens
textualizations. hat is what makes it essential in linguistic analy- when a motorist encounters a pedestrian about to cross the road.
sis for the analyst to specify what forms of contextualization are At the macrosocial level, there are assumptions about the road,
presupposed (a requirement that the great majority of orthodox the conventions of the highway code, and so on. At the biome-
linguists ignore). For integrationists, a sign is not a sign until it chanical level, there are questions about what the motorist sees
has been contextualized: he act of contextualization and the within a certain range of visibility and the alertness necessary for
establishment of the sign are one and the same. initiating the appropriate physiological actions for driving the
vehicle. At the circumstantial level, even when all of these aspects
Meaning have been taken into account, the individual motorist must con-
It follows, then, that integrationists take a quite diferent view of stantly be prepared to modify any action taken, depending on the
meaning from that which informs most work in orthodox linguis- moment-to-moment behavior of the pedestrian and other road
tics, where the meaning of a linguistic form is usually construed users.
as some kind of concept (as in the deinitions of conventional his does not mean that the levels of integration are inde-
dictionaries) or, even more vaguely, mental representation. pendent. Circumstantial factors intervene across the board. An
his is traditionally imagined to yield a more or less permanent accident may be caused because the pedestrian is shortsighted
value attached to the form and known to all competent speak- or the road badly lit. he drivers exercise of biomechanical skills
ers and writers of the language in question (even if the language may be afected by knowing or not knowing that the brakes of
is no longer living). hus, the meaning of classical Latin aqua is this particular vehicle are not very reliable.
treated as timeless: It still means for Latin specialists what it Much recent work in integrational linguistics has focused on
meant in the days of Julius Caesar and will mean in a thousand analyzing how, at the macrosocial level, societies succeed in con-
years from now. structing supercategories, which integrate what would otherwise
For integrationists, this assumption conlicts with the princi- be quite separate activities in such a way as to set up a common
ple of cotemporality. here are no atemporal invariants in lan- framework for the intellectual and practical pursuits of society as
guage. Meaning is made by participants as part of the process of a whole. hese supercategories include science, art, history, and
communication. It is thus subject to the principle of cotemporal- religion, each with a dedicated discourse of its own.
ity in just the same way as all other aspects of communication.
Roy Harris
here are no ixed meanings. here is nothing in language to pro-
vide us with a miraculous guarantee of the stability of meaning(s)
over time or even from one moment to the next. To demand
such a guarantee for any mode of communication is as futile as Harris, R. 1998. Introduction to Integrational Linguistics. Oxford:
demanding that a currency remain stable in value from one day Pergamon. A concise survey of the whole ield.
Harris, R., and C. Hutton. 2007. Deinition in heory and Practice.
to the next. (he indeterminacy holds regardless of whether peo-
London: Continuum. An integrationist approach to problems of dei-
ple shopping in High Street are aware of it.)
nition, with particular reference to lexicography and the law.
We have no option but to interpret particular episodes of Love, N., ed. 2006. Language and History: Integrationist Perspectives.
communication by integrating them into the unique temporal London: Routledge. Collection of papers bearing on the construction
sequence of events that constitutes our previous experience. of the macrosocial supercategory history.
hus, where there are two or more participants, what is com- Toolan, M. 1996. Total Speech: An Integrational Linguistic Approach to
municated must be open to two or more interpretations. hese Language. Durham, NC: Duke University Press. Includes interesting
cannot be guaranteed to coincide. People are no more obliged discussions of literal meaning, metaphor, and related issues.
395
Intension and Extension
Wolf, G., and R. Harris , eds. 1998. Integrational Linguistics: A First Reader. difers in meaning from
Oxford: Pergamon. Collection of papers covering a wide range of topics
from an integrationist perspective. (4) Lola believes that Mark Twain is Samuel Clemens.
For if (3) and (4) have the same meaning, then they are either
INTENSION AND EXTENSION both true or both false. But (3) may be true while (4) is false for
example, if Lola does not realize that Mark Twain and Samuel
semantic theories commonly distinguish two aspects of linguis-
Clemens refer to the same man. Since (3) is just like (4) except for
tic meaning: intension and extension. Roughly, the intension of a
an occurrence of Samuel Clemens instead of Mark Twain, Mark
linguistic expression is what it means, and its extension is what it
Twain and Samuel Clemens difer in meaning.
refers to. For example, on some views, Neil Armstrong and the
Such are two arguments that Mark Twain and Samuel Clemens
irst human being to walk on the Moon have the same exten-
difer in meaning. But if the extension of a proper noun is its
sion but diferent intensions. Often it is held that intension deter-
referent, then Mark Twain and Samuel Clemens have the same
mines extension (expressions with the same intension have the
extension. hus, some (but not all) conclude, there is a distinc-
same extension) but is not determined by extension (expressions
tion between two aspects of meaning: intension and extension.
with the same extension may have diferent intensions).
his entry describes various roles that intensions and exten-
Extensional and Nonextensional
sions play in contemporary semantic theories. While much
Arguably, then, a true sentence (3) may be turned into a false sen-
recent discussion uses formal methods, the presentation here is
tence (4) merely by substituting an occurrence of Mark Twain by
nontechnical.
Samuel Clemens. Suppose that the extension of a sentence deter-
Some possible terminological confusions should be noted.
mines its truth value and that Mark Twain and Samuel Clemens
A long tradition of distinguishing diferent aspects of meaning
have the same extension. If so, then to change (3) into (4) is to
(including Porphyrys third-century commentaries on Aristotle,
replace part of sentence (3) with a coextensional expression,
the Port-Royal logic of 1662, Mill [1843] 1872, and Frege [1892]
while failing to preserve the extension of the entire sentence.
1997) has left us with a hodgepodge of terms and distinctions.
Contexts like Lola believes that are typically thought to
One should be wary whether, for example, the Port-Royal log-
be nonextensional. An extensional context is a context wherein
ics comprehension and extension, or Freges Sinn and
substitution by a coextensional expression always preserves the
Bedeutung (often translated sense and reference; see sense
extension of the larger expression. A nonextensional context is
and reference), or Mills connotation and denotation
thus a context wherein substitution by a coextensional expres-
mark the same distinction as intension and extension in
sion sometimes fails to preserve the extension of the larger
recent theories. Moreover, extension is sometimes, but not
expression.
always, taken to be synonymous with denotation, designa-
Nonextensional contexts appear to be widespread in natural
tion, or referent, while intension, intensional, and inten-
language. In English, for instance, propositional attitudes
sionality are sometimes, but should not be, confused with
(like Lola believes that , John desires that ), as well as
intention, intentional, and intentionality. Arguably, there
many other constructions (involving, for example, seeks,
is a relation between a speakers intentions and the meanings of
admires, avoids, resembles, necessary, possibly,
the words he or she uses; however, the connection should not
must, may, obviously, and because), have been held to
be drawn via terminological confusion. intentionality, the
be nonextensional.
distinctive property of thoughts and other mental phenomena,
Nonextensional contexts are often called intensional. But
is yet something else entirely.
sometimes, more carefully, intensional is reserved for those
contexts wherein substitution of a cointensional expression
Why Accept a Distinction Between Intension and
always preserves the intension of the larger expression. Contexts
Extension?
wherein cointensional substitution may fail are then called non-
Knowing what sentence (1) means is enough to be able to know
intensional or hyperintensional. Lola believes that is one
that it is true.
context thought by some to be non-intensional in this sense. For
(1) Mark Twain is Mark Twain. example, according to some, eye doctor and ophthalmolo-
gist have the same intension, but Lola believes that Eve is an
However, (2) difers.
eye doctor and Lola believes that Eve is an ophthalmologist
(2) Mark Twain is Samuel Clemens. have diferent intensions: If Lola doesnt know the word ophthal-
mologist, the former may be true and the latter false.
Although (2) is true, merely knowing what (2) means is not
enough to be able to know that it is true. So (1) and (2) difer
Semantics Without Intensions
in meaning. But (2) is just like (1) except for an occurrence of
W. V. O. Quine argued that intensions and other elements of tra-
Samuel Clemens instead of Mark Twain. So, Mark Twain and
ditional theories of meaning have no place in a scientiic descrip-
Samuel Clemens difer in meaning since the meanings of (1) and
tion of the world. He claimed that intensions are on a par with
(2) are determined only by the meanings of their parts and the
the Homeric gods: Intensions play no useful explanatory role in
way they are put together. (See compositionality.)
a scientiic description of the world. According to one of his most
Similarly,
inluential arguments, there is no noncircular way to make sense
(3) Lola believes that Mark Twain is Mark Twain. of traditional notions like meaning, synonymy, analyticity,
396
Intension and Extension
and the like. Indeed, he argued, there is no distinction between Montague proposed complex intensions, mapping not possible
sentences true simply in virtue of meaning analytic truths and worlds but indices a combination of a possible world with
other true sentences synthetic truths (Quine 1953, 1960). persons, places, times, and so on. David Kaplan (1989) instead
Greatly inluenced by Quine, Donald Davidson (1967) pro- divided intensions into two pieces: character (the linguistic
posed a semantic theory for natural language with no place for meaning of an expression type) and content (the meaning of an
intensions. In a Davidsonian theory, each linguistic expression occurrence of an expression). Roughly, character plus context
is paired with an extension (other terms like referent or semantic determines content, and content plus context yields extension.
value are often used instead of extension), and rules of composi- Kaplans arguments about indexicals and demonstratives, as
tion state how the extension of a larger expression is determined well as Saul Kripkes and Hilary Putnams on proper names and
by the extensions of its parts. For example, in one version of the natural kind terms, have led many to accept that these terms
theory, the extension of John is John, the extension of runs are devices of direct reference; in efect, occurrences of these
is the set of running things, and the extension of John runs is a terms have extensions but no intensions. Others have responded
truth value (see truth conditional semantics). by adopting the framework of two-dimensional semantics,
Challenging for this approach are the problems that drive where expressions are assigned two diferent intensions (Garcia-
some to posit intensions in the irst place. If Mark Twain and Carpintero and Maci 2006).
Samuel Clemens have the same extension, then how to explain While semantic theories inspired by Carnap and Montague
the diferences between (1) and (2) or between (3) and (4)? But do deal in intensions, they have much in common with the no-
if Mark Twain and Samuel Clemens have diferent extensions, intension theories discussed earlier. Not only are both sorts of
then how to explain why Mark Twain is Samuel Clemens is theories compositional and referential, but both sorts of theories
true? (For possible answers see, for example, Larson and Segal can also arguably fail in various ways to capture intuitive notions
1995.) of meaning. For example, if the intension of a sentence is a func-
tion from possible worlds to truth values, then if two sentences
Semantics with Intensions have the same truth value at every possible world, then they have
More commonly, intensions are taken seriously. In Meaning and the same intension. hat means that all true mathematical sen-
Necessity, Rudolf Carnap ([1947] 1956) presented his method tences have the same intension, for presumably a mathematical
of intension and extension as an improvement over Gottlob truth is true at every possible world. But 2 < 3 and 2 + 2 = 4 do
Freges way of distinguishing sense and reference. In Carnaps not have the same meaning.
system, each meaningful expression is assigned both an exten- Within current frameworks, lively discussion about these and
sion and intension. For example, the extension of human is the other questions continues. (For a useful introduction, see von
class of human beings, the intension of human is the property of Fintel and Heim 2007.) More radically, others propose to rework
being human, the extension of Walter Scott is Walter Scott, the the foundations of semantics to produce more ine-grained
intension of Walter Scott is an individual concept of Walter Scott, intensions (e.g., Fox and Lappin 2005). And then there is the
the extension of a sentence is its truth value, and the intension extreme view of J. J. Katz (1990): We should give up the claim that
of a sentence is the proposition it expresses. Notably, unlike intension determines extension and adopt an internalist notion
Freges senses, Carnaps intensions do not vary with linguistic of intension better suited to the traditional duty of explaining
context, thereby, according to Carnap, avoiding a serious objec- analyticity, synonymy, meaningfulness, and so forth.
tion both to Freges approach and the related proposal by the A variety of views about the distinction between intension and
inluential logician Alonzo Church (1951). extension are alive. Widely, but not universally, it is thought that
In applying a precise distinction between intension and exten- both intensions and extensions are needed in semantics. Debate
sion to natural language, Richard Montague (1974; see montague continues about what intensions and extensions are, about what
grammar) was particularly inluential. Inspired by Carnap and the relation between intension and extension is, and about the
Church, Montague took an intension to be a mathematical func- sorts of linguistic expressions that have them.
tion. his permitted a graceful way for meanings to combine, as
Patrick Hawley
the application of function to argument. In one version of this
sort of theory (possible world semantics), the intension of
a sentence is a function from possible worlds to truth values, and
the extension of that sentence relative to a possible world is the Carnap, Rudolf. [1947] 1956. Meaning and Necessity: A Study in Semantics
result of applying its intension function to that possible world. For and Modal Logic. Chicago: University of Chicago Press.
example, the intension of the sentence Hong Kong is in China Chierchia, Gennaro, and Sally McConnell-Ginet. 2000. Meaning and
might be a function yielding the value true for all possible worlds Grammar. 2d ed. Cambridge, MA: MIT Press.
Church, Alonzo. 1951. A formulation of the logic of sense and deno-
where Hong Kong is in China and false otherwise. Variations and
tation. In Structure, Method and Meaning: Essays in Honor of H. M.
improvements to Montagues general approach have dominated
Shefer, ed. P. Henle, H. Kallen, and S. Langer, 324. New York: Liberal
the ield of formal semantics (Gamut 1991; Heim and Kratzer
Arts Press.
1998; Chierchia and McConnell-Ginet 2000). Davidson, Donald. 1967. Truth and meaning. Synthese 17: 30423.
Reprinted in Inquiries into Truth and Interpretation (Oxford: Oxford
Complications and Problems University Press, 1984).
Complications ensue when accounting for context-sensi- Fox, Chris, and Shalom Lappin. 2005. Foundations of Intensional
tive expressions like the indexicals I or here or now. Semantics. Oxford: Blackwell.
397
Intentionality
Frege, Gottlob. [1892] 1997. On sense and reference. In he Frege categories. On the one hand, we have reference, denotation,
Reader, ed. M. Beaney, 15171. Oxford: Blackwell. and extension; on the other hand, we have content, meaning,
Gamut, L. T. F. 1991. Logic, Language and Meaning. Chicago: University sense, connotation, and intension. he relationship between
of Chicago Press.
these categories of terms is best illustrated via the intentionality
Garcia-Carpintero, Manuel, and Josep Maci, eds. 2006. Two-Dimensional
of linguistic expressions.
Semantics. Oxford: Oxford University Press.
Just as names are about, in the relevant sense, the objects for
Heim, Irene, and Angelica Kratzer. 1998. Semantics in Generative
Grammar. Oxford: Blackwell. which they stand, so, one might think, predicates are about
Kaplan, David. 1989. Demonstratives. In hemes from Kaplan, ed. J. the things of which they are true. Green is about the green
Almog, J. Perry, and H. Wettstein, 481563. New York: Oxford University things, happy about the happy things, and so on. he things
Press. that words are about, in this sense, are their references (deno-
Katz, J. J. 1990. he Metaphysics of Meaning. Cambridge, MA: MIT Press. tations, extensions). But, plausibly, a theory of reference for a
Kripke, Saul. Naming and Necessity. 1980. Cambridge: Harvard University language would not be a full account of the content (meaning,
Press. sense) of expressions of the language. To adapt an example from
Larson, Rich, and Gabriel Segal. 1995. Knowledge of Meaning. Cambridge, W. V. O. Quine (1953), the sentences Dolly is a renate and
MA: MIT Press.
Dolly is a cordate may be alike with respect to the reference
Mill, John Stuart. [1843] 1872. A System of Logic. 8th ed.
of the expressions that compose them (because the set of cor-
London: Longmans.
dates is identical to the set of renates) even though, intuitively,
Montague, Richard. 1974. Formal Philosophy: Selected Papers of Richard
Montague, ed. Richmond homason. New Haven, CT: Yale University the two sentences say diferent things about Dolly. So it seems
Press. that two expressions can have the same reference while difer-
Porphyry. 2003. Porphyrys Introduction. Trans. and commentary by ing in content. But many have thought that, as Gottlob Frege
Jonathan Barnes. Oxford: Oxford University Press. (1892) suggested, the converse does not hold: two expressions
Putnam, Hilary. 1975. he meaning of meaning. In Philosophical cant have the same content without also having the same refer-
Papers. Vol. 2. Mind, Language and Reality. Cambridge: Cambridge ence. Intuitively, two sentences cant say the same thing about
University Press. the world or express the same thought without being about the
Quine, W. V. 1953. Two dogmas of empiricism. Philosophical same things. his combination of views that the content of an
Review 60: 2043. Reprinted in From a Logical Point of View, 2d ed.
expression is standardly something over and above its reference,
(Cambridge: Harvard University Press, 1961).
and that the content of an expression determines its reference
. 1960. Word and Object. Cambridge, MA: MIT Press.
is very widely accepted. (hough not universally; it is rejected by
von Fintel, Kai, and Irene Heim. 2007. Intensional Semantics. Manuscript,
Massachusetts Institute of Technology. defenders of a Chomskyan internalist view of meaning who take
meanings to be internal to the language-processing systems of
language users [Pietroski 2003] and by skeptics about content
INTENTIONALITY [Quine 1953, 1960; Kripke 1982].)
hese views about the relationship between content and ref-
Aboutness erence structure much contemporary work on intentionality, for
he closest thing to a synonym for intentionality is aboutness;
if content determines reference, it is natural to think that content
something exhibits intentionality if and only if it is about some-
explains reference: Intentional phenomena come to be about
thing. he relevant sense of about is best elucidated by exam-
things by virtue of their possessing a content. his way of think-
ple: he sentence Saul Kripke is a philosopher is about Saul
ing about intentionality has several virtues. One is that it seems to
Kripke; my belief that the weather in South Bend is dreary is about
ofer an explanation of the aforementioned example of Zeus; if
the city of South Bend, Indiana; the black lines and curving blue
aboutness is typically explained by possession of a content, then
stripe on the map in my hand are about the streets of South Bend
perhaps the sense in which Zeus aims to be about something is
and the St. Joseph River; the position of the needle on the gas
that it, like expressions that are genuinely about something, has
gauge in my car is about the amount of gasoline in its tank. While it
a content. Its just that in the case of Zeus, this content fails to
is diicult to ind an uncontroversial and illuminating paraphrase
determine a reference.
of the relevant sense of about, its hard to deny that there is some
Virtually nothing more can be said about content, reference,
reasonably clear sense of aboutness common to these examples.
and the relationship between the two without entering into mat-
his characterization of intentionality as aboutness is only
ters about which there is not even rough agreement. heorists
true to a irst approximation because something can exhibit
difer about what sorts of things contents are, about whether
intentionality without being about anything if it purports to be
there are any expressions for which content and reference
about something. Zeus is not about, does not represent, any-
coincide, and about whether there are any kinds of expressions
thing; this name, unlike Saul Kripke, does not have a worldly
that cannot possess a content without possessing a reference.
correlate. Nonetheless, Zeus counts as an example of intention-
Canonical works on these topics include Frege (1892), Russell
ality by virtue of the fact that it (in a diicult to explain sense)
(1905), Frege (1918), Carnap (1947), Kripke (1972), and Kaplan
aims to be about something, even if it does not succeed.
(1989).
Intentionality, Content, and Reference Intentionality, Intensionality, and Intentions

Glossing over a wealth of distinctions, we see that the vocabulary It is worth mentioning at this point two persistent, though purely
used in discussions of intentionality is divisible into two broad terminological, sources of confusion about intentionality: the
398
Intentionality
distinctions between intentionality and intensionality, on the presented, in judgement something is airmed or denied, in love
one hand, and intentionality and intentions, on the other. loved, in hate hated, in desire desired, and so on.
Intensionality is a property of sentence contexts. Given any his is characteristic exclusively of mental phenomena. No
context in a sentence, we can then ask: Can we, by replacing physical phenomenon exhibits anything like it. We can, there-
one expression or phrase in that context with another that has fore, deine mental phenomena by saying that they are those
the same reference, change the truth value of the sentences as a phenomena which contain an object intentionally within them-
whole? If so, then the context is said to be intensional. selves. (Brentano [1874] 1997, II.i.5)
So far, the connection between intentionality and intensional-
We can think of Brentanos thesis as having two components:
ity may seem to be merely orthographic. But it has been claimed
that the latter is a criterion for the former: that descriptions of Intentionality is necessary for mentality; all mental states
intentional phenomena will always include an intensional con- exhibit intentionality.
text (Chisholm 1957). For descriptions of propositional atti- Intentionality is suicient for mentality; everything that
tudes like beliefs, this seems plausible. For example, exhibits intentionality is a mental state.
John believes that the worlds most famous sheep is famous. he claim of necessity is uncontroversial when we are thinking of
propositional attitudes like believing, supposing, and judging. It
may be true while
is more controversial but still plausible when we think of percep-
John believes that Dolly the sheep is famous. tual states; the sense in which my visual experience is currently
of or about a computer screen is recognizably the same as the
is false, even if the worlds most famous sheep and Dolly the
sense in which a name is a name of its bearer.
sheep have the same reference. But the criterion seems to fare
Bodily sensations like itches and pains, however, may seem
less well in other cases. For example,
to be counterexamples to Brentanos claim that intentionality is
he thick blue line on my map of South Bend represents the necessary for mentality. My sensation of throbbing pain is clearly
St. Joseph River. a mental state but can it be said to represent, or be about, any-
thing at all? Many have thought not and have seen the attempt
appears to ascribe the right sort of aboutness to qualify as a sen-
to ind intentionality in sensations as an ad hoc attempt to ind
tence about intentionality, but the sentence does not contain
something common to mental phenomena (Rorty 1979). But
any intensional contexts. And many sentences that do contain
this negative verdict can be challenged, and it has been in recent
intensional contexts dont seem to be descriptions of intentional
philosophy of mind. For one thing, pains are felt as located, and
phenomena. For example,
given this, it is not implausible to think of them as about the part
Mammals have a greater chance of heart failure than latworms of the body where they are felt to be (Tye 1995; Byrne 2001).
because they are cordates. On the face of it, the other half of Brentanos thesis that inten-
tionality deines the mental seems to be less well-of. How can
and
one claim that intentionality is suicient for mentality when
Mammals have a greater chance of heart failure than latworms things that are clearly not mental states like words, parts of
because they are renates. maps, and gas gauges exhibit intentionality?
difer in truth value, even though cordates and renates have the
same reference. Original and Derived Intentionality
A second potential source of confusion is the similarity of he best answer to this question invokes a distinction between
intention, and intentionality. Intention, like belief and original and derived intentionality. We began by noting the
desire, is the name of a type of mental state. Like beliefs and diversity of things that exhibit intentionality: mental states, lin-
desires, intentions exhibit intentionality, but they are no more guistic expressions, maps, gas gauges. But it is plausible to think
essential to intentionality than other mental states. that at least some of these intentional phenomena acquire this
status via a relation to some other more fundamental intentional
phenomenon. If this is correct, we can recast the second half of
Intentionality and Mentality
Brentanos thesis as the claim that only mental phenomena have
hough intentionality is derived from intentio, a technical term
original intentionality: intentionality not explicable in terms of
that had wide use in medieval philosophy, and intentio is itself
other intentional phenomena.
a translation of technical terms from premedieval Arabic phi-
his sort of defense of Brentano carries with it a commit-
losophy, modern usage of the term is usually traced to Franz
ment to the research program of explaining the intentionality
Brentanos 1874 Psychology from an Empirical Standpoint.
of language, maps, and gas gauges in terms of the intentionality
Brentano is standardly taken to have made two basic claims
of the mental. his research program has considerable promise
about intentionality, the irst of which is that intentionality is
and has received sophisticated development over the last few
internally related to mentality:
decades, with most of the attention focused on explanations of
Every mental phenomenon is characterized by what we might linguistic meaning in terms of mental content.
call direction toward an object. Every mental phenomenon One well-developed attempt to provide such an explana-
includes something as an object within itself, although they tion begins with the thought that linguistic expressions mean
do not all do so in the same way. In presentation something is what they do because of what speakers intend to convey by
399
Intentionality
using them (Grice 1957, 1969; Schifer 1972, 1982). On this he internal states in question will presumably be complex
view, what a speaker means by uttering an expression on an physical states of subjects. Given this, we can ask: Is the content
occasion (speaker-meaning) is a function of the beliefs that of such states derived from the contents of its parts so that, in
that speaker intends to bring about in his or her audience via the case of a state that has the content that grass is green, the
their recognition of that communicative intention, and, fur- state would have one part representing grass, and another rep-
ther, what an expression means in a community is a function resenting the color green or are the fundamental content-con-
of what speakers mean or would mean by using the expression ferring properties a matter of the propositional attitude state as
on various occasions. By this two-part reduction (of expres- whole? To take the former option is to endorse the language of
sion-meaning to speaker-meaning, and speaker-meaning to thought hypothesis (Fodor 1975; Rey 1995) and to take the latter
communicative intentions), the intentionality of language is is to reject it (Stalnaker 1990; Blackburn 1984).
explained in terms of the intentionality of intentions. Critics Whether or not the language of thought hypothesis is true,
of this approach have focused on its inability to explain uses the principal challenge in constructing a theory of content is to
of language in thought and apparently normal examples of specify the properties that confer contents on those representa-
communication in which speakers lack the requisite commu- tions. Here, the proliferation of theories is such that it is hardly
nicative intentions (Chomsky 1975; Schifer 1987). But despite possible to do better than the following list of candidate comple-
the problems faced by speciic versions of this reductive pro- tions of an internal representation x has the content p if and
gram, there is widespread agreement that there is some way of only if:
explaining the intentionality of language via the intentional-
x is actually caused by ps being the case / ps being the case
ity of the mental states of language users if not their inten-
would, under epistemically ideal conditions, cause that inter-
tions, then perhaps their beliefs (Lewis 1975). (Opposed views
nal state (Stalnaker 1984) / x covaries with ps being the case
of the source of the intentionality of language are defended in
during the learning period when the state is acquiring a
Laurence 1996 and Brandom 1994.)
content (Dretske 1981).
The Reduction of Original Intentionality It is the biological function of x to be present when p is the
Supposing that there is a genuine distinction between original case (Millikan 1989).
and derived intentionality, there is a further question about x has nomological connections of speciied kinds with prop-
whether original intentionality can itself be explained. he sec- erty p (Fodor 1990).
ond thesis about intentionality often associated with Brentano here is an isomorphism between the system comprised of x
is that it cant be: Original intentionality is not only deinitive of and the rest of the agents internal representations and a sys-
mentality but also inexplicable in nonintentional terms. By contem containing p which maps x onto p (Cummins 1989).
trast, the view dominant in recent years may be summed up as A (speciied) theory maps xs functional role its causal con-
follows: nections to perceptual input, behavioral output, and other
internal representations onto p (Block 1986; Harman [1988]
I suppose that sooner or later the physicists will complete the 1999).
catalogue theyve been compiling of the ultimate and irreduc-
ible properties of things. When they do, the likes of spin, charm, he discussion so far leaves open an important metaquestion
and charge will perhaps appear on their list. But aboutness surely about intentionality: Supposing that there is no reduction of
wont; intentionality simply doesnt go that deep. If aboutness original intentionality to nonintentional facts, what attitude
is real, it must really be something else. (Fodor 1987, 97) should we take toward the claims about the intentionality of
mental states to which we unhesitatingly subscribe in daily life?
In part because most recent theorists have adopted the view, Some who have rejected such analyses have put alleged inten-
sketched here, that original intentionality is found at the level of tional facts into the same category as alleged facts about phlo-
thought, these theorists have approached the task of explaining giston, witches, and other posits of false theories (Quine 1960;
original intentionality by constructing theories of mental con- Churchland 1981); others have taken the failure of reductions of
tent. he standard method of theory construction takes as given original intentionality to show that intentionality is an unana-
the following broad thesis: Being in a certain mental state is a lyzable feature of the world, and no less real for that (Chisholm
matter of being in an internal state that has properties that make 1957; Searle 1983).
it a mental state of the relevant type with the relevant content.
his view is sometimes called the representational theory of the Jef Speaks
mind though this label is sometimes used for the conjunction
of the present view with the language of thought hypothe- WORKS CITED AND SUGGESTIONS FOR FURTHER READING
sis (about which more later) and other times is called function-
Blackburn, S. 1984. Spreading the Word. Oxford: Clarendon Press.
alism though this label is sometimes used for the conjunction
Block, N. 1986. Advertisement for a semantics for psychology. In
of the present view with the thesis that the content-determining Mental Representation: A Reader, ed. S. Stich and T. Warield, 81141.
properties of internal states are their functional roles. Cambridge, MA: Basil Blackwell.
he natural next questions are, therefore: What properties of Brandom, R. 1994. Making It Explicit. Cambridge: Harvard University
internal states make them mental states of a certain type with Press.
a certain content? And which internal states are the bearers of Brentano, F. [1874] 1997. Psychology from an Empirical Standpoint.
content? London: Routledge.
400
Intentionality Internal Reconstruction
Byrne, A. 2001. Intentionalism defended. Philosophical Review . 1982. Intention-based semantics. Notre Dame Journal of Formal
110: 199240. Logic 23: 11956.
Carnap, R. 1947. Meaning and Necessity: A Study in Semantics and Modal . 1987. Remnants of Meaning. Cambridge, MA: MIT Press.
Logic. Chicago: University of Chicago Press. Searle, J. 1983. Intentionality. New York: Cambridge University Press.
Chisholm, R. 1957. Perceiving: A Philosophical Study. Ithaca, NY: Cornell Stalnaker, R. 1984. Inquiry. Cambridge, MA: MIT Press.
University Press. 1990. Mental content and linguistic form. In Context and Content,
Chomsky, N. 1975. Relections on Language. London: Temple-Smith. 22540. New York: Oxford University Press.
Churchland, P. 1981. Eliminative materialism and the propositional Tye, M. 1995. Ten Problems of Consciousness: A Representational heory
attitudes. Journal of Philosophy 78.2: 6790. of the Phenomenal Mind. Cambridge, MA: MIT Press.
Crane, T. 1998. Intentionality as the mark of the mental. In Current Issues
in Philosophy of Mind, ed. A. OHear, 22951. Cambridge: Cambridge
INTERNAL RECONSTRUCTION
University Press.
Cummins, R. 1989. Meaning and Mental Representation. Cambridge, Internal reconstruction (IR) is a method, or group of methods,
MA: MIT Press. used to establish unattested earlier forms of languages. It difers
Dretske, F. 1981. Knowledge and the Flow of Information. Cambridge, from the comparative method, which has similar aims, in
MA: MIT Press.
being based on the features of one language, without external
Fodor, J. 1975. he Language of hought. Hassocks, UK: Harvester.
reference, and it is therefore appropriately applied to languages
. 1987. Psychosemantics: he Problem of Meaning in the Philosophy
without relatives, such as language isolates or reconstructed
of Mind. Cambridge, MA: MIT Press.
. 1990. A theory of content, II: he theory. In A heory of Content protolanguages. Languages reconstructed by these means are
and Other Essays, 89136. Cambridge, MA: MIT Press. referred to as prelanguages, as opposed to the protolanguages
Frege, G. 1892. On sense and reference. In Translations from the that result from the application of the comparative method.
Philosophical Writings of Gottlob Frege, ed. P. Geach and M. Black, IR arose in the later nineteenth century as an extension of
5678. Oxford: Basil Blackwell. the comparative method. For example, Ferdinand de Saussures
. 1918. hought. In he Frege Reader, ed. M. Beaney, 32545. reconstruction of sonant coeicients later termed laryngeals
Oxford: Basil Blackwell. for Indo-European was based largely on this method, though it
Grice, P. 1957. Meaning. Philosophical Review 66.3: 17788. was not recognized as legitimate at the time, and his reconstruc-
. 1969. Utterers meaning and intentions. In Studies in the Way of
tions were largely ignored. he method was identiied in the early
Words, 86116. Cambridge: Harvard University Press.
twentieth century but was considered too speculative, method-
Harman, G. [1988] 1999. Wide functionalism. In Reasoning, Meaning,
and Mind, 23543. Oxford: Clarendon.
ologically unsound, and lacking controls on its results. It was not
Husserl, E. [1901] 2002. Logical Investigations. 2 vols. until documentary evidence for laryngeals was found in Hittite
London: Routledge. that its value was recognized, and then not universally.
Kaplan, D. 1989. Demonstratives. In hemes from Kaplan, ed. IR is not a single method but a group of approaches having
J. Almog, J. Perry, and H. Wettstein, 481563. Oxford: Oxford in common reliance on internal features of the language. It is
University Press. possible to identify three such approaches, which are not nec-
Kripke, S. 1972. Naming and Necessity. Cambridge: Harvard University essarily completely distinct nor mutually exclusive: historical
Press. morphophonemics, regularization of systems, and universal and
. 1982. Wittgenstein on Rules and Private Language: An Elementary typological reconstruction.
Exposition. Cambridge: Harvard University Press.
Laurence, S. 1996. A Chomskian alternative to convention-based seman-
Historical Morphophonemics
tics. Mind 105: 269301.
his approach, applicable only to phonology, is fairly well
Lewis, D. 1975. Languages and language. In Language, Mind, and
Knowledge, ed. K. Gunderson, 335. Minneapolis: University of
deined methodologically. It is based on the structuralist
Minnesota Press. Reprinted in Lewis 1983, 16388. principle of the morphophoneme, which also corresponds to
. 1983. Philosophical Papers. Vol. 1. Oxford: Oxford University the underlying phoneme of classical generative phonology.
Press. Morphophonemes comprise a set of morphologically related
Millikan, R. 1989. Biosemantics. Journal of Philosophy 86.6: 28197. phonemes, that is, occurring in diferent forms of the same
Moran, D. 1996. Brentanos thesis. Proceedings of the Aristotelian morpheme. he principle is that the diferent phonemes result
Society Supplementary Volume 70: 127. from the application of diferent phonological changes in difer-
Perler, D., ed. 2001. Ancient and Medieval heories of Intentionality. ent contexts; by reducing these alternating phonemes to a single
Boston: Brill.
form, we efectively establish the original phoneme from which
Pietroski, P. 2003. he character of natural language semantics. In
they are derived. For example, in Latin /s/ became /r/ between
Epistemology of Language, 21756. Oxford: Oxford University Press.
vowels, giving alternations such as los/lor-. Synchronically, this
Quine, W. V. 1953. Two dogmas of empiricism. In From a Logical Point
of View, 2046. Cambridge: Harvard University Press. provides us with a morphophoneme {S}; historically, we assume
. 1960. Word and Object. Cambridge, MA: MIT Press. that this corresponds to an original phoneme /s/. A weakness of
Rey, G. 1995. A not merely empirical argument for a language of this method is that it assumes that there was no morphophone-
thought. Philosophical Perspectives 9: 20122. mic alternation in earlier stages of languages.
Rorty, R. 1979. Philosophy and the Mirror of Nature. Princeton,
NJ: Princeton University Press. Regularization of Systems
Russell, B. 1905. On denoting. Mind 14: 47993. While historical morphophonemics works by eliminating
Schifer, S. 1972. Meaning. Oxford: Oxford University Press. alternations and apparent irregularities, the regularization of
401
Internal Reconstruction Interpretation and Explanation
systems method involves, in efect, the generalization of this where the comparative method cannot be applied. he meth-
process to whole systems. hus, where there are inconsistencies ods and especially the approach that relies on universals and
in paradigms for example, with diferent classes of nouns or typology are extremely powerful, and for that very reason they
verbs the method seeks to eliminate these diferences in order have often been regarded with some suspicion and must be used
to produce a single paradigm, on the assumption that inconsis- with caution.
tencies arise through change. For example, the diferent declen-
Anthony Fox
sions or conjugations of Latin and Greek or the forms of Germanic
strong verbs can be reduced to single patterns. Again, a weakness
of this approach is that it assumes complete consistency in the WORKS CITED AND SUGGESTIONS FOR FURTHER READING
prelanguage. Campbell, Lyle. 1998. Historical Linguistics: An Introduction.
Edinburgh: Edinburgh University Press. See Chapters 7 and 8.
Universal and Typological Reconstruction Fox, Anthony. 1995. Linguistic Reconstruction: An Introduction to heory
his is the most speculative and perhaps methodologically the and Method. Oxford: Oxford University Press. See Chapters 7 and 8.
least controllable approach, which to some extent goes beyond Lehmann, Winfred. 1992. Historical Linguistics. London: Routledge. See
purely internal reconstruction inasmuch as it invokes general Chapter 8.
properties of languages. hese properties may be either universal
or typological. In the former case the method relies on language
INTERPRETATION AND EXPLANATION
universals, which are held to determine either the synchronic
structures of languages or the diachronic processes of change An interpretation is an account of what is said, done, and thought
(see synchrony and diachrony). hese have the efect of by some person or people an account at the level of content. An
constraining possible reconstructions since we may not postu- interpretation may take as its subject some particular episode
late earlier states nor processes of change that do not conform to involving some agents action. Consider a bosss remarks to a
the universals. new employee about her skirt. (Is the boss generally a fashion
hus, in phonology we may assume universal phonetic maven? Does the boss wear skirts or chase skirts?) What is the
processes (for example, the palatalization of velar consonants agent saying, doing, and thinking? An interpretation may take as
in the neighbourhood of front vowels); a reconstruction that its subject some kind of utterance or activity in a social group.
assumes the reverse (palatalization with back vowels) would be Consider some students at a game chanting Were gonna beat
ruled out. Similarly, the universal process described by the term the hell out of you. What were they really saying, doing, and
grammaticalization the development of lexical items thinking? An understanding of such matters an interpretation
into grammatical items precludes reconstructing the reverse. depends epistemically on an associated explanatory understand-
here may also be constraints on structures and systems; if all ing of why such things are said and done.
languages must have, say, vowels, then we cannot reconstruct An interpreted episode or a phenomenon typically involves
an earlier stage without them. In all such cases, our reconstruc- some linguistic activity. Yet an interpretation is more than a
tions will seek to establish earlier forms in compliance with straightforward translation for those utterances. he chanting
these universal constraints. of the students can be homophonically translated. But is their
he typological approach (see typology) is similar, but it utterance primarily a part of a ritual of hope and group identi-
relects language types rather than universal properties. he ication or an expression of belief? Consider also one historical
principle here is that language typology involves not just iso- episode: George W. Bushs insistance that it is morally wrong
lated features but, rather, sets of harmonizing or co-occurring to destroy life in order to save life (quoted in Stolberg 21 May
features. According to one widely used typology, languages fall 2005). Again, for speakers of English, homophonic translation is
into two basic types, VO and OV, according to the order of verb uncontroversial. Yet just what was expressed? What would his
and object, but this ordering is also relected in the order of other principle allow, and what would it rule out? What is Bush saying
items, such as noun and adjective (VO languages generally also and doing here?
have NA, and OV languages have AN), and the occurrence of Interpretation goes beyond translation in part because inter-
prepositions or postpositions (VO languages favor the former, pretation comes to terms with pragmatic elements of an
OV languages the latter). Given such principles of harmony, a utterance what speech-acts are performed and how con-
language that has one such order, say, NA, will be expected to versational implicature may outrun what is explicit. An
have the other harmonic orders; if it does not, we can reconstruct acceptable treatment of these matters is bound up with rich
this harmony for an earlier period of the language. In the case understandings of agents situated projects their beliefs and
of English, for example, which has VO but AN, we can assume desires.
an earlier stage that had either NA or OV (the latter is assumed). An interpretation may constrain the translation of some sub-
Other typological parameters can be used in an analogous fash- jects language on which it depends. his is common in anthro-
ion. A weakness of this method, however, is that it assumes that pological studies of a peoples religion or magic as one
languages will originally have been typologically consistent, translation may strongly suggest an understanding that is at odds
though consistency is not necessarily an inherent attribute of with an anthropological interpretation, whereas an alternative
language. translation may support that interpretation. he classic debates
In spite of potential weaknesses, the methods of IR have between symbolist and intellectualist anthropologists over how
proved useful in establishing earlier stages of languages in cases to interpret various folk religions had implications for whether
402
Interpretation and Explanation
the associated linguistic constructions should be translated into more, bears on which of several possible interpretations is ulti-
terms that suggested theories about causes (as intellectu- mately most satisfactory. Bush might be understood as having
alists such as Horton 1970, 1982 urge), or into more guarded advanced a moral principle that is implicitly qualiied so that
terms that suggested symbolic expression (as symbolists such as there is no inconsistency with his military policies. Alternatively,
Leach 1954 and Beattie 1970 urge) perhaps understandings there might have been unnoticed inconsistency as all humans
and inluences. Here, translation (treating the literal said) and exhibit some inconsistency. Finally, one might consider whether
interpretation (of what is ultimately said, done, and thought) are Bush might have noticed an inconsistency that he conveniently
interdependent. (Turner 1980; Henderson 1993; Risjord 2000; omits to mention.
Stueber 2006). As Donald Davidson (1984) argued, in settling he central point concerns how diverse information allows
upon an interpretation and translation, several matters must one to decide among such alternative interpretations: It bears
be sorted out together interrelated matters having to do with on the explanation of what is said, done, and thought in the epi-
belief, desire, and meaning (see meaning and belief, radi- sode interpreted. he political concerns of Bush and his advis-
cal interpretation, agreement maximization, and ers might help to explain the assertion as political boilerplate.
charity, principle of). Certainly the staking out of political positions, the responsive-
What, then, is the mark of a good interpretation? Surprisingly, ness to constituencies, and the like can lead politicians to cast
many theorists, both historical and contemporary from various about for simple (simplistic) formulations of sweeping principles
traditions, can be seen to agree on an answer to this epistemo- in which to wrap the desired policy. Is this the explanation, and
logical question: A successful or adequate interpretation afords interpretation, of Bushs assertion? Understood and explained
an explanatory understanding of what is said, done, and thought. as boilerplate, these questions about the exact content of the
An interpretation of someone or some folk as doing or saying asserted principle (whether or how it is implicitly qualiied), the
some sort of thing is of a piece with an explanatory understand- way it then squares with various policies, and the obviousness of
ing. Epistemologically, the two stand or fall together. he agree- inconsistency (if any) may become less pressing as they matter
ment here is pervasive but does not run deep. Writers quickly less to the explanation of the episode. What is then central is the
come to difer over what makes for a successful explanation of sense of certain politicians for how taking the moral high road
thought and deed and, thus, what marks good interpretation. will play in the relevant constituencies. But perhaps one senses
(For example, those urging a strong principle of charity such that Bush is a politician whose public face is here tied to his
as Davidson typically think of explanation here as a matter of own moral view of the world. hen, the degree of inconsistency
exhibiting rationality, while others may see more place for expla- and how he could be insensitive to it become more central as a
nations that do not rationalize. Various conceptions of what matter of explanatory concern. In either case, one is informed
make for explanatory understandings are discussed later.) both by a general sense for what makes humans tick (cogni-
Focus for a time on the agreement. Consider again Bushs tively and otherwise) and for an antecedently formed impres-
assertion that it is wrong to destroy life in order to save life. he sion of this person in particular by the sedimented results of
irst thing to notice is the wide range of information that we rec- past interpretations. his is to draw upon generic resources for
ognize as relevant to its interpretation. It was uttered in the con- explaining human action and thought and on resources more
text of legislation regarding research using embryonic stem cells. speciic to this individual. Again, each is the fruit of past prac-
In keeping with the venerable discussions of hermeneutics (see tice that is both interpretive and explanatory. One has a sense for
philology and hermeneutics), one should consider what how humans think and act and for variations. One has a sense
was also said in the wider context in which the assertion was for Bushs character as one variation, one that is the product
advanced. Its interpretation is bound up with the interpretation of his biography, and one that is evinced in past interpreted and
of the whole of Bushs remarks which, in turn, is dependent explained practice.
on successful treatment of the various component utterances. Just how one ends up interpreting the assertion in question
Information having to do with much beyond this set of remarks is and the associated political or moral act depends on the explana-
also relevant. For example, how did Bush and his political advis- tion for this episode that one judges to be most likely, given all that
ers understand their political situation? hat Bush was politically one knows about the agent or agents involved and about humans
beholden to right-wing conservative Christians even was one as agents. Interpretation, it seems, is a matter of abduction or
is relevant. hat they hold that human life begins at conception is inference to the best explanation of an ongoing, always revis-
relevant. So also is information regarding the wider political con- able, sort. (Readers should ind it easy to explore many further
text. Was the conservative Christian component of his political alternatives and wrinkles in understanding the Bush case and
base at that time disenchanted and thought to need iring up? to assure themselves that their plausibility devolves onto the
Sweeping moral statements are commonly best interpreted as question of greater or lesser explicability.)
containing an implicit ceteris paribus clause. One might wonder he interdependence of interpretation and explanation, their
if Bushs statement should be understood as likewise qualiied. being two faces of the same coin, is widely appreciated. What is
What is known about whether Bush would authorize the military contested is how to understand the explanation of thought and
to destroy lives to save lives? How does Bush understand col- deed. here are two broad schools of thought here, although rep-
lateral damage in warfare? Is the military context one in which resentatives of each have been diverse.
ceteris is not paribus? Is the principle unqualiied except that Some think of explanation as subsumption as a matter of
it expresses a prima facie duty that can be overridden by other deploying generalizations to show that the case or phenomenon
duties? But what other duties? All this information, and much in question was dependent on certain antecedents. To conceive
403
Interpretation and Explanation
of the explanation of thought and action along these lines is to To illustrate, economic theory might inform our explanation
generalize a common understanding of scientiic explanation. C. and interpretation of an individuals verbal exchange with his or
Hempel (1965) provides a particularly clear variant in his vener- her broker as the reception of and risky use of insider information
able hypothetical-deductive and statistical-probabilistic models to avert a loss, interpreting and explaining this as an expectable
of explanation. (However, there are reasons to think that the form of proit maximization in light of the agents understand-
approach to scientiic explanation is itself lawed; see, for exam- ing of situated risks of detection. (Perhaps the agent said, hat
ple, Salmon 1989.) A very diferent subsumptive model is pro- is unacceptable exposure to risk. he explanation envisioned
vided by James Woodwards (2000) discussions of explanation in supports interpreting this as an instruction to sell, rather than as
the special sciences. Woodward thinks that the generalizations a merely general point about levels of risk in certain portfolios.)
produced and deployed are themselves not exceptionless nomic Without drawing on theory, one might imaginatively put oneself
generalizations, but relatively invariant generalizations holding in the agents shoes (under tentative interpretation) and see that
within imperfectly speciied limits. one would do the same as the simulation theorist envisions. In
he second school of thought views explanation of thought both cases, one appreciates that with diferent information and
and action as revealing intelligibility explanation is understood antecedent beliefs and values, the agent would have done system-
as a matter of understanding what is said and done as having or atically otherwise. hus, generalizations about economic agents
expressing a signiicance or meaning so that the whole thereby and imaginative simulation might support a single understand-
becomes intelligible. Just what being intelligible comes to is ing and explanation of the agent answering the why-question by
itself understood variously. Certainly a kind and degree of inter- attention to the same dependencies (under interpretation).
nal contentful coherence plays a role as does the interpreters If explanation is understood as a matter of answers to how-
ability to then see how and why one would do as indicated on the and why-questions (and associated what-if-things-had-been-
basis of such reasons. hose who draw on the hermeneutic tradi- diferent questions), then what have been taken to be competing
tion are representatives. So also is R. G. Collingwood (1946) with accounts of explanation can be understood in a fashion that
his conception of explanation as reenactment. Something like renders them compatible, and afords us a multifaceted under-
Collingwoods approach has enjoyed a contemporary revival of standing of the explanatory practice associated with interpreta-
sorts within cognitive psychology where there has been much tion. he pivotal move is to abandon each tradition insofar as
work on explanation of thought and deed as a kind of simula- it attempts to delimit what counts as an explanation treating
tion in which ones own cognitive processes are taken of line the erotetic understanding as the more generic and acceptable
and put to work on imagined input that relects ones provisional account of explanation. hen, approaches such as Woodwards
interpretation of a subject. One imaginatively puts oneself in (appealing to subsumption under generalizations with signii-
the others shoes and deliberates or reasons; if ones simula- cant invariance) and the simulation theorists (appealing to the
tion then accords with observed actions or expressed beliefs, one imaginative use of our of-line cognitive capacity) can each be
has a prima facie successful explanation (see Stueber [2006] for a recognized as part of a full epistemic story. Subsumption and
recent overview and discussion). simulation each have their role in coming to appreciate why the
As noted, the representatives of each approach have been agent or agents acted thus and what they would have done if
diverse. Still, we can appreciate that while proponents of these things had been relevantly diferent. hus, my suggestion is that
approaches difer over what makes for explanation, they com- explanation is the mark of a good interpretation, that explana-
monly understand that interpretation epistemically depends tion typically itself supposes a tentative interpretation, that the
on explanation. (It is worth adding that most would also rec- generic understanding of explanation is that provided by erotetic
ognize that the information supposed in an explanation itself logic, and that it commonly is attained by some amalgamation of
depends on interpretations so that there is ultimately a holistic the epistemic resources that have been of concern in traditions
epistemic interdependence here.) A uniied understanding can that have wrongly claimed to provide the account of explana-
then be had by way of drawing on a very general understanding tion in the human sciences.
of explanation. When we do this, elements of the diverse inter-
David Henderson
pretive/explanatory approaches can be understood as comple-
mentary epistemologies of explanation, rather than as competing
accounts of what it is to explain thought and action. Let me WORKS CITED AND SUGGESTIONS FOR FURTHER READING
explain by drawing on what has come to be called the erotetic Beattie, J. H. M. 1970. On understanding ritual. In Rationality, ed. Bryan
account of explanation (see Salmon 1989; van Fraassen 1980). Wilson, 24068. Worchester, UK: Basil Blackwell.
his label refers to the logic of questions and answers erotetic Collingwood, R. G. 1946. he Idea of History. Oxford: Clarendon.
logic. On this approach, an explanation is an answer to a ques- Davidson, Donald. 1984. Belief and the basis of meaning. In Inquiries
tion typically a why-question or a how-question. In either case, into Truth and Interpretation, 14154. Oxford: Oxford University
the resources for answering the question allow one to under- Press.
Hempel, C. 1965. Aspects of Scientiic Explanation and Other Essays. New
stand a pattern of dependencies. he resources for answering a
York: he Free Press.
why-question (let us focus on these) should allow us to appreci-
Henderson, David. 1993. Interpretation and Explanation in the Human
ate that what is done or said was dependent on certain standing
Sciences. Albany: State University of New York Press.
states and events. If we can do this, we can answer a range of Horton, Robin. 1970. African traditional thought and Western sci-
associated what-if-things-had-been-diferent questions explor- ence. In Rationality, ed. Bryan Wilson, 13171. Worchester, UK: Basil
ing these dependencies (see possible worlds semantics). Blackwell.
404
Interpretive Community Intertextuality
. 1982. Tradition and modernity revisited. In Rationality and same person writes two diferent texts for himself when reading
Relativism, ed. Martin Hollis and Steven Lukes, 20160. Cambridge, from diferent interpretive communities, for he understands the
MA: MIT Press. Hebrew texts as prophesies of Jesus Christ only after his conver-
Leach, E. R. 1954. he Political Systems of Highland Burma. London: Bell. sion at Damascus.
Risjord, Mark. 2000. Woodcutters and Witchcraft. Albany: State University
Fishs theory has been criticized for making words have no
of New York Press.
meaning. He responds with just the opposite: Words always have
Salmon, Wesley. 1989. Four decades of scientiic explanation. In
Scientiic Explanation, Minnesota Studies in the Philosophy of Science.
meaning, in fact many meanings, all of which are constructed by
Vol. 13. Ed. P. Kitcher and W. Salmon, 3219. Minneapolis: University situated readers in various communities. Fish adds that his the-
of Minnesota Press. ory is sociological, not normative, that is, it describes only what
Stolberg, Sherly Gay. In Rare hreat, Bush Vows Veto of Stem Cell Bill. people say (or think) a text means; it does not prescribe how
New York Times, 21 May 2005. Stueber, Karsten. 2006. Rediscovering we ought to interpret texts. Finally, to the objection that some
Empathy. Cambridge, MA: MIT Press. authors use certain techniques to ensure that their texts convey
Turner, Stephen. 1980. Sociological Explanation as Translation. certain meanings, he responds that those meanings come to frui-
Cambridge: Cambridge University Press. tion only if the reader belongs to the same interpretive commu-
van Fraassen, Bas. 1980. he Scientiic Image. Oxford: Oxford University nity as that author.
Press.
Woodward, James. 2000. Explanations and invariance in the special Jefrey R. Wilson
sciences. British Journal for the Philosophy of Science 51: 197254.
WORK CITED
INTERPRETIVE COMMUNITY Fish, Stanley Eugene. 1980. Is here a Text in his Class? he Authority of
Interpretive Communities. Cambridge: Harvard University Press.
In 1976, literary critic Stanley Fish used this term to describe
the unspoken (often unknown) alliances among readers who
INTERTEXTUALITY
share similar strategies for determining what a text means. his
theory of pragmatics, he says, is the explanation for both the Building on Mikhail Bakhtins (1981) discussion of the dialogic
stability of interpretations among diferent readers (they belong nature of language, Julia Kristeva (1986) coined the term inter-
to the same community) and for the regularity with which a sin- textuality for the multiple ways in which texts refer to and draw
gle reader will employ diferent interpretive strategies and thus on other texts. his notion highlights the interconnectedness of
make diferent texts (he belongs to diferent communities) (Fish texts and challenges deep-rooted literary values, such as auton-
1980, 171). omy, uniqueness, and originality (Allen 2000, 56). An inter-
he notion of interpretive community insists upon the pri- textual perspective views text production as a social practice in
macy of situated readers, and it can be thought of as a theory which diferent texts, genres, and discourses are drawn upon and
of creative reading. Fish says that a set of general assumptions text consumption as a process in which readers may bring addi-
on how one ought to interpret a text precedes every act of inter- tional texts not only those that have shaped production into
pretation; thus, a reader always perceives a given text within an the interpretation process (Fairclough 1992, 845). he study
already in-place hermeneutical framework. One does not read of intertextuality does not focus solely (or even primarily) on
the words on a page and then decide what those words mean the speciic prior texts that are brought into play in a given text;
because no temporal separation exists between acts of percep- rather, it also examines the implicit texts underlying production
tion and interpretation. Instead, ones community conditions and interpretation (e.g., presuppositions, genre conventions)
how its members read those words in the irst place. As such, (Culler 1976, 1388). hus, a newspaper crime report has inter-
readers actually write a text for themselves as they read, for they textual links not only to eyewitnesses accounts and previous
have a tool kit of interpretive strategies always at work determin- reports on the same and/or similar events but also to newswrit-
ing what certain words will mean should they arise in a given ing conventions, propositions that the journalist takes as given,
context. Readers using the same tool kit belong to the same and even the journalists/readers understanding of crimes in
community. general.
One can see interpretive communities at work in Christian Reported speech, a prime example of intertextuality, has been
typology, a mode of biblical exegesis that aims to square Old extensively studied in sociolinguistics. Reporting speech is
Testament texts with the events recounted in the New Testament. always a reformulation of the original act. Even if prior speech is
For the typologist, the belief that Jesus was God combines with reported verbatim, the reporting speaker may use prosodic fea-
other assumptions in order to form the exegetes set of interpre- tures like stress and intonation to indicate his/her interpre-
tive strategies. Other readers who share these strategies make up tation of the utterance, or he/she may frame the reported speech
this exegetes community even if they do not know one another, in such a way as to manipulate the addressees perception of the
which explains how two Christians might independently inter- reported speaker. In some cases, material represented as reported
pret some events in the Old Testament as prophesies of Jesus speech is not spoken by anyone at all. hese observations have
Christ. Of course, a Jew, Gnostic, or pagan produces a much dif- led Deborah Tannen (1989) to conclude that reported speech is
ferent meaning of those same Hebrew texts because he or she primarily the creation of the reporting speaker and serves to cre-
works from a community that reads/writes those texts difer- ate a sense of interpersonal involvement between the reporting
ently. And inally, a look at Paul of Tarsus demonstrates how the speaker and the addressee in the reporting context.
405
Intertextuality Intonation
Reported speech is an example of what Norman Fairclough only because of the stereotype of Spanish speakers as lazy
(1992) calls manifest intertextuality. Manifest intertextuality and procrastinating.
refers to the way in which speciic other texts are overtly drawn Several issues continue to dominate sociolinguistic research
upon within a text. In addition to reported speech, it covers such on intertextuality. One, to which I have alluded, focuses on
phenomena as irony, negation, and presupposition. In the relation of intertextuality to power. Others are concerned
contrast, constitutive intertextuality also known as interdiscur- with the conditions that make decontextualization and recon-
sivity refers to the way in which texts draw on abstract sets of textualization possible, as well as the semantic and functional
conventions like genres and styles. In their research on inter- changes that texts undergo as a result of recontextualization.
discursivity, several critical discourse analysts have Intertextuality also raises interesting issues about authorship.
noticed a widespread appropriation of conversational styles in If all texts are created out of prior texts and conventions, what
public discourse. Focusing on a consumer feature about lipstick is an author, and who is responsible for what is said/written?
from a British teenage magazine, Mary Talbot (1995) examines hese issues are likely to be worked out diferently in diferent
how the text producer exploits features of conversational speech cultures.
(e.g., the pronoun you, expressive punctuation) to establish an
Andrew Wong
informal, friendly relationship with the reader. his practice,
however, is far from benign. Under the guise of ofering sis-
terly advice, the consumer feature serves as covert advertising
and trains teenage readers to become consumers of cosmetic Allen, Graham. 2000. Intertextuality. London: Routledge.
products. Bakhtin, Mikhail. 1981. he Dialogic Imagination. Ed. Michael Holquist,
Generic intertextuality, a notion developed by anthropologists trans. Caryl Emerson and Michael Holquist. Austin: University of Texas
Press.
Richard Bauman and Charles Briggs, can be viewed as a particu-
Bauman, Richard. 2004. A World of Others Words: Cross-Cultural
lar kind of interdiscursivity. hey deine generic intertextuality as
Perspectives on Intertextuality. Malden, MA: Blackwell.
the construction of the relationship between a text and a genre,
Briggs, Charles, and Richard Bauman. 1992. Genre, intertextuality, and
and they are interested in how and for what purposes this rela- social power. Journal of Linguistic Anthropology 2: 13172.
tionship is established in communicative practice (Briggs and Culler, Jonathan. 1976. Presupposition and intertextuality.
Bauman 1992). hey see genre as a speech style that serves as an MLN: Modern Language Notes 91: 138096.
orienting schema for the production and reception of discourse. Fairclough, Norman. 1992. Discourse and Social Change. Cambridge,
Genre interacts with such factors as the interactional context and UK: Polity.
the speakers/writers communicative goals in shaping a given Hill, Jane. 1993. Hasta la vista, baby: Anglo Spanish in the American
text. In turn, these factors may lead to the selective adoption of Southwest. Critique of Anthropology 13: 14576.
the constituent features of the generic framework and create an Kristeva, Julia. 1986. Word, dialogue, and the novel. In he Kristeva
Reader, ed. Toril Moi, 3461. New York: Columbia University Press.
intertextual gap between the text and its generic model. Briggs
Talbot, Mary. 1995. A synthetic sisterhood: False friends in a teenage
and Bauman argue that research on strategies for manipulating
magazine. In Gender Articulated, ed. Kira Hall and Mary Bucholtz,
intertextual gaps between texts and their generic schemas can
14365. Routledge: New York.
shed light on issues of power, ideology, and political economy. Tannen, Deborah. 1989. Talking Voices. New York: Oxford University
In cultures with traditional genres that are invested with great Press.
power, speakers/writers often minimize the distance between
their texts and these genres. his serves as a powerful strategy
INTONATION
for creating textual authority. At times, however, speakers/writ-
ers may maximize the intertextual gaps between texts and their this term refers to the fundamental frequency (or its percep-
generic models. hey may do so to resist the hegemony of estab- tual correlate, pitch) contour associated with phrases and other
lished genres or to claim authority in cases where creativity is large prosodic units. Language communities use intonation to
highly valued. serve a wide range of functions, both grammatical and discourse
Appropriation, a speciic case of intertextuality, refers to the based. For example, intonation is used to signal prosodic bound-
practice of adopting words, expressions, or ways of speaking aries: he ends of utterances are characteristically associated
that are generally thought to belong to someone else. Many with terminal pitch excursions, either a rise or a fall depending
white American teenagers use elements of African-American on semantic factors. Another function of intonation is to cue
vernacular English (AAVE) in their speech so as to align them- many types of semantic distinctions, such as the diference
selves with hip-hop culture and/or to project an urban youth between yes/no questions and neutral declarative statements
identity by exploiting certain connotations of AAVE (e.g., tough- in many languages. Intonation is also used to convey emotional
ness). Appropriation, however, may also serve disailiating or (see emotion and language) and expressive states, as well as
even denigrating purposes. In the United States, some monolin- pragmatic information.
gual Anglos use what anthropologist Jane Hill (1993) call Mock Intonation is a universal property in that speakers of all lan-
Spanish that is, a subregister of colloquial English made guages manipulate pitch to communicate linguistic and para-
up of (pseudo-)Spanish expressions (e.g., maana) to proj- linguistic functions (see paralanguage and phonology,
ect a congenial persona. Yet to make sense of Mock Spanish, universals of). Even languages that use pitch to diferentiate
one also requires access to certain racist beliefs about Spanish individual lexical items, for example, tone languages such as
speakers. Maana works as a humorous substitute for later Mandarin Chinese and pitch accent languages such as Swedish,
406
Intonation Irony
also have intonation systems that are evident when words are (Dutch, English, and German), tone languages (Cantonese and
grouped into larger prosodic constituents or uttered in isolation. Mandarin Chinese), pitch accent languages (Japanese, Swedish,
Intonation systems may vary substantially, however, from lan- and Serbo-Croatian), languages lacking word-level stress, lex-
guage to language and also potentially from speaker to speaker. ical tones, or pitch accents (Korean and French), and indige-
hus, while a yes/no question is associated with a terminal rise in nous languages of North America and Australia (Chickasaw and
pitch in many languages (e.g., German, Japanese, and Korean), Bininj Gun-wok, respectively).
there are other languages (e.g., Finnish and Chickasaw) that An issue common to the intonation systems of all languages
mark yes/no questions with a inal pitch fall. One nearly univer- is the mapping between intonational tunes and their meanings.
sal property, however, is the lowered pitch characterizing the Because intonation is used to convey many subtle diferences in
end of semantically neutral declarative utterances. meaning, often in gradient fashion, it is a challenge to determine
he study of intonation has witnessed many important theo- which diferences in intonation merit diferent phonological anal-
retical advances over the last 30 years. Whereas certain schools yses. Phonological distinctions between intonational tunes must
of intonation (e.g., the British school) described pitch contours be captured as diferences either in the sequence of tones com-
in terms of their overall shape or gestalt, many current models prising the tunes or in the alignment of those tones with words.
of intonation capture changes in pitch in terms of discrete tonal Yet another topic inextricably linked to intonation is pro-
sequences, thereby bringing the study of intonation into line with sodic constituency. Since pitch excursions are often observed
the analysis of segmental and word-level phonological phenom- at prosodic boundaries, a comprehensive analysis of intona-
ena. his type of approach assumes that peaks and troughs in a tion hinges on the characterization of the types of constituents
fundamental frequency contour are attributed to phonological constituting utterances. Research has shown that prosodic con-
high and low tones aligned with various phonological elements. stituency and the mapping between constituency and intonation
Actual surface phonetic intonation patterns result from inter- vary from language to language. For example, some languages
polation between these high and low tonal targets. divide utterances into groupings of words that are characterized
As one of the pioneering works in this school of intonation, by a tonal template. hus, phrases in Chickasaw (Gordon 2005)
Janet Pierrehumbert (1980) developed an analysis of English are associated with a LHHL sequence, whereby the irst and the
intonation in which a wide range of pitch contours conveying last low tonal targets associate with the beginning and the end of
several diferent semantic and pragmatic functions are captured phrases, respectively, and the two high tones associate with the
as a sequence of phonological high and low tones associated second syllable (or the irst syllable if it contains a long vowel or
with hierarchically arranged prosodic constituents. A funda- ends in a sonorant consonant) and the beginning of the inal syl-
mental insight of Pierrehumberts analysis is that tones may be lable, respectively.
classiied into two groups: those that are associated with promi- A number of researchers have published books providing
nent, that is, stressed, syllables and those that are associ- overviews of these and other issues in the study of intonation,
ated with the periphery, especially the end, of prosodic domains. including Ladd (1996), Cruttenden (1997), and Gussenhoven
Tones that are associated with stressed syllables are termed pitch (2004).
accents. Pitch accents difer according to whether they consist of
Matthew Gordon
a single tone, high or low, or a sequence of tones, such as H + L
or L + H, which phonetically yield a tone fall or rise, respectively. WORKS CITED AND SUGGESTIONS FOR FURTHER READING
In addition to pitch accents on certain stressed syllables, tonal
excursions are often observed at the end (and potentially the Cruttenden, Alan. 1997. Intonation. Cambridge: Cambridge University
Press.
beginning) of phrases. hese phrase-level tonal movements are
Gordon, Matthew. 2005. Intonational phonology of Chickasaw. In Jun
attributed to boundary tones, which may be associated with rela-
2005b, 30130.
tively small phrases, termed intermediate phrases, or with larger Gussenhoven, Carlos. 2004. he Phonology of Tone and Intonation.
phrases, termed intonation phrases or intonation units. Like Cambridge: Cambridge University Press.
pitch accents, boundaries may be characterized by a single tone Hirst, Daniel, and Albert Di Cristo, eds. 1998. Intonation Systems: A Survey
or a sequence of tones. For example, intonation phrase boundar- of Twenty Languages. Cambridge: Cambridge University Press.
ies in Korean consist of as many as ive tonal targets, for example, Jun, Sun-Ah. 2005a. Korean intonational phonology and prosodic tran-
LHLHL, which conveys a sense of annoyance on the part of the scription. In Jun 2005b, 20129.
speaker (Jun 2005a). Jun, Sun-Ah, ed. 2005b. Prosodic Typology the Phonology of Intonation
One of the challenges facing linguists interested in the and Phrasing. New York: Oxford University Press.
typological investigation of intonation is the relative dearth Ladd, D. R. 1996. Intonational Phonology. Cambridge: Cambridge
University Press.
of reliable descriptions of intonation on a broad cross sec-
Pierrehumbert, Janet. 1980. he phonology and phonetics of English
tion of languages. Fortunately, recent years have witnessed
intonation. Ph.D diss., Massachusetts Institute of Technology.
a dramatic expansion of cross-linguistic studies intonation. Reproduced by the Indiana University Linguistics Club, Bloomington,
Daniel Hirst and Albert Di Cristo (1998) studied intonation 1987.
in 20 languages, including several non-Indo-European lan-
guages. Jun (2005a) compiled investigations of 13 languages
IRONY
all analyzed within a Pierrehumbert-type framework. A typo-
logically and geographically diverse array of languages is dis- here are several types of irony described in the literature, all of
cussed in this work, including languages with word-level stress which rely on an incongruity or discrepancy between appearance
407
Irony
and reality. In dramatic irony, for instance, the incongruity is cre- (1995, 241). One cannot claim that the opposite counterpart to
ated by having the audience aware of information about which a the literal expression is that the kings were not celebrating with
character in a play is ignorant (such as in Sophocles Oedipus Rex). Te Deums or that the irony can be substituted by a literal expres-
In situational irony and irony of fate, the disconnect is between sion of the opposite, namely, that the kings were bewailing their
ideal expectations of justice and actual (or idealized) outcomes, defeat with lamentations.
such as would occur if Bill Gates, one of the worlds most wealthy Empirically, it has been shown that the processing of state-
individuals, won a lottery or as exempliied by Beethovens inabil- ments in a discourse context that emphasizes the irony is not
ity, on going deaf, to hear his own musical masterpiece. slower than that observed for the same statement in a discourse
he form of irony most studied in the language sciences that is consistent with its literal sense, a inding incompatible
is verbal irony, traditionally conceptualized as a trope in with predictions arising from the standard pragmatic approach.
which the meaning one intends to communicate is opposite Moreover, there is empirical evidence incompatible with the
of that expressed literally by the words that are used. hus, notion of rejection and substitution of the literal, demonstrat-
in Shakespeares Julius Caesar, when in his famous solilo- ing instead that the diference between the literal and nonliteral
quy (Act 3, Scene 2) Anthony states: For Brutus is an honor- sense is important in determining the magnitude of the per-
able man / So are they all, honorable men, it is understood ceived irony (the tinge hypothesis of Ellen Winner) and that rec-
that Anthony is emphasizing the opposite of being honorable, ognition of irony requires the processing of both of the opposing
namely, the dishonorable action of Brutus and the other con- meanings in order to determine that the two messages are in an
spirators. In principle, with irony one should be able to convey ironic relation (the indirect negation hypothesis of Rachel Giora
negative attitude by expressing something positive (as in the [2003]).
Shakespearean example above) or positive attitude by stating Other theories have de-emphasized the importance of the
something negative. literal expression as well and accord greater importance to psy-
here is also a general recognition that because the expressed chological factors. With pretense theory, there is the recognition
utterances are literally plausible, the recovery of ironic intent is that the ironist is taking on the role of a person who holds the
consequently highly context dependent and facilitated by sig- opinion expressed in the irony, thus mocking both the opin-
nals fashioned (or unintentionally employed) by the ironist. ion and the people who would hold it. Two competing theories
Tone of voice is one such hint in spoken language, but because (echoic mention and echoic reminder) share the assumption that
irony can be detected even when ironic intonation is not a verbal utterance can be seen as a mention or an allusion about
employed (such as when reading text), this cue is not necessary. the expression, about expectations, or about cultural norms. In
Other cues include hyperbole, understatement, and excessive this way, the ironist communicates his or her attitude about the
politeness, but it is generally agreed that there is no signal that actual and expected state of afairs (see Gibbs 1994, Chapter 8,
points exclusively to irony. for a review).
he context necessary for the recovery of ironic intent tradi- Extant theories have been criticized rightfully for their
tionally has been limited to the discourse in which the irony is inability to encompass all types of irony and for a theoretical
embedded, but in more recent years, the concept of context has vagueness that make them of questionable scientiic utility.
been widened, even for verbal irony, to encompass an ironic he importance accorded background context or identiication
environment that includes social-cultural factors, such as those of an ironic environment is especially problematic for process
dependent on discursive communities that share knowledge, and computational models of irony comprehension, given the
beliefs, values, and communicative strategies (e.g., Hutcheon failure to identify any signal of irony that is both necessary and
1994). suicient, though recent models based on constraint satisfac-
Over the past three decades, the standard pragmatic tion or graded saliency principles are encouraging (see Colston
approach to nonliteral language processing (see metaphor ) and Katz 2004)
has framed much of the discussion on irony. Based on the prag-
Albert N. Katz
matics of conversation and on speech-act theory, irony is
described accordingly as the outcome of a conversational
implicature initiated by a violation, or based on exploita- WORKS CITED AND SUGGESTIONS FOR FURTHER READING
tion, of H. P. Grices (1975) Maxim of Quality: he recognition Colston, H., and A. Katz. 2004. Figurative Language Processing: Social and
that the literal expression does not make sense in the context Cultural Inluences. Mahwah, NJ: Erlbaum.
in which it is produced leads one to initiate a search for a con- Gibbs, R. 1994. he Poetics of Mind. Cambridge: Cambridge University
text-appropriate interpretation in which the literal sense of the Press.
expression is denied, suppressed, and replaced by the logical Giora R. 2003. On Our Mind: Salience, Context and Figurative Language.
opposite.
Grice, H. P. 1975. Logic and conversation. In Syntax and
here have been challenges over the years regarding the tra-
Semantics: Speech Acts, ed. P. Cole and J. Morgan, 4158. New
ditional emphasis on the initial processing of the literal expres-
sion and then substituting it with its opposite meaning, on both Hutcheon, L. 1994. Ironys Edge. London: Routledge.
logical and empirical grounds. Consider, for instance, the analy- Katz, A., ed. 2000. Uses and processing of irony and sarcasm. Metaphor
sis of a passage from Voltaires Candide given by Dan Sperber and Symbol 15 (Special Issue): 1116.
and Deirdre Wilson: When all was over and the rival kings were Sperber, D., and D. Wilson. 1995. Relevance: Communication and
celebrating their victory with Te Deums in the respective camps Cognition. 2d ed. Oxford: Blackwell.
408
Language, Natural and Symbolic
L language is not. No one without the idea of reading and writing

could learn the language of mathematics. Symbolic language is
essentially written.
A second, related diference between the two sorts of language
LANGUAGE, NATURAL AND SYMBOLIC is that natural language is enormously versatile. One can do all
he paradigm of language is natural language, a naturally evolved sorts of things in and with natural language. Symbolic languages,
system of human communication using spoken or signed words by contrast, are speciically designed for particular purposes and
according to the various ways they can be combined. But in an are useless for others. One cannot tell a joke or a story, describe
extended sense, many species of nonhuman animals also have a room (or a language), or even greet a friend in the formula lan-
language, in this case, a means of communication through inar- guage of arithmetic.
ticulate sounds; and humans similarly have various nonver- Natural language is also constantly evolving through its use;
bal means of expression and communication through facial it is inherently social and deeply historical. Symbolic language is
expressions, gestures, and body language more generally, and instead self-consciously created, often by a single individual, and
through music, dance, and the imitative arts (see animal com- has no inherent impulse to change with use. Although French
munication and human language; art, languages of; has changed considerably over the past four centuries, the lan-
communication, prelinguistic). For more than a thou- guage of algebra that Descartes introduced in 1637, and which
sand years, we have also had the positional Arabic numeration every schoolchild learns today, has changed not at all.
system, together with its algorithms for the basic arithmetical A subtler diference concerns the characteristic sorts of con-
operations, a symbolic language that was extended in the sev- cepts of the two languages. he concepts of natural language
enteenth century to include the literal notation of algebra as are paradigmatically object-involving and, for that reason, sen-
well. he twentieth century, inally, saw the development of very sory: we talk, irst and foremost, of the objects met with in every-
powerful formal languages, for example, the language of math- day experience, for example, of the things we eat, navigate by,
ematical logic and programming languages, both of which are enjoy, and fear. Although our concepts of such things do involve
constituted by a collection of signs together with rules governing much more than the way they appear sensorily to us in appropri-
their use, that is, by a syntax but not, at least not essentially, a ate circumstances, they could not be understood in abstraction
semantics (see artificial languages). from those appearances, that is, from the subjective character of
Although natural language is the paradigm of language, the our experience of them. he concepts of natural language (those
foregoing suggests a continuum of sorts: irst with systems of owing nothing to the development of symbolic language) are con-
animal and human nonverbal communication, then natural cepts of sensory entities, of things that look, feel, taste, smell, and
languages, followed by the symbolic language of arithmetic and sound in characteristic ways (compare embodiment). Insofar
algebra, and inally programming and other formal languages. as they are, they resist expression in a symbolic language.
Given their centrality, both along this continuum and in our Consider, for example, the notion of a sphere. According to
lives, the focus here is on natural language and the symbolic lan- Aristotle, a sphere is a common sensible; it is an object that
guage of arithmetic and algebra. contrasts with a proper sensible, such as the color red or a
Natural language and the symbolic language of arithmetic certain bitter taste, in being perceptible not merely through one
and algebra share two fundamental, and related, features. Both sense organ (say, by sight as color is, or by taste as bitterness is)
are vehicles of inquiry and knowing, and both are a medium for but through more than one. Red things have a characteristic
the expression of what we know. Not only do we learn from expe- look; spheres have a characteristic look and a characteristic feel.
rience as animals can but we also, in virtue of our immersion in Aristotles concept of a sphere is not the modern, mathemati-
natural language, can relect on what we learn, wonder at how cal, and nonsensory concept of a two-dimensional surface all
we learn, and question whether we really know what we seem to points of which are equidistant from a center; it is the concept
know. Because we reason not only implicitly and involuntarily as of a three-dimensional, essentially sensory object. he mod-
other animals do but also explicitly, in words, we can question ern mathematical concept of a sphere can be expressed in the
our reasons and strive to discover better ones. Just the same is symbolic language of mathematics (namely, as x2 + y2 + z2 = r2);
true of the language of arithmetic and algebra by means of which Aristotles essentially sensory concept cannot.
we discover, for example, negative, rational, real, and complex We have seen that a symbolic language, unlike a natural lan-
numbers, explore their manifold natures, and display in familiar guage, is self-consciously created and expresses concepts that
symbols that which we know. hough for very diferent reasons, can be fully understood in abstraction from our sensory experi-
neither systems of nonverbal communication nor merely formal ence. It is unsurprising, then, that the rules governing the use of
languages can serve in this way as a vehicle of critically relective signs in a symbolic language can be explicitly formulated and,
inquiry. hence, that it is relatively easy to build a machine that correctly
Natural language and the symbolic language of arithmetic manipulates those signs. Because natural language is instead
and algebra are also very diferent, however. Natural language social, historical, essentially sensory and object-involving, mak-
is, for example, primarily spoken (or signed) and a vehicle of ing the rules of its use explicit (in order, perhaps, to build a
communication between a speaker and a hearer; the symbolic machine capable of correctly manipulating its signs) is an alto-
language of mathematics is instead essentially written and serves gether diferent, and much harder, possibly intractable, prob-
primarily as a vehicle of reasoning. Spoken natural language is lem. It may be that natural language users can only be grown or
intelligible independent of written natural language; symbolic raised, that they cannot be built.
409
Language, Natural and Symbolic Language Acquisition Device (LAD)
he fact that natural language embodies a particular sensory with no inherent impulse to change. he two sorts of language
view of the world, one that is inextricably tied to human biol- are, then, quite diferent. And yet most work in the language sci-
ogy, has other implications as well. Our common experience, ences is pursued on the assumption that symbolic language dif-
grounded in our common (biological) form of life, explains, for fers from natural language only in its degree of clarity, rigor, and
example, the intertranslatability of all human natural languages perspicuity. We may, as a result, fail adequately to understand
and predicts the untranslatability of the natural languages of either. By importing considerations relevant only to the work-
creatures evolved to have radically diferent (biological) forms ings of a symbolic language, we are liable to falsify the work-
of life. Given the role of acculturation in the acquisition of natu- ings of natural language; and by taking natural language as our
ral language, any being capable of learning a particular natural paradigm, we risk radically misconceiving the nature of symbolic
language must share at least some sense modalities with other language. (It has, for example, long been assumed that the con-
speakers of the language; but nothing in the very idea of accul- ception of generality, or quantification, that is needed in
turation into a natural language requires that there be some mathematics is identical to that employed in natural language,
favored sense modalities. Symbolic language is not similarly but perhaps this is just not so.) If natural and symbolic languages
rooted in our bodily being. Not only are symbolic languages are essentially diferent, then the language sciences need to take
universal to all people whatever their natural language; they those diferences into account, showing how they do, or do not,
are in principle, at least along this dimension, accessible to any matter to research programs in those sciences.
rational being. here are further ramiications for psychology and for peda-
Why, then, is translation from one natural language to gogy. For example, given the diferences between natural and
another so diicult? Ludwig Wittgenstein suggests an answer symbolic language, it is reasonable to ask whether one reads
in his Philosophical Investigations. As he points out, although symbolic languages diferently than one reads written natural
some of our everyday concepts can be adequately understood by language, whether one looks at the page of marks diferently in
reference to a standard or paradigm case, most (including lan- the two cases. If one does, then perhaps what prevents some
guage) are correctly applied to a range of things that exhibit only students from thriving in mathematics is that they try to read its
a family resemblance. Whereas in the former instance all symbolic language as if it were a written natural language. Maybe
correct applications refer back to the one standard or paradigm students who are adept at modern, symbolic mathematics are, in
case, in the case of concepts that exhibit only a family resem- fact, primarily adept at catching on, without explicit instruction,
blance, similarities between correct applications need only over- to the way of writing and reading it involves. hese are testable
lap, like ibers twisted one on another over the length of a thread. hypotheses, but they are hypotheses we will think to test only if
Such similarities are essentially historically conditioned: Some we comprehend the diferences between natural and symbolic
applications will, at a particular moment in the evolution of the languages.
language, seem appropriate, natural; others will not work so
Danielle Macbeth
well, or at all. And because what works crucially depends on pre-
vious successful applications, diferent natural languages come
to employ common words in quite diferent ways. here is noth-
ing like this in the case of symbolic language. Although the words Gupta, Anil. 1980. he Logic of Common Nouns. New Haven, CT: Yale
may be borrowed from natural language (e.g., group, ring, University Press. Argues for the two sorts of predication needed in
and ield in abstract algebra), the concepts of a symbolic lan- speaking about change.
Macbeth, Danielle. 2004. Vite, Descartes, and the emergence of mod-
guage have a ixed content and univocal application. It is only
ern mathematics. Graduate Faculty Philosophy Journal 25: 87117.
our understanding of those concepts that develops and deepens
Explores diferences between premodern, nonsymbolic and modern,
in the course of inquiry. he nature and structure of meaning is symbolic mathematical understanding.
quite diferent in the two sorts of language. . 2005. Freges Logic. Cambridge: Harvard University Press. Argues
he two sorts of language tend to difer, inally, in their that Gottlob Freges language, unlike the language of mathematical
predicative structure: a natural language is constitutively logic, which is merely formal, without content, is a fully contentful
narrative, a language within which to tell what happens; a symbolic language, a vehicle of inquiry.
symbolic language is not. And natural language is narrative Wittgenstein, Ludwig. 1953. Philosophical Investigations.
because the everyday world is a world of becoming, of change. Oxford: Blackwell. Deeply insightful investigation into the nature of
To speak of it, then, requires two modes of predication, that natural language.
marking what something is and that marking properties a thing
can acquire and lose and also (as a matter of its form, not merely
LANGUAGE ACQUISITION DEVICE (LAD)
in its content) tense. Symbolic languages generally involve nei-
ther diferent modes of predication nor tense. he language of his term refers to Noam Chomskys early proposal of what
mathematics, for example, speaks timelessly of what is: 7 + 5 = 12, would be necessary for construction of a language acquisition
(a + b)2 = a2 + 2ab + b2. model (1965, 303). More precisely, this early proposal for a
Natural language is primarily spoken and communicative, language acquisition device (LAD) provides a logical analysis
narrative, essentially sensory and object-involving, and histori- of the components that would be minimally necessary for any
cal, that is, constantly evolving with use. A symbolic language, such model to account for language acquisition. Its components
such as the language of arithmetic and algebra, is essentially are summarized in Figure 1. hese can be viewed as specifying
written, non-narrative, nonsensory, and self-consciously created logically and abstractly what would be minimally necessary to
410
Language Acquisition Device (LAD)
(i) Technique for representing input signals
(ii) Way of representing structural information about these signals
(iii) Some initial delimitation of a class of possible hypotheses about language structure
(iv) Method for determining what each such hypothesis implies with respect to each sentence
(v) Method for selecting one of the (presumably infinitely many) hypotheses that are allowed by
(iii) and are compatible with the given primary linguistic data.
Figure 1. LAD components (Chomsky 1965, 30).
the child as language learner in the initial state, that is, before measure for (v) required speciication. he absence of a temporal
language experience. or developmental component was especially challenging, given
hese components, which were proposed as necessary for that an instantaneous view of language acquisition is obvi-
language learning, were formulated as accounting for the map- ously false (Chomsky 1975, 119, 121).
ping from primary language data (data which is necessarily Subsequent formulations of linguistic theory that pursue
inite and inherently variable) presented at the initial state to a generative approach now ofer a number of developments
complex, ininite, and systematic language knowledge in the end of the original LAD proposal. In general, they seek to deine
state. hey characterized the foundations of the speciic innate a theory of universal grammar (UG) as a theory of the ini-
abilities that make this achievement possible for the language tial state. Deined as an abstract partial speciication of the
learner (Chomsky 1965, 27). genetic program that enables the child to interpret certain
At the same time, the components in Figure 1 were to indicate events as linguistic experience and to construct a system of
what a linguistic theory would have to treat in order to support rules and principles on the basis of this experience (Chomsky
an acquisition model and thus attain explanatory adequacy 1980, 187) and as of course, not a grammar, but rather a sys-
(see descriptive, observational, and explanatory tem of conditions on the range of possible grammars for pos-
adequacy). For example, such a theory would need to pro- sible human languages (Chomsky 1980, 189), UG opens the
vide a representation of possible sentence in order to support possibility for developing future integration of logical and real-
(i), a deinition of structural description for (ii), generative istic approaches to language acquisition and language devel-
grammar for (iii), a method for determining structural descrip- opment. a principles and parameters version of UG, for
tions for (iv), and an evaluation metric for (v). example, formulates parameters as providing a speciic mech-
he original Chomsky proposal is often interpreted as though anism for (i)(v). (See Table 4.4 in Lust 2006 for examples of
it described a realistic device. However, it is best viewed as pro- current approaches to modeling language acquisition within
viding an explication of the logical foundations that any compre- this framework.)
hensive model for language acquisition would have to presuppose Combining these and subsequent theoretical advances with
and account for, with the precise nature of each of the compo- signiicant recent empirical advances concerning infant map-
nents to be subsequently determined as an empirical matter ping to language data (e.g., Jusczyk 1997; Morgan and Demuth
(Chomsky 1965, 37). Perhaps at least partially because of this 1996), which informs us on actual mechanisms in (ii) and (iii),
divergence in interpretation, subsequent language acquisition may now allow the ield to approach the more comprehensive
research over the last decades has frequently pursued divergent model of language acquisition that LAD was irst intended to
paths, one pursuing the logical problem of language acquisi- underlie (see discussion in Chomsky 2000). See Chapter 4 in Lust
tion consisting of linguistic analyses of potential data mapping 2006 for further discussion of this LAD-based model and a review
(e.g., Baker and McCarthy 1981), another pursuing a realistic of several proposed alternatives to the LAD, derived from oppos-
approach consisting of empirical studies of language developing theoretical paradigms.
ment (initiated largely by Roger Brown and his students [1973]). Barbara Lust
In fact, certain components in Figure 1 proved particularly
challenging to a realistic model. For example, if there were a real- WORKS CITED AND SUGGESTIONS FOR FURTHER READING
istic device, it would have to include a mechanism for (i) and (ii).
Baker, C. L., and J. McCarthy, eds. 1981. he Logical Problem of Language
If (iii) were to be implemented in terms of an enumeration of
Acquisition. Cambridge, MA: MIT Press.
the class G1, G2 of possible generative grammars (Chomsky
Brown, R. 1973. A First Language. Cambridge: Harvard University Press.
1965, 31), then there is the de facto impossibility of predeining
Chomsky, N. 1965. Aspects of the heory of Syntax. Cambridge, MA: MIT
given grammatical hypotheses across 6,000 to 7,000 actually Press.
potentially innumerable human languages, as well as the risk . 1975. Relections on Language. New York: Pantheon.
of begging the question of language acquisition, that is, how the . 1980. Rules and Representations. New York: Columbia University
individual grammars arise. Similarly, the nature of an evaluation Press.
411
Language Change, Universals of
. 2000. he Architecture of Language, ed. J. Mukherji, B. Narayan Sound Change

Patnaik, and R. Agnihotri. Oxford: Oxford University Press. he symmetry of the terms used to describe sound changes
Jusczyk, P. 1997. he Discovery of Spoken Language. Cambridge, MA: MIT
assimilation/dissimilation, weakening/strengthening, deletion/
Press.
insertion makes it seem as though all directions of change were
Lust, B. 2006. Child Language: Acquisition and Growth.
equally probable. he facts now known from a wide array of lan-
Morgan, J., and K. Demuth, eds. 1996. Signal to Syntax: Bootstrapping guages show instead that assimilation, weakening, and deletion
from Speech to Syntax in Early Acquisition. Mahwah, NJ: Lawrence are vastly more common than dissimilation, strengthening, and
Erlbaum. insertion. If sound change is rigidly deined as language internal,
phonetically motivated, and lexically general, then the less com-
mon types of change show up even more rarely.
LANGUAGE CHANGE, UNIVERSALS OF
Common assimilation changes involve consonants taking
All aspects of language can change, and with every aspect we see on the properties of adjacent vowels, for instance, palataliza-
broad tendencies that have inspired the search for universals of tion before front vowels, labialization or rounding before round
change. In some areas, powerful universals have emerged, sug- vowels. Or vowels can take on properties of consonants, as
gesting that many of the synchronic similarities we see across when vowels become nasalized in the same syllable with nasal
languages may have their source in shared cognitive and process- consonants. For some of these changes, more detailed hierar-
ing mechanisms. Researchers have identiied cross-linguistically chies of contexts for change can be established. Palatalization
similar paths of change in sound change in morphological of coronal consonants (such as [t], [d], [s]) occurs irst and most
change, and in grammaticalization. commonly before a high front glide ([j]), occurs next before a
Given any two related items, A and B, one could in principle high front vowel ([i]), and then progresses to the lower front
expect a change from A > B or from B > A to be equally prob- vowels.
able. However, the facts of change show that this is not the case. Paths of reductive change can also be established. For
Across the languages of the world, we ind that one direction instance, a common consonant reduction involves the loss of [p]
predominates and the other is rare. hus, we speak of unidirec- via the pathway p > f > h > . Parts of this path are documented
tionality in language change. Claims about unidirectionality are in diferent languages: Japanese has undergone a change that
sometimes controversial because apparent counterexamples reduced all prevocalic instances of /p/ to a fricative that assimi-
do emerge. For this reason, it is important to deine types of lates to the place of articulation of the following vowel. Spanish
change clearly before making proposals about the directionality and other Romance languages have undergone a change that
of change. reduced word-initial [f] to [h] and further to . he positions in
he importance of such unidirectional trends cannot be over- which such reductions occur also show regularity: Syllable-inal
stated. Universals attested in the diachronic plane appear to be position favors the reduction of consonants, as does intervocalic
more powerful than those that can readily be stated for the syn- position. Word-initial position is least likely to condition reduc-
chronic plane. hus, diachronic typology is a theory of universals tion of consonants.
that proposes that the synchronic patterns are not in themselves It is commonly assumed that such changes are caused by
the universals but, rather, the result of strong diachronic trends phonetic tendencies. In the articulatory domain, some general
(Greenberg 1969, Greenberg, Ferguson, and Moravcsik 1978). principles of change have been proposed that deal with the way
he structure of language is created by change that is ongoing in articulatory gestures change during production. he primary
language use (see usage-based theory); because languages directions in phonetic change are the reduction of the magnitude
are used by human beings in very similar ways across cultures, of the gestures, leading to reduction or loss, and their increased
languages tend to be similar to one another. overlap, leading to assimilation (Browman and Goldstein 1992;
An important question to ask concerns the sense in which Mowrey and Pagliuca 1995).
there can be universals of change. No change has to occur; there As mentioned earlier, the importance of universals of language
are many changes that could occur. he universals, then, specify change is that they can predict and thus explain synchronic pat-
only similar paths of change that can be found in diferent lan- terns across languages. For instance, the presence of nasal vowels
guages. If these languages are not closely related genetically, in a language is almost always due to assimilation to a nasal conso-
then the similar paths of change cannot be attributed to a shared nant. Sometimes this consonant is lost, leading to phonemic nasal
history but must be viewed as independent developments. hus, vowels. he diachronic source explains why nasal vowels always
even though we cannot say that a change has to occur, nor can have a more restricted distribution and frequency compared to
we predict when it will occur, there are still many substantive oral vowels (Greenberg 1966). Similarly, the fact that some lan-
predictions that we can make about change. guages lack the phoneme [p] can be explained by its tendency to
In examining trends in language change, it is important to undergo reduction. he restrictions against certain consonants
consider how well documented a particular change is. Some in syllable-inal position can be attributed to their propensity for
changes are reconstructed on the basis of a comparison of loss in that position (Vennemann 1988, Blevins 2004).
related languages (see comparative method ). Since known
trends in change are often used in this reconstruction, such After Sound Change: Morphologization
changes cannot be taken as evidence for trends in change. Only Another set of unidirectional changes involves the results of
well-attested changes are valid inputs to a theory of universals sound change. Phonetically conditioned sound change creates
of change. alternations that gradually acquire morphological or lexical
412
Language Change, Universals of
conditioning, as, for example, when vowel shortening before a habeo meant I have to say, In medieval Spanish, the verb had
consonant cluster created an alternation in English keep/ kept, reduced to he and consistently followed the ininitive. Eventually
sleep/slept and weep/wept. the former verb (now a grammatical word) fused with the inini-
Another example is the alternation in nouns such as wife, wives. tive, giving decir. In further changes, the stem of some verbs lost
At irst, in Old English there were no voiced fricatives: /v/, /z/, and a syllable, in our example creating dir I will say. he last stage
// did not occur. Later, by a regular sound change /f/, /s/, and (loss) is occurring in this case as a new future from ir a + verb go
// became voiced when they occurred between two vowels, as to + verb replaces the old future.
in the plurals of house, wife and wreath (at that time the plural Parallel to this path of change are the many paths of semantic/
suix had a vowel in it, putting the fricative between two vow- pragmatic change that have been identiied as creating the gram-
els). Nowadays, the alternation is not phonetically productive. In matical morphemes of the languages of the world. he numeral
words such as classes, efort, and ether, voiceless fricatives occur one with a noun tends to develop into an indeinite article; demon-
between vowels. Also /v/, /z/, and // have become phonemes stratives develop into deinite articles and complementizers; verb
and they occur in all positions. So now the alternation still found constructions involving a verb meaning want or go to plus
in wife, wives, and so on is associated with certain nouns and the another verb develop into future markers; resultative construc-
morphological category of plural. In this way, morphologically or tions with have or be and a past or passive participle, such as I
lexically conditioned alternations are created; such alternations have gone, develop into perfect, pasts, and perfectives; construc-
tend to be unproductive phonologically and to involve segments tions with locative verbs or movement verbs develop into progres-
that are contrastive elsewhere. sives, which may go on to become imperfectives. Verbs meaning
know or be able develop into auxiliaries indicating possibility.
Morphological Change: Analogy A preposition meaning to or in order to develops into an inin-
Morphological and lexical alternations that are created in the way itive marker. Body-part terms such as head and back become spa-
just described tend to undergo further change on the basis of the tial and later temporal adpositions. Passive constructions develop
patterns found in the paradigms of the language. Certain gen- into ergative constructions. All of these paths of change (and many
eral trends in analogical change have been identiied (Mazcak others) are documented as independent developments in unre-
1980). In analogical leveling, when an alternation is lost, it is lated languages (Bybee, Perkins, and Pagliuca 1994).
the high frequency form of the paradigm, such as the singular In comparing these speciic paths, certain general patterns
in nouns or the present indicative in verbs, that serves as the are discernible: As a construction grammaticalizes, its mean-
basis for new formations. hus, the alternation weep/wept is lev- ing becomes more general and abstract; its form becomes more
eled by the creation of a new past tense, weeped. Low-frequency reduced and dependent upon surrounding material; it under-
words are more likely to undergo analogical leveling than high- goes an extreme frequency increase; and its category member-
frequency words. hus, wept is more likely to change than kept. ship can change, say, from verb to auxiliary, from noun or verb
he productive pattern that prevails in leveling is the one that has to adposition. he lexical material upon which grammaticaliza-
the largest number of members. One synchronic consequence tion works is similar across languages, as are the mechanisms of
of these trends is that the irregular paradigms in a language are change. he fact, for instance, that one can infer intention from
usually among the most frequent. a construction of be going to VERB and then later infer future
Extension of an alternation to a paradigm that did not have it from that intention seems not to be culture speciic, inasmuch
before is less common than is leveling. It occurs where there is as the development of such a construction into future occurs in
strong phonological similarity to a productive pattern, as when languages all over the world. hus, the commonalities found in
the regular verb string gets a new past tense strung due to its simi- grammaticalization point to interactive, cognitive, and process-
larity to swing, swung. ing mechanisms that are shared across cultures.
Joan Bybee
Morpho-Syntactic Change: Grammaticalization
Most change in the morpho-syntax is a product of the general pro-
cess of grammaticalization. Grammaticalization itself, however, WORKS CITED AND SUGGESTIONS FOR FURTHER READING
involves phonetic, semantic, and pragmatic change, in addition to Blevins, Juliette. 2004. Evolutionary Phonology. Cambridge: Cambridge
morpho-syntactic change. Various cross-linguistic paths of change University Press.
have been identiied in recent research into grammaticalization. Browman, Catherine P., and Louis M. Goldstein. 1992. Articulatory pho-
First, there is a general path of change for the form of the gram- nology: An overview. Phonetica 49: 15580.
maticalizing element as it progresses from lexical to grammatical: Bybee, Joan, Revere Perkins, and William Pagliuca. 1994. he Evolution of
Grammar: Tense, Aspect and Modality in the Languages of the World.
content item > grammatical word > clitic > inlectional aix > Chicago: University of Chicago Press.
stem change > loss Greenberg, Joseph. 1966. Language Universals: With Special Reference to
Feature Hierarchies. he Hague: Mouton.
his progression involves a loss of the properties originally
. 1969. Some methods of dynamic comparison in linguistics.
associated with the content word (its ability to occur as a noun In Substance and Structure of Language ed. Jan Puhvel, 147203.
or verb) and a growing fusion of the element with lexical items Berkeley: University of California Press.
nearby. A good example is the development of the Spanish future Greenberg, Joseph H., Charles Ferguson, and Edith. Moravcsik, eds.
suixes. In Latin, there was a construction that consisted of an 1978. Universals of Human Language. Vols. 14. Stanford, CA: Stanford
ininitive with the conjugated verb habere. For example, dicere + University Press.
413
Language Families
Maczak, Witold. 1980. Laws of analogy. In Historical Morphology ed. he common ancestor of an entire language family is assigned
J. Fisiak, 2838. Berlin: Mouton de Gruyter. a name by preixing Proto- to the name of the family, as in Proto-
Mowrey, Richard, and William Pagliuca. 1995. he reductive character Indo-European and Proto-Afro-Asiatic. he place where the pro-
of articulatory evolution. Rivista di Linguistica 7.1: 37124. tolanguage was spoken is called the homeland of the language
Vennemann, heo. 1988. Preference Laws for Syllable Structure and the
family.
Explanation of Sound Change. Berlin: Mouton de Gruyter.
he study of language families is part of historical lin-
guistics and is contextualized within a particular model of
language change: divergence. When innovations in one part
LANGUAGE FAMILIES
of a language community fail to spread to other parts, diferences
A language family is a set of languages that developed from the accumulate until the community can be said to speak diferent
same ancestral language. he best-known example is the Indo- languages. It is this historical process that language-family the-
European family, which comprises more than a hundred lan- ory is meant to model. But perhaps because language families
guages that, even in premodern times, extended from the Indian are commonly illustrated by showing similarities between lan-
subcontinent to northwestern Europe. his family is well known guages (e.g., English mouse is cognate with Latin mus), the idea
not only because it contains many of the worlds most widely arises that relatedness is about similarity between languages. In
spoken languages, such as Bengali, English, French, German, fact, there is no requirement that cognates be similar at all (e.g.,
Hindi, Portuguese, and Russian, but also because it was the main English two is related to Armenian yerku), and many sources of
focus of research in the nineteenth century, when linguistics similarity are disavowed as being irrelevant to the model. hese
was established as a modern science. However, many other lan- include borrowing (see contact, language), onomatopoeia,
guage families are subjects of intense research today, such as the universals (absolute and statistical universals), and
following: chance similarities.
he study of language families typically involves one or more
Afro-Asiatic, which includes Arabic, Hebrew, and several
of the following enterprises:
languages of northern Africa, including Ancient Egyptian,
Hausa, and Somali Demonstrating that languages are related
Algic, which includes several native languages of North Reconstructing the common protolanguage
America, such as Wiyot, Yurok, Cheyenne, Ojibwa, and Subgrouping the languages by hypothesizing intermediate
Shawnee ancestors
Austronesian, which includes more than a thousand lan- Associating linguistic data with historical and archaeological
guages spoken from Madagascar to Polynesia, such as Bahasa data
Indonesia, Fijian, Tagalog, and Tahitian
he following sections irst describe the traditional and still-
Dravidian, which includes most of the non-Indo-European
dominant methods for pursuing these tasks and then sketch and
languages of India, such as Malayalam, Tamil, and Telugu
evaluate some new methodologies.
Niger-Congo, which includes most of the languages of sub-
Saharan Africa, including Igbo, Swahili, and Zulu
Sino-Tibetan, which includes Burmese, Chinese, and Tibetan
Traditional Methods
he traditional technique is the comparative method. he
Tupi, comprising several languages of South America, includ-
linguist studies characteristics that rarely recur across languages,
ing Guarani
such as grammatical paradigms and the associations between
Uralic, which includes most of the non-Indo-European lan-
sound and meaning in morphemes. Eforts are made to discard
guages of Europe, such as Estonian, Finnish, Hungarian,
loans and onomatopoeia, although the former is a diicult and
Nenets, and Sami (Lapp)
often intractable problem. Matching morphemes across lan-
his list is only a sample of the hundreds of known language guages by meaning, one looks for recurrent sound correspon-
families and of the languages included in each one. For a com- dences. For example, English f corresponds to Latin p in father
prehensive listing, see Gordon (2005). = pater, feels = palpat, few = pauca, and many other words. If a
Familial metaphors are the standard terms of art. Languages in large number of recurrent correspondences are found, the lan-
the same family are said to be genetically related to one another. guages are related. he recurrences are also used to reconstruct
A language from which other languages developed is called an the protolanguage (see historical reconstruction).
ancestor, or parent, of those languages. Words that descend from After a language family is identiied, the next step is sub-
the same form in an ancestral language are related, or cognate. grouping, identifying the branches or groups within the fam-
his homey terminology is undergoing some competition with ily. Subgrouping seeks to uncover the history of the divergence
that of modern cladistics as used in biology, but certain linguistic (cladogenesis) of a language family. If the family contains three
concepts do not translate well. In biology, it is understood that or more languages, the linguist looks for evidence that some
all biological taxa are related to one another, and family is but proper subset of those languages may have descended from
a midlevel taxon. Linguists assume that relationships between an intermediate common ancestor. his is done by looking for
languages must be proved, and a language family is a maximal shared innovations (synapomorphies) sound changes or new
taxon. In principle, even isolates languages that do not group words or grammatical constructions that were not in the ances-
with other languages can trivially be considered families by tor language but are found in two or more of the descendant
reinterpreting their dialects as separate languages. languages. For example, the fact that English, German, Swedish,
414
Language Families
and several other languages have f where Proto-Indo-European is often left with unlikely cladograms like the 15-way branching
had p is a shared innovation that indicates that those languages tree of Indo-European.
may have a shared intermediate ancestor that underwent this Another disappointment is that little progress has been made
change; otherwise, we would have to assume that each of those in the past century in pinning down the Indo-European home-
languages separately innovated the change of p to f or borrowed land or proving that additional languages are related to English
the innovation from another language. In fact, the preponder- topics of recurrent interest among linguists, archaeologists, and
ance of evidence supports such an intermediate language and enthusiasts alike. More disappointing is that when linguists have
a branch (clade) of languages descending from it: the Germanic claimed that language families such as Uralic are related to Indo-
languages. Other branches of Indo-European include the Balto- European such groupings often being given the Eurocentric
Slavic (including Bosnian, Lithuanian, Polish, and Russian), name Nostratic (our family) the methodology has given no
Celtic (including Breton, Irish, and Welsh), Italic (including Latin irm guidance as to how signiicant the evidence is, with the
and the Romance languages), Indo-Iranian (including Bengali, result that many linguists ind themselves uncomfortably agnos-
Farsi, Pashto, and Urdu), and the extinct Anatolian branch, tic on whether Nostratic has been proved or not. Unlike in mod-
which included Hittite, Luvian, and Lycian. Several other lan- ern experimental sciences, there are no statistical techniques for
guages, including Greek, Albanian, Armenian, and half a dozen estimating the probability that the number of correspondences
extinct languages (see extinction of languages), do not found is due to a real relationship between languages rather
share an agreed-upon intermediate branch at all. than to chance. Rules of thumb were developed to provide some
Associating language history with external facts entails pinning guidance; a typical piece of advice is to treat words as potentially
the protolanguage to a particular time and place its homeland cognate only if at least three of their consonants are found in
and demonstrating how it spread from there. he time depths recurrent correspondence sets. But such rules are very approx-
under consideration mean that written records are rarely, if ever, imate, not tailored to the speciic structures of the languages at
available. he primary linguistic tool is to look for words found hand, and they have discouraged linguists from applying the
in multiple branches of a language family and exhibiting all of method to languages with short morphemes.
the regular sound correspondences; they are assumed to date Joseph H. Greenberg addressed several of these concerns
back to the protolanguage and, therefore, to name objects found with a technique called multilateral comparison. Tables are con-
in its environment. For example, a pan-Indo-European word for structed listing the translation equivalents for many concepts in
wheel suggests that the protolanguage split up no earlier than many diferent languages. It is claimed that the tabular layout
the invention of the wheel, some six thousand years ago (Mallory itself makes the relationships among the languages, even their
1989). Another technique is to look for areas of greatest linguis- correct subgrouping, patent. Using this technique, Greenberg
tic diversity. he fact that the Austronesian languages are much presented an analysis of the languages of Africa (1963), which is
more diverse in Taiwan than anywhere else supports the theory now considered standard, then went on to hypothesize language
that they developed there longest, that is, that Taiwan was the families that lumped established families together into much
homeland for the Austronesian family (Blust 1999). A third tech- larger families what became known as deep linguistic relation-
nique is to seek archaeological evidence of population move- ships. he dozens of families and isolates of the Americas were
ments that may have disseminated a language family. In the reduced to three families (Greenberg 1987); Indo-European,
case of Austronesian, knowledge of how people spread through Uralic, Japanese, and several other families were lumped into a
the Paciic and Indian Oceans is consistent with the theory of family called Eurasiatic (Greenberg 2002). Multilateral compari-
a Taiwan homeland. In the case of Indo-European, it has often son has proven popular among enthusiasts, in part because it
been noted that early adopters of horse-drawn wheeled chariots requires no special language expertise, in part because it appears
would be in an ideal position to spread their languages through- to reveal many new, deep relationships. Unfortunately, there is
out much of Europe and Asia. A well-received theory points to no way to evaluate a methodology that simply calls for contem-
the chariot users who lived in the PonticCaspian region about plating raw data until patterns emerge.
six thousand years ago (Mallory 1989). Several researchers have shown, however, that some of
Greenbergs key ideas can be transformed into algorithmic
Challenges to the Traditional Method (reproducible) methodologies that introduce to language fam-
he traditional comparative method is still the basic framework ily research the beneit of statistical signiicance testing. Robert
within which language families are researched, but it is not per- L. Oswalts procedure (1998) minimized experimenter bias by
fect. It is a complicated process that demands a great deal of requiring that a speciic concept list be used and that one specify
knowledge about all of the relevant languages. It can be misled in advance speciic criteria for measuring degree of similarity
by loanwords, and it ofers little guidance in distinguishing true between two languages. William H. Baxter and Alexis Manaster
shared innovations (synapomorphies) from independent iden- Ramer (2000) added reliable signiicance-testing procedures
tical innovations (homoplasies). he linguist must constantly based on randomization tests. Brett Kessler and Annukka
decide whether multiple languages could have undergone a Lehtonen (2006) adapted the technique to handle multiple lan-
particular change independently and how likely they would be guages in a single test, informally conirming Greenbergs claim
to have borrowed it. In reality, of course, anything that happens that such large-scale comparisons are inherently more power-
once can happen twice, and there is nothing that is not subject to ful than two-language comparisons. Don A. Ringe (1992; see
borrowing (homason and Kaufman 1988). he true solution is Kessler 2001 for extensive discussion and methodological reine-
probabilistic, but hard numbers are lacking, and the investigator ments) measured not similarity but the number of recurrent
415
Language Families
sound correspondences. his has the advantages both of being there must come a point at which any remaining commonalities
closer to the traditional comparative method and of generating between languages are indistinguishable from chance levels.
correspondences useful for subgrouping and reconstruction. But even if the more pessimistic predictions are true and new
Disappointingly, however, none of these neo-Greenbergian methods are unlikely to greatly expand intensively studied fami-
techniques found evidence for the deep relations that were lies like Indo-European, they may greatly ease new analyses of
advertised for the original, impressionistic method. lesser-known languages.
Other new techniques have concentrated on subgrouping. New computerized cladistic methods are, likewise, already
Lexicostatistics (Swadesh 1955) was an early attempt to facilitate aiding the analysis of complex language families and are provid-
subgrouping and also assign dates to protolanguages. he idea ing Indo-Europeanists food for thought. However, the develop-
was that if languages replace a constant number of words per ment and application of such algorithms could beneit from the
century with new words, then by measuring the percentage of compilation and deployment of data about the probability of
a list of words that is cognate between languages, one could cal- various types of linguistic innovations and borrowings.
culate when the languages diverged and even construct a family To date, the new methodologies have not been adopted
tree. Although these assumptions were mostly wrong and were by most practitioners. While it is easy to fault established
therefore rejected by most linguists, many people still use lexi- researchers for conservatism, it is also true that quantita-
costatistical techniques as a rough indication of a languages his- tive methods typically cannot take into account the diverse
tory in the absence of more compelling data. types of information that linguists are accustomed to reason-
Arranging many shared innovations into a binary tree is an ing with. Fortunately, the emerging partnerships between lin-
extremely laborious undertaking, especially given the possibil- guists and cladists should help bridge the gap between old and
ity that some identical innovations are independent (homoplas- new approaches and lead to the widespread adoption of hybrid
tic). he recent development of computational cladistic methodologies.
methods similar to those used in biology (e.g., Ringe, Warnow,
Brett Kessler
and Taylor 2002) is a tremendous advance in helping the lin-
guist ind optimal trees. In addition, several solutions to the
problem of borrowing have emerged in the form of programs
that construct networks instead of trees. Shared innovations Baxter, William H., and Alexis Manaster Ramer. 2000. Beyond lump-
that cannot be cleanly attributed to a shared ancestor are taken ing and splitting: Probabilistic issues in historical linguistics. In Time
as evidence of contact, obviating somewhat the need to make a Depth in Historical Linguistics, ed. Colin Renfrew, April McMahon,
and Larry Trask, 16788. Cambridge, UK: McDonald Institute for
priori judgments about whether borrowing was involved (e.g.,
Archaeological Research.
Bryant, Filimon, and Gray 2005; Nakhleh, Ringe, and Warnow
Blust, Robert. 1999. Subgrouping, circularity and extinction: Some
2005). issues in Austronesian comparative linguistics. In Selected Papers
he problems of inding homelands and tracing the spread from the Eighth International Conference on Austronesian Linguistics,
of languages still requires one to resort to data that are often ed. E. Zeitoun and P. J. K. Li, 3194. Taipei: Academia Sinica.
suggestive but not deinitive. Colin Renfrew (1987) added a new Bryant, David, Flavia Filimon, and Russell D. Gray. 2005. Untangling
perspective when he theorized that languages may be spread our past: Languages, trees, splits and networks. In he Evolution of
by the movement of culture, rather than by the movement of Cultural Diversity: A Phylogenetic Approach, ed. Ruth Mace, Clare J.
people. He suggested that Indo-European languages were Holden, and Stephen Shennan, 6985. London: UCL Press.
spread from Anatolia along with the adoption of agriculture. Cavalli-Sforza, Luigi Luca, Paolo Menozzi, and Alberto Piazza, 1994.
Most linguists have not accepted this theory, in part because it he History and Geography of Human Genes. Princeton, NJ: Princeton
University Press
is incompatible with such linguistic data as an Indo-European
Gordon, Raymond G., Jr., ed. 2005. Ethnologue: Languages of the World.
word for the wheel, which postdates the spread of agriculture
15th ed. Dallas: SIL International. Content also available online
by millennia. Recently, further data are aforded by genetic at: http://www.ethnologue.com/
analyses of populations (genes and language). he pres- Greenberg, Joseph H. 1963. he languages of Africa. International
ence of a Pontic genetic component in Europe is compatible Journal of American Linguistics 29.1 (Supplement): Part 2.
with the idea that invaders from the PonticCaspian region . 1987. Language in the Americas. Stanford, CA: Stanford University
brought Indo-European languages into Europe (Cavalli-Sforza, Press.
Menozzi, and Piazza 1994). . 2002. Indo-European and Its Closest Relatives: he Eurasiatic
Language Family: Lexicon. Stanford, CA: Stanford University Press.
Prospects Kessler, Brett, 2001. he Signiicance of Word Lists. Stanford, CA: Center
Recent computer techniques add simplicity, reproducibility, for the Study of Language and Information.
Kessler, Brett, and Annukka Lehtonen. 2006. Multilateral comparison and
and quantitative rigor to methodologies for proving relation-
signiicance testing of the Indo-Uralic question. In Phylogenetic Methods
ships between languages, but so far there has been no notice-
and the Prehistory of Languages, ed. P. Forster and C. Renfrew, 3342.
able increase in power over what experts are able to do by hand. Cambridge, UK: McDonald Institute for Archaeological Research.
Failure to corroborate the sort of deep relationships conceived Mallory, J. P. 1989. In Search of the Indo-Europeans: Language,
by Greenberg may mean that better techniques need to be devel- Archaeology and Myth. London: hames and Hudson.
oped, or that the languages are not in fact related, or that the Nakhleh, Luay, Don Ringe, and Tandy Warnow. 2005. Perfect phyloge-
answer is unknowable. Because languages are always changing, netic networks: A new methodology for reconstructing the evolution-
they constantly lose information that links them to their relatives; ary history of natural languages. Language 81: 382420.
416
Language-Game
Oswalt, Robert L., 1998. A probabilistic evaluation of North Eurasiatic the interplay, in the determination of meaning, between lan-
Nostratic. In Nostratic: Sifting the Evidence, ed. J. C. Salmons and B. D. guage and the actions into which it is woven and to bring into
Joseph, 199216. Amsterdam: Benjamins. prominence the fact that the speaking of language is part of
Renfrew, Colin. 1987. Archaeology and Language: he Puzzle of Indo- an activity, or of a form of life ([1953] 1997, 23; see forms of
European Origins. London: Pimlico.
life ).
Ringe, Don A., Jr. 1992. On Calculating the Factor of Chance in Language
In fact, Wittgenstein nowhere provides a well-rounded dei-
Comparison. Philadelphia: American Philosophical Society.
Ringe, Don, Tandy Warnow, and A. Taylor. 2002. Indo-European and
nition of language-game because his employment of the term
computational cladistics. Transactions of the Philological Society evolves and because it is what he calls a family resemblance
100: 59129. concept. He employs the term
Swadesh, Morris. 1955. Towards greater accuracy in lexicostatistic dat- i. to circumscribe various more or less broad domains of lan-
ing. International Journal of American Linguistics 21: 12137. guage. Here, we can speak of single and speciic language-games,
homason, Sarah Grey, and Terrence Kaufman. 1988. Language Contact, such as those we play in our use of particular words or con-
Creolization, and Genetic Linguistics. Berkeley: University of California cepts (e.g., fear, game, hand, knowing) or in speciic
Press. activities (e.g., lying, thanking, cursing, making a joke,
following a rule, giving orders and obeying them); but these
speciic language-games are subsumed under more general uses
LANGUAGE-GAME
of the term either a generic use: he calls the whole, consisting of
At the beginning of the Philosophical Investigations, Ludwig language and the actions into which it is woven, the language-
Wittgenstein questions the Augustinian or traditional picture of game ([1953] 1997, 7) or an anthropological use, that is, what
the essence of human language, according to which the meaning he calls the human language-game (1977, 554) (as opposed, say,
of a word is the object for which it stands, so that a word to which to the language-game of alien tribes).
no object corresponds has no meaning. Learning language con- ii. to describe the degree of sophistication of language use.
sists in giving names to objects, and the association between Here, Wittgenstein speaks of primitive or complicated language-
the word and the object is established by ostensive teaching of games.
words. Wittgenstein attacks this conception as both confused iii. to describe how language works.
and reductive. It makes naming seem like a queer connection of It is in its encapsulation of how language works that the
a word with an object, a kind of baptism of the object, as if mean- expression language-game is most eloquent. he expression
ing existed separately from the word and was attached to it by is due to the analogy Wittgenstein makes between language and
a mental process: a remarkable act of mind or occult process. games, which supersedes the calculus analogy of the Tractatus,
Moreover, making the correspondence between name and object thereby signaling the switch from his own conception of language
a condition of meaning has the absurd consequence that when as a ixed and timeless symbolism to a conception of language as
the object no longer exists, the word no longer has any meaning. essentially embedded in human practice language as essen-
Here, the meaning of a name is confounded with the bearer of tially in use. In the Blue Book, he begins to question the idea that
the name, whereas in fact when Mr. N. N. dies, one says that the speaking a language is, in all cases, to apply a calculus according
bearer of the name dies, not that the meaning dies (Wittgenstein to strict and exact rules; rather, using language is much like play-
[1953] 1997, 40). he Augustinian picture of language is also ing a game. he game analogy is more itting than the calculus
reductive. Even if we were to correct its conception of ostensive analogy in several ways:
deinition (e.g., by replacing the occult process with training), it 1. Like game, language is a family resemblance concept. Just
would still err in being an oversimpliication: Ostensive deini- as there is a multiplicity of games with nothing common to all,
tion is too narrow; it does not describe everything that we call there is nothing common to all of our uses of language that
language. We think that language consists in giving names to makes them into language or parts of language, but they are
objects, whereas in fact we do the most various things with our related to one another in many diferent ways, and it is because
sentences: hink of exclamations alone, with their completely of this relationship that we call them all language ([1953]
diferent functions. Water! Away! Ow! Help! Fine! No! 1997, 65).
([1953] 1997, 27). 2. Language is an activity, and it is essentially connected with
he functions of words, suggests Wittgenstein, are as diverse practice or use: our language-game is a piece of behaviour
as the functions of tools in a toolbox ([1953] 1997, 11). And (1980, 151); a language-game incorporates both language and
these are countless. But his toolbox analogy does more than the actions into which it is woven ([1953] 1997, 7). To use lan-
underscore the diversity of the uses of language; it also suggests guage meaningfully is analogous to making a move in a game;
that words work much like tools in that their meaning resides in to understand a word is to know how to use it. Just as we learn
their use: We remain unconscious of the prodigious diversity how to play a game by learning what the permitted moves are
of all the everyday language-games because the clothing of our in the game, we learn the meaning of words by learning what
language makes everything alike ([1953] 1997, 224) and so it is accepted as a meaningful use of the word. And here, it is not
is use that is the distinguishing mark. In reaction to the mental- the application of explicit rules but training (1970, 186) and
ist conception of meaning, which sees it unilaterally as a mental repeated exposure that are needed to play a game properly or
connection between words and objects, Wittgenstein airms to use a word meaningfully. he concept of language-games
that the meaning of a word or sentence resides in the use we highlights the idea that the mastery of language is an acquired
make of it. He introduces the term language-game to highlight skill or know-how, not a systematic (innate or acquired)
417
Language-Game
application of rules: To understand a language means to be of life a human language. Language and form of life are intern-
master of a technique ([1953] 1997, 199). ally related: To imagine a language means to imagine a form
3. Like games, languages are rule governed, but this does not of life ([1953] 1997, 19), and to imagine a human language is
mean that there are strict and precise rules for each language- necessarily to imagine a human form of life, a human way of
game: Just as there are not rules to legislate for everything in being and acting that essentially involves both our biological
a game (e.g., there are no rules for how high or how hard one makeup and our social behavior. For our language-games are
throws the ball in tennis), language, too, is not everywhere cir- impacted by the facts of our natural history, such as our bio-
cumscribed by rules. And as in games, the learning of explicit logical human constraints; for example, the language-game
rules is not always necessary someone can have learned a with colors is characterized by what we can and what we can-
game without ever learning or formulating rules ([1953] 1997, not do (1970, 345). herefore, if certain very general facts of
31) and so, too, in language can the game be learned purely nature were diferent from what they are, so would our con-
practically, without learning any explicit rules (1977, 95). cepts and language-games be. But to say that our language-
he constitutive rules of language are those of grammar games are conditioned by certain facts (1977, 617), is not to
(1974, 184). What Wittgenstein means by grammar is not, how- say that our language-games are justiied by, or answerable to,
ever, what grammarians mean by grammar: It is neither a tax- these facts.
onomy of the structural features of a language nor a science 5. Like games, language and language-games are autono-
that describes or prescribes the correct or standard usage of mous. By this, Wittgenstein means that although our language-
words or arrangement of words. Wittgensteinian grammar is a games are rooted in our form of life, they are not accountable
generic term for the publicly determined (though this determi- to, or rationally grounded in, any reality: he language-game is
nation is not due to a concerted, but to an unconcerted, con- not based on grounds. It is not reasonable (or unreasonable). It
sensus) conventions or conditions (1974, 138, 88) that govern is there like our life (1977, 559). Rather than speak of symbols,
our meaningful use of words or expressions. Languages are rule words, or sentences as the primary or elementary units of mean-
governed, but the rules that govern them are not metalinguistic ing (as logicians, including Wittgenstein himself in the Tractatus,
norms that exist in advance of use; learning the meaning of a had done), the later Wittgenstein views the language-game as
word is learning how the word is used. Moreover, if grammat- the basic unit in linguistic activity; he urges us to look on the
ical rules determine what it makes sense to say, they cannot language-game as the primary thing ([1953] 1997, 656), that
themselves belong to the language-game: a grammatical rule which does not have to be explained by any fact. Here, he can
is a preparation (1993, 72) for a language-game, except in be viewed as broadening Gottlob Freges context principle: he
heuristic language-games (e.g., pedagogical language-games; context necessary for meaning is not the proposition but the
language teaching) where the formulation of some rules is the language-game (e.g., a sound is an expression only in a language-
object of the language-game (the distinction here is an instance game [(1953)1997, 261]).
of the use and mention distinction). To learn grammatical In his last work, On Certainty, Wittgenstein dwells on the
rules is to learn what the conventional conditions and con- importance of unmoving foundations for the possibility of
straints of the uses of a word are, which linguistic moves are language-games: It is essential for our language-games that
meaningful and which are not. Just as the rules of a game con- no doubt appears at certain points (cf. 1977, 524). He argues
stitute the game and its allowable moves, grammar constitutes that some basic certainties such as The world exists or
language and its allowable moves: This is a hand or Cats dont grow on trees are necessary,
4. Like games, language is embedded in our social, cul- unmoving foundations of our language-games (cf. 1977, 403,
tural and natural ways of living that is, in our form of life. 411), that the whole language-game rests on this kind of cer-
Languages cannot be abstracted from the context in which tainty (cf. 1977, 446). This kind of certainty, however, is non-
they live: words have meaning only in the stream of life epistemic; he views it as a kind of trust: I really want to say
(1982, 913). Language is a normative practice, but it is also a that a language-game is only possible if one trusts something
social practice. Any language is founded on convention ([1953] (1977, 509).
1997, 355); it is embedded in the shared activities of the lan-
Danile Moyal-Sharrock
guage users in a given community: To obey a rule, to make a
report, to give an order, to play a game of chess, are customs
(uses, institutions) ([1953] 1997, 199). To understand a lan-
guage-game, one must be either immersed in the community Baker, G. P., and P. M. S. Hacker. 1980. Wittgenstein: Understanding
in which it is embedded or knowledgeable about that commu- & Meaning: An Analytical Commentary on the Philosophical
nitys practices: Someone not accustomed to, or aware of, the Investigations. Vol. 1. Oxford: Basil Blackwell.
Canield, John V. 1993. he Living Language: Wittgenstein and
practice in some Semitic cultures of saying Hamsa! (Five!)
the Empirical Study of Communication Language Sciences 15.3:
a verbal conjuring of the ive ingers of the hand that protects
16593.
against the evil eye would not understand the purpose of Wittgenstein, Ludwig. [1953] 1997. Philosophical Investigations. Trans.
the utterance. For a language to emerge or be possible, there G. E. M. Anscombe. 2d ed. Oxford: Blackwell.
has to be something shared. What is shared is a distinct form . 1970. Zettel. Ed. G. E. M. Anscombe and G. H. von Wright, trans
of life: the particular biosocial conditions and activities that G. E. M. Anscombe. Berkeley: University of California Press.
make particular languages possible. he human form of life . 1974. Philosophical Grammar. Ed. R. Rhees, trans. A. Kenny.
could not have produced a feline language, nor a feline form Oxford: Blackwell.
418
Language-Learning Environment
. 1977. On Certainty. Ed. G. E. M. Anscombe and G. H. von the basis of their co-occurrence statistics (Redington, Chater,
Wright, trans. D. Paul and G. E. M. Anscombe. Amended 1st ed. and Finch 1998) or their occurrence in frequent frames (Mintz
Oxford: Blackwell. 2003). Moreover, studies using modern infant techniques have
. 1980. Remarks on he Philosophy Of Psychology. Vol. 1. Ed. shown that very young children are sensitive to all of these
G. E. M. Anscombe and G. H. von Wright, trans. G. E. M. Anscombe.
potential sources of information (Jusczyk 1999; Gomez and
Oxford: Blackwell.
Gerken 2000).
. 1982. Last Writings on the Philosophy of Psychology. Vol. 1. Ed.
G. H. von Wright and Heikki Nyman, trans. C. G. Luckhardt and
An interesting feature of this kind of work is the extent to which
M. A. E. Aue. Oxford: Blackwell. it sheds new light on classical arguments from the poverty of the
. 1993. Moores Wittgenstein lectures in 19301933. In stimulus. For example, John Lewis and Jefrey Elman (2001) have
Philosophical Occasions: 19121951, ed. J. C. Klagge and A. Nordman, shown that it is possible for a simple recurrent network to learn
46114. Indianapolis: Hackett. to obey structure-dependent rules such as those involved in the
formation of complex yes-no questions (e.g., Is the boy who is
dancing singing?) in the absence of exposure to complex yes/
LANGUAGE-LEARNING ENVIRONMENT
no questions. his occurs because the knowledge that the net-
his term refers to the linguistic and sociocultural environment work has acquired by processing simple yes/no questions and
in which children learn to talk and, in particular, to the language complex declaratives constrains the way in which it deals with
to which they are exposed. he irst systematic studies of the lan- complex yes/no questions.
guage-learning environment were conducted in response to the A inal strand of research has investigated the relation between
claim that, like the language used among adults, the language variation in language development and variation in the language
heard by children was grossly defective full of false starts, gram- to which children are exposed. For example, research on chil-
matical errors, and misleading pauses and as such represented dren learning English has found a relation between their aux-
a very poor sample of the language that the child must eventually iliary development and mothers use of yes/no questions (e.g.,
learn. hese studies showed that speech addressed to children Can you kick the ball?) in which the auxiliary occurs in stressed
was largely clear, well formed, and semantically and syn- utterance-initial position (see Richards 1990 for a review). One
tactically simpler than speech addressed to adults, leading problem with this kind of research is that covariance in patterns
some researchers to argue that in simplifying their speech, par- of within-language variation often makes it diicult to distinguish
ents were presenting their children with graded language lessons empirically between alternative explanations of the efects that
that could bear at least some of the burden of explanation for the are found. One way of avoiding this problem is to focus on the
childs remarkably swift progress in language learning (see Pine relation between cross-linguistic diferences in patterns of devel-
1994 for a review). opment and cross-linguistic variation in the language to which
In fact, there are a number of problems with this view. First, children are exposed. For example, Daniel Freudenthal and his
it confuses the notions of facilitating interaction and facilitating colleagues (2007) have recently shown that it is possible to simu-
acquisition. hus, some of the adjustments made by parents to late cross-linguistic variation in childrens tendency to use non-
facilitate interaction (e.g., the extensive use of questions) prob- inite verb forms in inite contexts in English, Dutch, German,
ably have the efect of increasing the complexity of the language and Spanish as a function of the interaction between one identi-
to which children are exposed during the early stages. Second, cal mechanism that learns from the right edge of the utterance
it is unclear to what extent the adjustments made by Western (MOSAIC, Model Of Syntax Acquisition In Children) and the
middle-class parents generalize across cultures. Indeed, ethno- statistical properties of child-directed speech in these four lan-
graphic researchers have identiied cultures in which parents guages. MOSAIC produces high proportions of noninite verb
react with mirth or horror to the idea of holding conversations forms in inite contexts in Dutch, German, and English because
with young language-learning children (see Lieven 1994 for a in these languages, the verb forms occurring in utterance-inal
review). hird, the notion that the language-learning environ- position in the input are much more likely to be noninite than
ment somehow facilitates acquisition is theoretically rather vac- inite. However, it produces much lower proportions of noninite
uous in the absence of a reasonably well-speciied theory of how verb forms in inite contexts in Spanish because in this language,
the child is learning from this environment. inite verb forms are much more likely than noninite verb forms
Cognizant of this last problem, more recent work has focused to occur at the right edge of the utterance.
on the kind of information that is present in the environment
Julian Pine
and the way in which this might be exploited by the childs
language-learning mechanisms. hus, work in computational WORKS CITED AND SUGGESTIONS FOR FURTHER READING
modeling has shown that there is a great deal of information in
the statistical structure of human languages that could, in prin- Brent, Michael. 1999. Speech segmentation and word discovery: A com-
putational perspective. Trends in Cognitive Sciences 3: 294301.
ciple, be used to solve particular language-learning problems.
Freudenthal, Daniel, Julian Pine, Javier Aguado-Orea, and Fernand
For example, work on segmentation has shown that it is pos-
Gobet. 2007. Modelling the developmental patterning of initeness
sible to use information about the stress pattern, the phonot- marking in English, Dutch, German and Spanish using MOSAIC.
actics, and the transitional probabilities between syllables in Cognitive Science 31: 31141.
a language to identify boundaries between words (Brent 1999), Gomez, Rachel, and LouAnn Gerken. 1999. Infant artiicial language
and work on the acquisition of syntactic categories has shown learning and language acquisition. Trends in Cognitive Sciences
that it is possible to categorize words quite successfully on 4: 17886.
419
Language of Thought
Jusczyk, Peter. 1999. How infants begin to extract words from speech. ii) there are semantic issues regarding the meaning or interpreta-
Trends in Cognitive Sciences 3: 3238. tion of the symbols and data structures. In the case of artifactual
Lewis, John, and Jefrey Elman. 2001. Learnability and the statistical computers, this issue is usually conveniently settled by stipula-
structure of language: Poverty of stimulus arguments revisited. In tion: he artifactor gets to say what the symbols represent (e.g.,
Proceedings of the Twenty-Sixth Annual Boston University Conference
bank balances, chess moves). In the case of natural creatures,
on Language Development, 35970. Somerville, MA: Cascadilla.
there is, of course, no relevant artifactor, and so the meaning
Lieven, Elena. 1994. Crosslinguistic and crosscultural aspects of
language addressed to children. In Input and Interaction in
of the symbols must be determined by some natural facts or
Language Acquisition, ed. Clare Gallaway and Brian Richards, 5673. relations.
Cambridge: Cambridge University Press. here are two (not necessarily exclusive) kinds of candidates
Mintz, Toby. 2003. Frequent frames as a cue for grammatical categories for a theory of meaning of a LOT:
in child-directed speech. Cognition 90: 91117. i. Meaning is spelled out in terms of some of the symbols cru-
Pine, Julian. 1994. he language of primary caregivers. In Input and cial causal/ conceptual roles in relation to other symbols, mirror-
Interaction in Language Acquisition, ed. Clare Gallaway and Brian ing patterns of inference. his is a natural suggestion for logical
Richards, 1537. Cambridge: Cambridge University Press. symbols, such as and and not: A symbol, @, might mean
Redington, Martin, Nick Chater, and Steven Finch. 1998. Distributional and because states entokening sentences p and q separately
information: A powerful cue for acquiring syntactic categories.
might cause and in turn be caused by a state entokening p@q
Cognitive Science 22: 42569.
by itself (see Block 1986; Peacocke 1994).
Richards, Brian. 1990. Language Development and Individual
Diferences: A Study of Auxiliary Verb Learning. Cambridge: Cambridge
ii. Meaning is spelled out in terms of causal relations that
University Press. the individual symbols bear to phenomena in the world. For
example, a symbol S might be entokened in a creatures brain in
a way that covaries with the presence of a certain shape before
LANGUAGE OF THOUGHT
the creatures eyes in various circumstances, such as under ideal
his is a special language that has been postulated by a number conditions, under evolutionarily selective ones, or under ones
of writers G. Harman (1972), J. Fodor (1975, 1987) to explain that display a certain counterfactual structure (see Dretske 1988;
how humans and many animals represent and think about the Stalnaker 1984; Fodor 1987).
world. he language of thought (LOT) is claimed to be coded, he chief rivals to the LOT hypothesis are either one or another
or entokened, in their brains, rather than in the way certain form of interpretativism, according to which a creature has prop-
formal languages are entokened in the circuitry of a computer. ositional attitudes only because ascription of such states permits
What makes the LOT a language is that it possesses semantically the most rational interpretation of their behavior (see Davidson
valuable, causally eicacious logico-syntactic structure: hat is, it [1973] 1984; Dennett 1987), and/or the brain structures underly-
consists, for example, of names, predicates, variables, quantiiers ing that ascription are of a radical connectionist nonsyntacti-
(all, some) logical connectives (not, and, only if) and cally structured sort (see Smolensky 1988). Its sometimes also
operators (possibly, probably) that are combined to form thought that some kind of system of imagistic representations
complex sentences that can be true or false. would not only be truer to introspection but also explain various
he LOT need not be the natural language (e.g., English, response-time results that suggest that people think in images
Chinese), if any, that a creature speaks, although some writers (see Kosslyn 1994).
have supposed that in adult humans the two may coincide (pro- he main reasons for believing in a LOT as opposed to these
viding an interesting perspective in the Sapir-Whorf hypothesis rivals is that it would explain a number of interesting phenom-
that thought is determined by language). Indeed, given that the ena associated with the mind. Salient among these are the
relevant sorts of intelligent behavior are displayed by many infra- following:
linguistic creatures infants, chimpanzees its postulation need 1. he propositional structure of attitudes: he standard
not be conined to natural language users. Nor need the LOT be object of, for example, a thought, belief, desire, hope, or expec-
in the least conscious or introspectible: A persons thought pro- tation is some kind of truth-valuable object, most perspicuously
cesses might be explained by a LOT, while introspectively his or expressed by a sentence; neither images nor connectionist
her mental life might seem to consist wholly of images, feel- networks are able to systematically represent logically com-
ings, and inarticulate impulses. Most importantly, processing a plex thoughts, involving, for example, negations and nested
LOT need not require any sort of intelligent creature to read and quantiiers (what image could express Not everyone loves
understand the sentences being processed in the brain. Along someone?).
lines set out by Alan Turing, the processing of the LOT symbols 2. he causal eicacy of thought: A thought can cause bodily
can be executed purely mechanically. states and movements because it is a structure entokened in the
As in discussing any language, there are two issues raised by brain.
such a postulation: i) here are syntactic issues regarding the 3. he productivity of thought: People can potentially under-
character of the actual symbols and the computations deined stand an ininitude of diferent thoughts formed by logical com-
over them. his is the kind of issue regularly addressed in rich binations of simpler ones, for example, Its possible for every
detail by programmers dealing with artifactual computers, and cat to chase some rat that eats some cheese that that lives in
with psychologists dealing with naturally occurring, living crea- the house that Jack built, since the LOT entokened in their brain
tures, in their concern with data structures and algorithms for can (under reasonable idealization) produce a corresponding
dealing with them, for example, for vision or reasoning; and ininity of sentences.
420
Language Policy
4. he systematicity of thought: If people can think some with or claiming authority to modify the practices or beliefs of
thought p, then they can also think all logical permutations of other speakers. Practices, beliefs, and management can be stud-
p; for example, people can think If John leaves, then someone ied separately, although they turn out to be interdependent.
insulted Mary if and only if they can think If someone leaves, then Early language managers were the Sanskrit and Arabic gram-
Mary insulted John, since they can mechanically recombine the marians who guarded the purity of sacred texts, the medieval
simple expressions of a sentence. European rulers who switched from Latin to the vernacular for
5. he intensionality of thought: People can think of things legal matters, and the language nationalists in the nineteenth
in one way without thinking of those very things in another; for century who made their national language diferent from that of
example, they can think that the morning star is Venus without their previous ruler.
thinking that the evening star is, and they can think about dif- Einar Haugen (1966) described how rival political ideolo-
ferent nonexistent things, such as Zeus and Santa Claus. Indeed, gies supported competing invented varieties of Norwegian and
the LOT can explain the hyperintensionality of thought: People compromised by requiring schoolchildren to learn both. In the
can think about things that are even necessarily identical, as 1960s, scholars became interested in the language-planning
when one thinks that Mark Twain but not Sam Clemens is problems of postcolonial Africa and Asia (Fishman, Ferguson,
funny, because they employ correspondingly diferent LOT and Das Gupta 1968). hey focused on status planning (mak-
symbols. ing one variety of language oicial or designating it for school
6. he multiple roles of attitudes: Diferent attitudes can be use) and corpus planning (changes in the language itself, such
directed upon the very same thought; for example, people can as standardizing it or providing it with a writing system or new
believe, desire, suspect, doubt the same thought, that God exists. terminology) (Kloss 1966). Others concentrated on the culti-
Fodor (1975, 1987) and Georges Rey (1997) have argued that vation and standardization of developed European languages
these and other phenomena cannot be explained by the rival (Nekvapil 2007).
views without substantial, additional ad hoc assumptions, for In status policy, the problem was to decide between the
example, that certain images or networks express logically com- demands of contending varieties; as a decision depended on
plex thoughts and are causally related in the aforementioned nonlinguistic values, such as the power of social or ethnic or eco-
ways. For these reasons, the language of thought is to be pre- nomic groups, language planners had little real inluence. here
ferred on empirical grounds (for more of an a priori argument was work to do in language academies or in writing textbooks
see, e.g., Davies 1991). to purify the linguistic usage of schoolchildren. In the 1970s,
some scholars tried to evaluate the efect of corpus planning,
Georges Rey
but it proved easier to keep doing it than to study its efective-
ness (Fishman 1977). Language policy expanded when Robert
L. Cooper (1989) added a third area, language-acquisition plan-
Block. N. 1986. Advertisement for a conceptual role semantics. In ning, or language-education management, the efort to increase
Studies in the Philosophy of Mind. Ed. P. French, T. Uehling, and the number of speakers. Related is language difusion, govern-
H. Wettstein, 61578. Minneapolis: University of Minnesota Press. ments working to spread their language outside their political
Davidson, D. [1973] 1984. Radical interpretation. In Inquiries into Truth
boundaries (Cooper 1982).
and Interpretation. Oxford: Clarendon.
Davies, M. 1991. Concepts, connectionism, and the language of thought.
In Philosophy and Connectionist heory, ed. W. Ramsey et al., 22956.
Rights and Theories
Hillsdale, NJ: Erlbaum. In the last half century, a further development has been the
Dennett, D. 1987. he Intentional Stance. Cambridge, MA: MIT Press study and promotion of human or civil rights associated with
Dretske, F. 1988. Explaining Behavior: Reasons in a World of Causes. language (Laitin and Reich 2003; May 2005; Skutnabb-Kangas,
Cambridge, MA: MIT Press Phillipson, and Rannut 1995). Building on principles irst pro-
Fodor, J. 1975. he Language of hought. New York: Crowell. posed after World War I, several international covenants of lan-
. 1987. Psychosemantics, Cambridge, MA: MIT Press. guage rights for minorities have been formulated, and some have
Harman, G. 1972. hought. Princeton, NJ: Princeton University Press. been adopted by international bodies such as UNESCO and the
Kosslyn, S. 1994. Image and Brain: he Resolution of the Imagery Debate. European Community; a smaller number have been ratiied by
nation-states, and a few have been implemented.
Peacocke, C. 1994. A Study of Concepts. Cambridge, MA: MIT Press.
here is no generally accepted theory of language policy.
Rey, G. 1997. Contemporary Philosophy of Mind. Oxford: Blackwell.
Smolensky, P. 1988. On the proper treatment of conectionism.
homas Ricento (2001), in fact, argues that there cannot be one.
Behavioral and Brain Sciences 11: 5974. However, Joshua A. Fishman (1991) has presented a model of
Stalnaker, R. 1984. Inquiry. Cambridge, MA: MIT Press. reversing language shift, which includes a graded intergenera-
tional disruption scale intended both to describe the state of a
language and its likelihood of being maintained and to suggest
LANGUAGE POLICY
how to resist further loss or reestablish earlier strength. Bernard
Deinition and History Spolsky (2004) has proposed that language policy has three
Language policy studies the regular choices among varieties and components (language practices, language ideology or beliefs,
variants within a speech community the language practices and language management), and has sketched a theory based
of members of the community, their beliefs about the values to on this proposal. Ji Nekvapil (2006), following Bjoern Jernudd
be assigned to the varieties, and eforts by individuals or groups and J. V. Neustupn (1987), has put forward a theory of language
421
Language Policy
management, ranging from individual self-correction to the 2,000 varieties. Central Europe and the Balkans repeated this
organized management of all micro and macro levels. Detailed process, as the partition of Czechoslovakia has led to oicial sta-
descriptions of language policy such as Grenoble (2003), Kaplan tus for Czech and Slovak (Neustupn and Nekvapil 2003), and
and Baldauf (2003) and Zhou (2004) have started to clarify the the division of Yugoslavia has now led to eforts to distinguish
complex dimensions that a theory must handle. Serbian and Croatian (Pranjkovic 2001).
Belgium and Switzerland use territoriality to resolve language
The Politics of Language Policy conlict. Externally, both are believed to be bilingual. In fact,
One of the most critical facts or beliefs about language vari- Belgium is divided into language regions, some of which are oi-
eties concerns their power. There are many nation-states cially French-speaking, others oicially Dutch-speaking, and a
that assume monolingualism to be ideal and combine this few oicially German-speaking. Only Brussels is oicially bilin-
assumption with a belief in the value, beauty, efficiency, gual. he varieties spoken in these regions are neither Dutch nor
and desirability of their own national variety. This is true of French but regional dialects; as a result, 40 percent of Belgian
English-speaking nations, although it is challenged by coun- high school students report that they are taught in a language
terassertions in South Africa, which has a long tradition of that they do not speak at home (Aunger 1993). In Switzerland,
claims for Afrikaans and has recently extended nominal rec- each canton establishes its own language policy, choosing among
ognition to nine African languages, and in Canada by the lan- German, French, Italian, and Romansch. Knowledge of a second
guage-related claims of Quebec for independence. The belief language (other than the expanding use of English associated
was first manifested in Spain, which continued its search for with globalization) is no better than in other European countries
purity after the expulsion of Moors and Jews with a procla- (Harlow 2004; Hrdegen 2001).
mation carried to the New World of the value of Spanish; this he special language problems of Africa were produced by
resulted in the virtual destruction of Native American lan- the fact that the borders drawn by colonizing European powers
guages. The belief in the importance of a single national vari- in the nineteenth century did not coincide with ethnic, tribal, or
ety was adopted by the Jacobins during the French Revolution linguistic boundaries. After independence, African states had
and gradually implemented in France and French territories to choose among a variety of languages, most of which were
(Ager 1999): The difficulty of its implementation continues also spoken in a neighboring states (Bamgbose 2000). Colonial
to be demonstrated by the need to pass new laws and regula- educational policy had favored the use of European metropoli-
tions. German Romanticism and nationalism (Fishman 1973) tan languages, absolutely in the case of French and Portuguese
provided an ideological base with the proclamation of the colonies and, after initial use for a few years of local vernaculars,
truth of one nation, one language. Another example of ideo- in British colonies. Partly because choice of any one vernacular
logical monolingualism is Japan, which during its period of would provide excessive power to its speakers, partly because
colonial expansion required conquered peoples to switch to the elite already spoke the metropolitan language, and partly
Japanese, and which has only recently taken note of minority because of inertia, postindependence eforts to establish the sta-
languages (Katsuragi 2005). tus of African languages have generally failed (Phillipson 1992).
Commonly, the existence of two or more major languages
within a single nation-state or confederation is associated with Globalization and Local Resistance
political conlict. One resolution is to favor a single variety, either Globalization has a major impact on language policy. One efect
that spoken by the majority or that controlled by the dominant has been the unparalleled difusion of English, the most widely
elite. In the Soviet Union, a Lenin-inspired policy of recognizing used second language in most of the world. English is the favored
minority languages to speed up the spread of communism was irst foreign language in all European countries, spreading also
replaced under Stalin by a Russiication policy (Grenoble 2003; into former Soviet nations. In Asia, English is the lingua franca
Lewis 1972). After the collapse of the Soviet Union, most of the for intercommunication among Japanese, Chinese, Koreans, and
newly independent states reasserted the signiicance of their hais. International corporations, even those located in European
territorial languages, so that currently each of the former Soviet countries, tend to prefer English. Foreign language teaching is a
states (including Russia itself) appears to be working toward topic of interest mainly in English-speaking countries; elsewhere,
monolingualism in the territorial language (Landau and Kellner- the major concern is English language teaching.
Heinkele 2001; Ozolins 2003). he protection of endangered languages is a recent concern
of many language policy scholars. hey have noticed the rapid
Territoriality loss of smaller minority languages, estimating that most of the
A second solution to problems associated with having multiple current 6,000 languages in the world will disappear in the next
major languages within a single nation-state is territoriality. hundred years (Krauss 1991). he threat comes not just from
India tried to base its internal political divisions on language. world languages like Spanish (which has virtually denuded
he partition into a Hindu-dominated India and Moslem- Latin America of its rich linguistic diversity) or French (with its
dominated Pakistan was paralleled by a language-management strong monolingual ideology) or English (universally feared as
efort to divide what was previously considered a single lan- the exemplar of linguistic imperialism) but also from stronger
guage, Hindustani, into two Hindi written in Devanagari script local languages like Swahili. Fishman (1990) provides a set of
and Urdu with Perso-Arabic script (Annamalai 2001). he split- benchmarks for studying loss and suggests how to reverse it. So
ting up of India into states relected major language diferences, far, the most successful eforts at reversal have been associated
although it could not capture the complexity of a nation with with grants of political autonomy, as in Spain (Hofmann 1995),
422
Language Policy
the United Kingdom ( Laoire 1996; Coupland et al. 2006), and Fishman, Joshua A., Charles A. Ferguson, and Jyotirinda Das Gupta. 1968.
Canada (Bourhis 2001). here are also eforts in New Zealand Language Problems of Developing Nations. New York: Wiley.
(Spolsky 2005; May and Hill 2005), in South America (Hornberger Grenoble, Lenore A. 2003. Soviet Language Policy. Dordrecht, the
and King 2001), and among other indigenous peoples (McCarty Netherlands: Kluwer Academic Publishers.
Harlow, Ray. 2004. Switzerland. In Encyclopedia of Linguistics, ed.
2003; Omoniyi 2003).
P. Strazny. London: Taylor and Francis.
Speakers of major languages also fear language loss. his
Haugen, Einar. 1966. Language Conlict and Language Planning: he
can be seen in Spain, with the sensitivity of its Academy to lan- Case of Modern Norwegian. Cambridge: Harvard University Press.
guage change; in France, with its growing number of regulations Hofmann, Charlotte. 1995. Monolingualism, bilingualism, cultural
and language agencies; in Russia, with its refusal to recognize pluralism and national identity: Twenty years of language planning
non-Cyrillic alphabets for minority languages and its claim of in contemporary Spain. Current Issues in Language and Society
defending the language rights of Russian-speakers in former 2.1: 5990.
Soviet states; and even in the United States, where an English- Hornberger, Nancy H., and Kendall A. King. 2001. Reversing language
only movement is struggling against what it sees as the threat of shift in South America. In Can hreatened Languages be Saved? ed.
Spanish and other immigrant languages to the survival of what J. A. Fishman, 16694. Clevedon and Avon: Multilingual Matters.
most people believe to be the strongest language in the world Hrdegen, Stephan. 2001. he Fribourg linguistic case controversy
about the language of instruction in schools in the light of freedom
(Baron 1990).
of language and equal educational opportunities in Switzerland.
Language policy is a new and rapidly developing ield, the
European Journal for Educational Law and Policy 5: 7382.
urgency and seriousness of which has resulted in activism by Jernudd, Bjoern, and J. V. Neustupn. 1987. Language planning: For
scholars who feel responsible for correcting what they see as whom? In Proceedings of the International Colloquium on Language
injustices or blindness to the potential loss of linguistic diversity, Planning, ed. L. LaForge, 7184. Quebec: Presses de lUniversite Laval.
as well as in academic attempts to develop theories to explain Kaplan, Robert B., and Richard B. Baldauf. 2003. Language and
data from increasingly detailed descriptions of situations and Language-in-Education Planning in the Paciic Basin. Dordrecht, the
policies. Netherlands: Kluwer Academic Publishers.
Katsuragi, Takao. 2005. Japanese language policy from the point of view
Bernard Spolsky of public philosophy. International Journal of the Sociology of lan-
guage 175/176: 4154.
WORKS CITED AND SUGGESTIONS FOR FURTHER READING Kloss, Heinz. 1966. German-American language maintenance eforts.
Ager, Dennis E. 1999. Identity, Insecurity and Image: France and Language. In Language Loyalty in the United States, ed. J. Fishman, 20652. he
Clevedon and Philadelphia: Multilingual Matters. Hague: Mouton.
Annamalai, E. 2001. Managing Multilingualism in India: Political and Krauss, Michael. 1991. he worlds languages in crisis. Language
Linguistic Manifestations. New Delhi: Sage. 68.1: 410.
Aunger, Edmund A. 1993. Regional, national and oicial languages Laitin, David D., and Robert Reich. 2003. A liberal democratic
in Belgium. International Journal of the Sociology of Language approach to language justice. In Language Rights and Political
104: 3148. Theory, ed. W. Kymlicka and A. Patten, 80104. Oxford: Oxford
Bamgbose, Ayo. 2000. Language and Exclusion: he Consequences of University Press.
Language Policies in Africa, Beitrage zur Afrikanistik. Mnster and Landau, Jacob, and Barbara Kellner-Heinkele. 2001. Politics of Language
Hamburg: LIT Verlag. in the Ex-Soviet Muslim States: Azerbaijan, Usbekistan, Kazakhstan,
Baron, Dennis E. 1990. he English Only Question. New Haven, CT: Yale Kyrgyzstan, Turkmenistan and Tajikistan. London and Ann Arbor:
University Press. C.Hurst & Co. and University of Michigan Press.
Bourhis, Richard Y. 2001. Reversing language shift in Quebec. In Can Lewis, E. Glyn. 1972. Multilingualism in the Soviet Union. he
hreatened Languages be Saved? ed. J. A. Fishman 10141. Clevedon Hague: Mouton.
and Avon: Multilingual Matters. May, Stephen. 2005. Language rights: Moving the debate forward.
Cooper, Robert L. 1989. Language Planning and Social Change. Journal of Sociolinguistics 9.3: 31947.
Cambridge: Cambridge University Press. May, Stephen, and Richard Hill. 2005. Mori-medium education: Current
Cooper, Robert L., ed. 1982. Language Spread: Studies in Difusion and issues and challenges. International Journal of Bilingual Education
Social Change. Bloomington: Indiana University Press. and Bilingualism 8.5: 377403.
Coupland, Nicholas, Hywel Bishop, Betsy Evans, and Peter Garrett. 2006. McCarty, Teresa L. 2003. Revitalising indigenous languages in homoge-
Imagining Wales and the Welsh language: Ethnolinguistic subjectivi- nising times. Comparative Education 29.2: 14763.
ties and demographic low. Journal of Language and Social Psychology Nekvapil, Ji. 2006. From language planning to language management.
25.4: 35176. Sociolinguistica 20: 92104.
Fishman, Joshua A. 1973. Language and Nationalism: Two Integrative . 2007. Language cultivation in developed contexts. In Handbook
Essays. Rowley, MA: Newbury House. of Educational Linguistics, ed. B. Spolsky and F. M. Hult, 25165.
. 1977. Comparative study of language planning: Introducing a sur- Oxford: Blackwell.
vey. In Language Planning Processes, ed. J. Rubin, B. H. Jernudd, J. Das Neustupn, J.V., and Ji Nekvapil. 2003. Language management
Gupta, J. A. Fishman and C. A. Ferguson, 3140. he Hague: Mouton. in the Czech republic. Current Issues in Language planning
. 1990. What is reversing language shift (RLS) and how can it suc- 4.3/4: 181366.
ceed? Journal of Multilingual and Multicultural Development 11.1/2: Laoire, Muiris. 1996. An historical perspective of the revival of Irish
536. outside the Gaeltacht, 18801930, with reference to the revitaliza-
. 1991. Reversing Language Shift: heoretical and Empirical tion of Hebrew. In Language and State: Revitalization and Revival in
Foundations of Assistance to hreatened Languages. Israel and Eire, ed. S. Wright, 5175. Clevedon and Avon: Multilingual
Clevedon: Multilingual Matters. Matters.
423
Laws of Language
Omoniyi, Tope. 2003. Local policies and global forces: Multiliteracy and Laws in the Study of Language and Text
Africas indigenous languages. Language Policy 2.2: 13352. In quantitative linguistics, the exact science of language
Ozolins, Uldis. 2003. he impact of European accession upon language
and text, distributional and functional kinds of laws are known.
policy in the Baltic States. Language Policy 2.3: 21738.
he irst kind takes the form of probability distributions; that is,
Phillipson, Robert. 1992. Linguistic Imperialism. Oxford: Oxford
it makes predictions about the number of units of a given prop-
University Press.
Pranjkovic, Ivo. 2001. he Croatian standard language and the Serbian erty. A well-known example of this kind is the Zipf-Mandelbrot
standard language. International Journal of the Sociology of Language Law. he status of the corresponding phenomenon has been
147: 3150. discussed since the days of George K. Zipf, who was the irst to
Ricento, homas, ed. 2001. Ideology, Politics and Language Policies: Focus systematically study quantitative properties of language from
on English. Amsterdam and Philadelphia: John Benjamins. a scientiic point of view. he law relates a) the frequency of a
Skutnabb-Kangas, Tove, Robert Phillipson, and Mart Rannut. 1995. word in a given text (in any language) to the number of words
Linguistic Human Rights: Overcoming Linguistic Discrimination. Berlin with the given frequency (called frequency spectrum) and b) the
and New York: Mouton de Gruyter. frequency of a word in relation to its rank (called rank-frequency
Spolsky, Bernard. 2004. Language Policy, Key Topics in Sociolinguistics.
distribution). he irst formulation by Zipf was later modiied
and corrected by Benoit Mandelbrot, who derived the law from
. 2005. Maori lost and regained. In Languages of New Zealand, ed.
the assumption that languages optimize their lexicons with
A. Bell, R. Harlow, and D. Starks, 6785. Wellington: Victoria University
Press. respect to code-production efort in the long run. his resulted
Zhou, Minglang, ed. 2004. Language Policy in the Peoples Republic in the famous formula (1), which has the form of a rank-fre-
of China: heory and Practice since 1949. Dordrecht, the quency distribution: If the words are arranged according to their
Netherlands: Kluwer Academic Publishers. frequency, the most frequent word is assigned rank one, and so
on. he formula gives the frequency that a word should have at
a given rank:
LAWS OF LANGUAGE K
(1) f (r) =
The Concept of Law (b+ r)
he philosophy of science deines the term scientiic law as a where f (r) is the frequency, r the rank, b and empirical param-
meaningful universal hypothesis that is systematically connected eters, and K a normalizing constant that makes the probabilities
to other hypotheses in the ield and, at the same time, well cor- sum up to 1.0.
roborated on relevant empirical data (cf. Bunge 1967). A law is Since the seminal works of Zipf and Mandelbrot, numerous
called universal because it is valid at all times, everywhere, and laws have been found. Other examples of distributional laws
for all objects of its scope. are (in morphology and lexicon) the distribution of length,
A system of laws is called a theory. he construction of a the- polysemy, synonymy, age, part of speech (see word classes),
ory is the highest and most demanding goal of scientiic research and so on, (in syntax) the frequency distribution of syntac-
and can be undertaken only if and when a number of interre- tic constructions, the distribution of their complexity, depth of
lated laws have been found. here is much confusion about the embedding, information, and position in mother constituent; (in
term theory, especially in linguistics, where all kinds of formal- semantics) the distribution of the lengths of paths in semantic
isms, thoughts, approaches, descriptive tools, deinitions, and networks (see also semantic fields), semantic diversiica-
concepts are called theories. he philosophy of science distin- tion, and so on. Any property and any linguistic unit studied so
guishes two kinds of theories: 1) the axiomatic theories of logics far displays a characteristic probability distribution.
and mathematics and 2) the empirical theories in the factual sci- he second kind of law is called the functional type, because
ences. While the irst ones make statements only within a given these laws link two (or more) properties. An illustrative example
axiomatic system and can be used only to construct analytical of this kind is Menzeraths Law (also called Menzerath-Altmann
truths, the latter ones make statements about parts of the world. Law), which relates the size of linguistic constituents to the size
he truth of an empirical theory and of its elements, the laws, of the corresponding construct. hus, the (mean) length of the
depends not only on internal correctness but also on the cor- syllables of a word depends on the number of syllables the
respondence with the facts of reality although every empirical word consists of; the (mean) length of the clauses in a sentence
theory must have an axiomatic kernel. depends on the length of the sentence (measured in terms of the
he value of theories and their components, the laws, lies not number of clauses it consists of). he most general form of this
only in their role as the containers of scientiic knowledge but law is given by formula (2):
also in the fact that there can be no explanation without at least
(2) y = Ax be-cx
one law: A valid scientiic explanation (the so-called deductive-
nomological explanation; cf. Hempel and Oppenheim 1948) is a where y is the mean length of the constituents, x the length of the
subsumption under laws taking into account boundary condi- construct, and A, b, and c are parameters. (his law predicts the
tions. Laws must not be confused with rules, which are either function [2] but not the values of its parameters. hey are esti-
prescriptive or descriptive tools without any explanatory power; mated empirically on the data under analysis. Future research
hence, grammars and similar formalisms also cannot explain may provide an enhanced version of the law that will also deter-
anything. Another signiicant diference is that rules can be vio- mine these parameters.) Experience shows that the parameters
lated laws (in the scientiic sense) cannot. are determined mainly by the level of the units under study. hey
424
Laws of Language
3.2 (/vart/ > /vurde/) in the time period from 1445 to 1925. As the
graph shows, the replacement was very limited for the irst 200
3.0
years, but even a much shorter time span can provide enough
2.8 information to predict the development over the next several
hundred years.
2.6 Another variant of this third kind of law is based on (discrete)
linguistic instead of (continuous) physical time. he simplest
2.4
way to operationalize linguistic time is the reference to text
2.2 position. In oral texts, there is a direct correspondence of the
sequence of linguistic units to physical time intervals.
2.0 Several linguistic characteristics can be investigated using
0 2 4 6 8
Figure 1. The functional dependence of mean syllable length (y-axis) on indices that relate their frequency to current text position, among
word length (x-axis) in Hungarian. The line represents the prediction; the them the type-token-ratio (TTR). At each text position, the num-
marks show the empirical data points. ber of types occurred to that point is counted, which yields a
monotonously increasing curve, because the number of words
used before a given text position cannot decrease in the course
of the rest of the text. A straightforward theoretical derivation of
this law was given by Gustav Herdan (1966), represented by the
simple formula (4):
(4) y = ax b
where y is the number of types, x the number of tokens (= text

position), and b a text characteristic. he parameter b is also an
indicator of the morphological type of the language under study
if word forms are considered because morphologically rich lan-
guages display a faster increase in word-form types than isolat-
ing languages.
A problem of the TTR, if used for text comparison, is that it is
not independent of the overall text length. herefore, more com-
plicated formulae are used to take this inluence into account or
quite diferent models (cf. Popescu and Altmann 2006a, 2006b)
are applied.
Recent investigations have found that other linguistic units
show a similar behavior in their text dynamics (letters, morphs,
syntactic constructions, syntactic function types, etc.). However,
depending on the size of their inventory in language (which may
vary over several orders of magnitude compare, e.g., the size
Figure 2. Typical curve representing the replacement of a linguistic unit of an alphabet or a phoneme system to the size of a lexicon),
by a new one.
diferent models have to be used. he TTR of syntactic units, for
example, is shown in Figure 3.
increase from the level of sound length gradually to the sentence
and suprasentence level. Figure 1 gives an impression of a typical
curve. Other examples are the dependence of word (or morph)
Theory Construction
Currently, there are two approaches to the construction of a lin-
frequency on word (or morph) length, and the frequency of syn-
guistic theory (in the sense of the philosophy of science): 1) syn-
tactic constructions on their complexity, of polysemy on length,
ergetic linguistics and 2) Gejza Wimmer and Gabriel Altmanns
of length on age, and so on.
uniied theory.
A special variant of a functional law is the developmental one.
The basic idea behind synergetic linguistics (cf. Khler 1986,
Here, a property is related to time. he best-known example is
2005) is the aim to integrate the separated laws and hypoth-
the Piotrowski Law, which represents the development (increase
eses found so far into a complex model that not only describes
and/or decrease) of the portion of new units or forms over time.
the linguistic phenomena but also provides a means to explain
his law is a typical growth process and can be derived from a
them. This is achieved by introducing the central axiom that
simple diferential equation with the solution (3):
language is a self-regulating and self- organizing system .
c
p= An explanation of the existence, properties, and changes of
(3) 1 + ae bt
linguistic, more generally semiotic, systems is not possible
where p is the proportion of new forms at time t, c is the without the aspect of the (dynamic) interdependence of struc-
saturation value, and a and b are empirical parameters. ture and function. The genesis and evolution of these systems
Figure 2 shows the increase of the forms with /u/ at the cost must be attributed to repercussions of communication upon
of the older form with /a/ in the German word ward > wurde structure (cf. Bunge 1998 and Khler and Martinkov 1998).
425
Laws of Language
100
Types
Figure 3. The TTR of syntactic constructions in a text.

The smooth line corresponds to the prediction; the
50 irregular line represents the empirical data.
0
0 300 600 900 1200 1500
Tokens
. 1998. Semiotic systems. In Systems: A New Paradigm for the

Synergetic modeling in linguistics starts from axiomatically
Human Sciences, ed. Gabriel Altmann and Walter A. Koch, 33749.
assumed requirements that a semiotic system must meet: the
Berlin and New York: Walter de Gruyter.
coding requirement (semiotic systems have to provide a Haken, Hermann. 1978. Synergetics. Berlin and Heidelberg: Springer.
means to create meaningful expressions) and the require- Haken, Hermann, and R. Graham. 1971. Synergetik. Die Lehre vom
ments of coding and decoding efficiency, of memory saving, Zusammenwirken. Umschau 6: 191.
of transmission security, of minimization of effort, and many Hempel, Carl G., and P. Oppenheim. 1948. Aspects of scientiic explana-
others. tion. Philosophy of Science 15: 13575.
he other approach at theory construction in linguistics is Herdan, Gustav. 1966. he Advanced heory of Language as Choice and
Wimmer and Altmanns uniied theory. Integration of sepa- Chance. Berlin: Springer.
rately existing laws and hypotheses starts from a very general Hebek, Ludek. 1997. Lectures on Text heory. Prague: Oriental
Institute.
diferential (alternatively: diference) equation, as well as two
Khler, Reinhard. 1986. Zur linguistischen Synergetik. Struktur und
very general assumptions: 1) If y is a continuous linguistic vari-
Dynamik der Lexik. Bochum, Germany: Brockmeyer.
able (i.e., some property of a linguistic unit), then its change . 1990. Elemente der synergetischen Linguistik. Glottometrika
over time or with respect to another linguistic variable will be 12: 17988.
determined in any case by its temporary value. Hence, a cor- . 1995. Bibliography of Quantitative Linguistics = Bibliographie zur
responding mathematical model should be set up in terms of quantitativen Linguistik = Bibliograija po kvantitativnoj lingvistike.
its relative change (dy/y). 2) he independent variable that Amsterdam: Benjamins.
has an efect on y also has to be taken into account in terms . 2005. Synergetic linguistics. In Quantitative Linguistik. Ein
of its relative change (i.e., dx/x). he discrete approach is ana- internationales Handbuch. Quantitative [Linguistics: An International
logical; one considers the relative diference yx/yx. Hence, the Handbook], ed.Reinhard Khler, Gabriel Altmann, and Rajmund G.
general formulas are dy/y = g(x)dx and yx-1 / yx-1 = g(x). he Piotrowski, 76075. Berlin and New York: Walter de Gruyter.
Khler, Reinhard, and Zuzana Martinkov. 1998. A systems theoreti-
solutions of these equations are quite interpretable linguisti-
cal approach to language and music. In Systems: A New Paradigm for
cally and yield the same results as the synergetic approach. he
the Human Sciences, ed. Gabriel Altmann and Walter A. Koch, Berlin,
great majority of laws known up to now can be derived from 51446. New York: Walter de Gruyter.
these equations. Mautek, Jn, and Gabriel Altmann. 2007. Discrete and Continuous
Both models, the uniied and the synergetic, turn out to be Modeling in Quantitative Linguistics. Journal of Quantitative
two representations of the same basic assumptions. he syner- Linguistics 14: 8194.
getic model allows easier treatment of multiple dependencies for Popescu, Ioan-Iovitz, and Gabriel Altmann. 2006a. Some aspects of
which partial diferential equations must be used in the uniied word frequencies. Glottometrics 13: 2346.
model. . 2006b. Some geometric properties of word frequency distribu-
tions. Gttinger Beitrge zur Sprachwissenschaft 13: 8798.
Reinhard Khler Wimmer, Gejza, and Gabriel Altmann. 2005. Uniied derivation of
some linguistic laws. In Quantitative Linguistik. Ein internationales
Handbuch. [Quantitative Linguistics: An International Handbook],
ed. Reinhard Khler, Gabriel Altmann, and Rajmund G. Piotrowski,
Altmann, Gabriel. 1980. Wiederholungen in Texten. Bochum, Germany: 76075. Berlin and New York: de Gruyter.
Brockmeyer. Zipf, George Kingsley. [1935] 1968. he Psycho-Biology of Language: An
Bertalanfy, Ludwig van. 1968. General System heory: Foundations, Introduction to Dynamic Philology. 2d ed. Boston: Houghton-Milin.
Development, Applications. New York: George Braziller. Cambridge, MA: MIT Press.
Bunge, Mario. 1967. Scientiic Research I, II. Berlin and Heidelberg: . 1949. Human Behaviour and the Principle of Least Efort. Reading,
Springer. MA: Addison-Wesley.
426
Learnability
LEARNABILITY L0, which is an ininite set consisting of the symbol a repeated an

arbitrary number of times:
generative grammar shifted the foundations of theoretical
linguistics away from discovery procedures the automatic con- L0 = { a, aa, aaa, ... }
struction of an optimal grammar to the problem of language Otherwise, a language Li in the set consists of all strings of as
learnability, the question of how a natural language could be shorter than, and including, a repeated i times. Learning a lan-
learned in principle. his statement of the problem is so vague guage in this set just means converging to the i that is the index
as to be useless; so let us break the question into subparts and for the language Li.
consider them in turn. It is easy to see that this set of languages is not learnable from
he study of language learning might proceed from the obser- positive-only evidence. Suppose that the longest string that the
vation and investigation of actual human children engaged in the learner has seen to date is of length n. Nothing about the text will
process of learning their mother tongue or tongues. However fas- allow the learner to distinguish between L0 and Ln, and so the
cinating and useful this approach is, it is fraught with a number learner will be incapable of converging. hus, this class of lan-
of diiculties. Real children are engaged in a number of diferent guages is not learnable.
tasks and are changing along a number of diferent dimensions Golds result might seem to spell disaster for the learnability
while in the process of learning their irst language. he develop- project. he derivational machinery needed for the languages
mental psycholinguist must, then, be careful of all these diferent that Gold used to prove his theorem is much simpler than what
factors. would be required for the set of natural languages, yet Golds set
he investigation of learnability seeks to circumvent these dif- is unlearnable. Gold seems to have shown that the natural lan-
iculties by considering language learning as a problem in com- guages are not learnable in the sense outlined here.
putational logic. he researcher in language learnability seeks to We neednt fret for too long over this particular bugbear since
construct an explicit algorithm that will produce a grammar for another way of thinking about Golds result is near at hand. In
a target language after inite exposure to evidence from that lan- particular, the set of learnable languages does not contain the set
guage. Such a researcher takes the rareied view of language as that Gold constructed for his proof. Some have found it tempt-
a set of strings, corresponding to the grammatical sentences ing to massage Golds result into an argument that the learner
of that language. He or she supposes that the learner is actually is equipped with prior information (innate knowledge) about
an algorithm that takes as input a text, an ininite sequence of the set of natural languages, although this goes beyond the actual
sentences. he text is constructed by drawing strings from the content of the result.
language and presenting them, one at a time, to the learner. In here are a number of interesting responses to Golds result.
this case, the learner is presented with positive only evidence; Complexity bounds can be placed on the grammars that the
he/she is given information about sentences that are in the lan- learner can consider. M. Kanazawa (1998) has shown that
guage but no information about sentences that are outside the restricted sets of categorial grammars are string learn-
language. An alternative learning setting would be to allow the able learnable using the kind of text presentation that we have
learner to be tutored by giving him/her strings that are marked considered. Golds result entails that the entire class of categorial
for grammaticality. Such information demonstrably simpli- grammars cannot be learned using this kind of evidence. Note
ies the learning task, but in real learning, the child is unlikely that Kanazawas result does not conlict with Golds theorem
to receive systematic evidence about grammaticality. As a result, since it holds only of a particular subset and not the entire class,
learnability research has generally proceeded from the assump- the latter case being what Golds theorem excludes.
tion of positive-only evidence. Alternatively, learners might receive more evidence about the
After each example is presented to the learner, that learner target language than is present in a positive-only text. K. Wexler
makes a guess about the grammar of the target language. A and P. Culicover (1980) developed a proof of the learnability of
learner is said to converge to a grammar for a language just in the set of rules in the transformational component of a 1970s style
case the learner hypothesizes that grammar after inite exposure transformational grammar. In their system, the learner is
to the text and never alters the hypothesis after that. If the gram- presented with pairs consisting of the surface syntactic string
mar is correct, then the learner is said to have learned the target along with a base structure. his base structure is akin to the level
language; truly, learning a language means that the learner has of deep structure (see underlying structure and sur-
hit the correct grammar and never changes his/her mind after face structure), where the syntactic representation is a kind
that. We only require, at this point, that the grammar generate of mentalese the language of thought, and would be invariant
the correct set of sentences; we have not said anything about across languages (the Universal Base Hypothesis). he learner
how the grammar does so, and so we place no constraints on the is presented occasionally with both a grammatical sentence and
structural descriptions assigned to sentences. its meaning. he proof shows both that the transformation rule
A language is learnable if there exists a learner who, upon component could be learned and that a complexity bound could
inite exposure to the language, learns the language in the afore- be placed on the input evidence that the learner needed in order
mentioned sense. A set of languages is learnable if the leaner can to converge (their Degree 2 Learnability result).
learn every language in the set. S. Pinker (1984) considered the cases whereby the learner has
We can turn to an early learnable result from E. M. Gold access to the string and a representation of its semantic content.
(1967; see also Osherson, Stob, and Weinstein 1986). Imagine a his process, called semantic bootstrapping, uses a set of heu-
set of languages deined as follows: We deine a language, call it ristic rules to link semantic categories to syntactic categories.
427
Learnability Left Hemisphere Language Processing
Costa Florencio (2003) has shown that the full range of categorial only then would the learner set the parameter to the exempliied
grammars can be learned if the learner is presented with unla- value.
beled structures. Charles Yang (2002) has imported a number of mathematical
Another response to Golds result is to impose diferent types tools from population biology to develop a sophisticated model
of constraints on the learners hypothesis space. For example, of parameter setting that is clearly inspired by evolutionary the-
the idea that universal grammar consists of a set of invariant ory. Equally interesting work has been done on the learnability of
principles whose expression is regulated by a inite set of param- optimality theory (Tesar and Smolensky 2000) using tech-
eters has played a seminal role in linguistic theory over the past niques drawn from statistical machine learning. Although no full
quarter of a century (see principles and parameters the- proof of the learnability of parametric approaches yet exists, the sta-
ory and language acquisition). he learner is taken as tistical approaches as well as work in conventional machine learn-
being faced with the inite task of discovering the correct value ing (see Manning and Schutze 1999) promise to yield new insights
of each parameter, where each parameter can take on one of a in language learning, language variation, and language change.
inite set of parameters, given a text consisting of simple gram-
Robin Clark
matical sentences.
he parametric approach usually assumes that the learner WORKS CITED AND SUGGESTIONS FOR FURTHER READING
is given positive-only input. After a sentence is presented to
the learner, it produces a hypothesis, possibly by changing the Clark, R. 1992. he selection of syntactic knowledge. Language
Acquisition 2: 83149.
value(s) of one or more parameters. It is obvious that even a rela-
Clark, R., and I. Roberts. 1993. A computational model of language
tively small set of parameters could produce an enormous space
learnability and language change. Linguistic Inquiry 24: 299345.
of languages. he space of languages might, for example, con- Dresher, E., and J. Kaye. 1990. A computational learning model for met-
tain local maxima. A local maximum might look correct to the rical phonology. Cognition 34: 13795.
learner since it would always yield a structural analysis for any Florencio, C. C. 2003. Learning categorial grammars. Ph.D. diss.,
input sentence, but it would systematically give the sentence an Universiteit Utrecht.
incorrect structural analysis, for example. If this happened, then Frank, R., and S. Kapur. 1996. On the use of triggers in parameter set-
the set of languages deined by that parameter space would not ting. Linguistic Inquiry 27: 62360.
be learnable relative to a simple learning device with positive- Gibson, E., and K. Wexler. 1994. Triggers. Linguistic Inquiry 25: 40754.
only evidence. Gold, E. M. 1967. Language identiication in the limit. Information and
In response to this problem, a number of diferent algorithms Control 10: 44774.
Kanazawa, M. 1998. Learnable Classes of Categorial Grammars. Stanford,
were proposed. R. Clark (1992) proposed using a kind of artiicial
CA: CSLI Publications.
evolution to converge to the target. A population of grammars
Kapur, S. 1991. Computational learning of languages. Ph.D. hesis,
would be exposed to the input text, with the best performers Cornell University.
allowed to combine and produce ofspring that had inherited Manning, C. D., and H. Schutze. 1999. Foundations of Statistical Natural
properties from the parent grammars. his approach uses the Language Processing. Cambridge, MA: MIT Press.
parallelism implicit in a population to avoid the problem of local Niyogi, P. 2006. he Computational Nature of Language Learning and
maxima. his approach to learning falls into the class of probably Evolution. Cambridge, MA: MIT Press.
approximately correct (or PAC) learning; in this framework, the Osherson, D., M. Stob, and S. Weinstein. 1986. Systems hat Learn: An
learner is guaranteed to converge within a margin of error, where Introduction to Learning heory for Cognitive and Computer Scientists.
the margin of error can be made arbitrarily small, but not zero. Cambridge, MA: MIT Press.
Clark and I. Roberts (1993) extended this work to try to account Pinker, S. 1984. Language Learnability and Language Development.
for language change. P. Niyogi (2006) has developed a more
Tesar, B., and P. Smolensky. 2000. Learnability in Optimality heory.
sophisticated computational approach to this problem (see also
Yang 2002). Wexler, K., and P. Culicover. 1980. Formal Principles of Language
E. Gibson and Wexler (1994) tried to develop a learner that Acquisition. Cambridge, MA: MIT Press.
used an algorithm that tests to see if resetting a parameter to Yang, C. D. 2002. Knowledge and Learning in Natural Language.
a new value actual improves the learners performance on the Oxford: Oxford University Press.
input example. In order to avoid local maxima, they proposed
ordering the parameters according to a maturational sequence.
Readers should consult Frank and Kapur (1996) for an extensive
LEFT HEMISPHERE LANGUAGE PROCESSING
critique. S. Kapur (1991) developed a learning algorithm that Some History
avoids local maxima by using a statistical model of indirect nega- he left hemisphere (LH) has been considered to be the primary
tive evidence. his algorithm is, once again, clearly within the locus of language-speciic processing for centuries. We now
PAC learners. know that the right hemisphere (RH) has considerable lan-
Others have proposed using triggering evidence to set param- guage abilities and should no longer be considered the minor
eters (Dresher and Kaye 1990). On this model, each value of a hemisphere. Recent research has demonstrated that both the left
parameter would be associated with the abstract description of and right hemispheres contribute to varying aspects of language
a piece of triggering evidence that would cause the parameter processing (Beeman and Chiarello 1997). Even so, historical and
to be set to that value. he learner would scan the input text, current work still regards the left hemisphere as having a primary
searching for examples that matched the description of a trigger; and signiicant role in language processing.
428
Left Hemisphere Language Processing
One of the irst functional accounts of brain-language rela- here is evidence that this exhaustive access is controlled imme-
tions geared to the LH, the Wernicke-Lichteim model, separated diately by the LH (Burgess and Simpson 1988). Soon after, a
language into activities such as listening, reading, writing, and lexical choice is made on the basis of frequency and context.
speaking. Based on classic lesion localization eforts, these activ- Evidence from aphasia also supports the role of the left anterior
ities were thought to be localized in diferent LH brain regions. frontal cortex in lexical access. For example, individuals with
Geschwinds (1965) model proposed that brocas area a Brocas aphasia appear to show a slow rise time of the initial
region located in the left inferior frontal gyrus (LIFG) at the foot activation of multiple meanings, while those with Wernickes
of the motor strip near regions controlling mouth and tongue aphasia evince normal patterns (Prather et al. 1991). Other acti-
movements was the seat of speech production, while vation accounts suggest that individuals with Brocas aphasia
wernickes area in the posterior superior temporal regions (with damage to LIFG) underactivate lexical forms, while those
adjacent to the primary auditory cortex was the seat of audi- with Wernickes aphasia (with damage to STG) overactivate
tory comprehension. While this simple model was appealing, it (Blumstein and Milberg 2000).
has been clear since at least the 1970s that this view of language he role of LIFG in real-time processing also appears to extend
is likely inaccurate. For example, individuals diagnosed with to the comprehension of sentences that contain displaced argu-
Brocas aphasia have auditory comprehension deicits that are ments or those with iller-gap dependencies, for example, in
exposed on simple experimental probing and have production object relative (OR) constructions (e.g., he audience liked the
deicits that go well beyond those described by luency measures wrestler that the priest condemned *____ for foul language) where
(Zurif and Caramazza 1976; Friedmann 2006). a direct object argument or iller (e.g., wrestler) has been dis-
Beginning in the 1970s, eforts were made to characterize lan- placed from its canonical, post-verb position or gap (noted by *).
guage in the LH by reference to linguistic levels of analysis along Individuals with Brocas aphasia do not activate the iller at the
the lines of syntax, semantics, and phonology. he LIFG gap in real time, unlike what is observed for neurologically intact
has been suggested to be critical for syntax, while the temporal individuals and those with Wernickes aphasia or RH lesions
lobe has been suggested to be important for the normal func- (Swinney et al. 1996). hus, a real-time processing deicit may
tioning of word-level semantics. Moreover, the posterior supe- underlie the inability for Brocas individuals to ultimately com-
rior temporal gyrus (pSTG) has been suggested to play a critical prehend these constructions when they are probed with simple
role in phonology. Even within these linguistic divisions, eforts sentence-picture matching tasks or grammaticality judgments.
have been made to discover exquisitely detailed neurological
instantiations. he trace deletion hypothesis (Grodzinsky 2006), Variability
for example, has taken a minimalist position on syntax-brain rela- Much of the work detailing the role of the LH in language pro-
tions (see syntax, neurobiology of) and has suggested that cessing has been based on descriptions of neuroanatomy con-
only sentence constructions that are derived from the displace- ducted in the later third of the nineteenth and early twentieth
ment of an argument (e.g., a noun phrase) and that yield a trace centuries. K. Brodmann (1909) suggested that the most func-
(see also movement) rely on an intact Brocas area, and that tionally relevant parcellation of the brain is by cytoarchitecton-
other hypothesized aspects of syntax rely on more widely distrib- ics (cellular composition), but the map of Brodmanns Areas
uted anatomical regions. Other accounts of the relation between is based on manually drawn borders of a single brain. K. Amunts
syntax and the brain suggest that only those constructions that and colleagues (1999) examined 10 postmortem brains, and the
are deined as complex rely on an intact LIFG. Alternative theories borders for each brain were automatically drawn and super-
suggest that syntax, broadly deined, requires a neuroanatomical imposed on a template to produce a group cytoarchitectonic
language network consisting of Brocas region as well as STG, the map. Large intersubject variability was uncovered, perhaps par-
middle temporal gyrus (MTG), and the white matter iber tracks tially explaining why so much variability exists in the mapping
(arcuate fasciculus) connecting these regions. between behavior and anatomy from both lesion and functional
imaging studies.
A Real-Time Perspective Another possible contributor to intersubject variability is the
he most current approach to brain-language relations as we assumption of dead tissue only in and around the lesion. For
near the end of the irst decade of the twenty-irst century is for- many years, investigators have assumed that structural lesions
mulated in terms of processing metaphors such as activation (and the ischemic penumbra surrounding the lesion) were the
and maintenance that require a real-time analysis. It has been primary loci contributing to language deicits. With the advent
suggested that within the LH, Brocas area is required for fast- of more reined neuroimaging technology, such as perfusion
acting, relatively automatic and relexive processing routines; weighted and difusion tensor imaging, researchers have been
more frontal areas are critical for executive functions underly- investigating areas of the brain that are found to be structurally
ing language processing, including selecting among alternatives; intact yet not receiving an optimal supply of blood low. hese
left posterior temporal areas seem important for activating and hypoperfused regions give way to functional lesions inside seem-
maintaining argument structure, an aspect of lexical-semantics ingly intact neural tissue (Hillis 2007; Love et al. 2002).
or conceptual structure.
Take, for example, the activation of meaning-related word The Role of Functional Neuroimaging in the Investigation
forms during sentence comprehension. It has been argued that of Left Hemisphere Language
unimpaired individuals initially activate multiple meanings of Lesion studies alone must be interpreted with caution as these
ambiguous words, regardless of the context of the sentence. can only provide information regarding a speciic (i.e., the
429
Left Hemisphere Language Processing
damaged) neural regions necessity to perform a particular that are observed, may help illuminate brain-behavior map-
language task. Functional neuroimaging patterns, on the other ping in both the left and the right hemispheres (e.g., hompson
hand, describe the level of recruitment of speciic area(s), not and Shapiro 2007).
the necessity of only the lesioned area for the process itself (Hillis
Tracy Love and Lewis P. Shapiro
2007). It is the fusion of these and other methodologies that best
aides in the understanding and modeling of the brain basis of,
and networks involved in, language processing. WORKS CITED AND SUGGESTIONS FOR FURTHER READING
Neuroimaging research has demonstrated that the LH is par- Amunts, K., Axel Schleicher, U. Brgel, Hartmut Mohlberg, Harry Uylings,
ticularly well suited for language processing, regardless of the and Karl Zilles. 1999. Brocas region revisited: Cytoarchitecture
modality of language input (auditory or visual, as is found in and intersubject variability. Journal of Comparative Neurology 412
sign languages; Hickok, Love-Gefen, and Klima 2002). he (August): 31941.
literature has demonstrated a LH bias for the neural circuitry Beeman, M., and C. Chiarello, eds. 1997. Right Hemisphere Language
Comprehension: Perspectives from Cognitive Neuroscience. Hillsdale,
involved in the processing of complex versus simple sentence
constructions. More speciically, it has been argued that there
Blumstein, S., and W. Milberg. 2000. Comprehension in Brocas and
exists active recruitment of BA 44 and BA 45 (pars opercularis
Wernickes aphasia: Singular impairment. In Language and the
[Stromswold et al. 1996] and pars triangularis [Caplan and Brain, ed. Y. Grodzinsky, L. P. Shapiro and D. Swinney, 16783. San
Waters 1999]) of the LH during the parsing of complex sentence Diego: Academic Press.
constructions (e.g., the iller-gap constructions described ear- Brodmann, K. 1909. Vergleichende Lokalisationslehre der Grohirnrinde
lier). Yet other reports examining sentence comprehension have in ihren Prinzipien dargestellt auf Grund des Zellenbaues. Leipzig:
found anterior temporal cortex activation, including STG and Barth JA.
MTG (e.g., Humphries et al. 2005; Stowe et al. 1999). It is quite Burgess, C., and G. Simpson. 1988. Cerebral hemispheric mechanisms
likely that the discrepancies found in the imaging literature are in the retrieval of ambiguous word meanings. Brain and Language 33
due to varying methods of presentation, and difering behavioral (March): 86103.
Caplan, D., and G. Waters. 1999. Verbal working memory capacity and
requirements of the participants, as well as experimental design
language comprehension. Behavioral and Brain Sciences 22.1: 7794.
issues and analysis procedures.
Friederici, A. 2002. Towards a neural basis of auditory sentence
processing. TRENDS in Cognitive Sciences 6.2: 7884.
The Integration of Language Processing in the Left Friedmann, N. 2006. Speech production in Brocas agrammatic
Hemisphere aphasia: Syntactic tree pruning. In Brocas Region, ed. Y. Grodzinsky
Work from multiple tasks and methodological techniques have and K, Amunts, 6382. New York: Oxford University Press.
been integrated to form the basis of neurocognitive models of Geschwind, N. 1965. Disconnection syndromes in animals and man.
language processing. hese models capture the choreographed Brain 88: 585644.
workings of neural regions during language processing. One Grodzinsky, Y. 2006. A blueprint for a brain map of syntax. In Brocas
Region, ed. Y. Grodzinsky and K. Amunts, 83107. New York: Oxford
such model posited by A. Friederici (2002) argues for an LH
University Press.
biased temporofrontal network. According to this model, iden-
Heiss, W., J. Kessler, A. hiel, et al. 1999. Diferential capacity of left
tiication of a word into a grammatical category (e.g., noun, verb, and right hemispheric areas for compensation of poststroke aphasia.
determiner, etc.) begins at about 200 milliseconds after the word Annals of Neurology 45.4: 4308.
is encountered in the speech stream, localized in the regions sur- Hickok, G., T. Love-Gefen, and E. Klima. 2002. Left temporal lobe
rounding the anterior superior temporal cortex. At this temporal supports sign language comprehension. Brain and Language
point, such grammatical categories are placed into a hierarchical 82.2: 16778.
syntactic form, which relies on the regions surrounding Brocas Hillis, A. 2007. Magnetic resonance perfusion imaging in the study of
area. At about 300500 milliseconds after a word is encountered, language. Brain and Language 102.2: 16575.
the lexical entry is accessed, which allows for subsequent syntac- Humphries, C. , T. Love, D. Swinney, and G. Hickok. 2005. Response
of anterior temporal cortex to syntactic and prosodic manipu-
tic integration via Brocas area and semantic integration via
lations during sentence processing. Human Brain Mapping
temporal lobe regions.
26.2 : 12838.
Kinsbourne, M. 1971. he minor cerebral hemisphere as a source of
Translational Research aphasic speech. Archives of Neurology 25.4: 3026.
Finally, the investigation of language processing in the LH has Love, T., D. Swinney, E. Wong, and R. Buxton. 2002. Perfusion imaging
yielded a growing enterprise devoted to mapping recovery of and stroke: A more sensitive measure of the brain bases of cognitive
function in aphasia following brain damage. Converging evi- deicits. Aphasiology 16.9: 87383.
dence from clinical studies along with functional neuroimag- Prather, P., L. Shapiro, E. Zurif, and D. Swinney. 1991. Real-time exami-
nations of lexical processing in aphasics. Journal of Psycholinguistic
ing studies have demonstrated that, depending on individual
Research 20.4: 27181.
factors such as size and extent of lesion, premorbid handed-
Stowe, L., A. Paans, A. Wijers, F., et al. 1999. Sentence comprehension
ness, and so on, recovery of language function may include the
and word repetition: A positron emission tomography investigation.
undamaged regions of the LH language processing network or Psychophysiology 36.6: 786801.
homologous RH regions (Heiss et al. 1999; Kinsbourne 1971). Stromswold, K., D. Caplan, N. Alpert, and S. Rauch. 1996. Localization
Treatment of language disorders for individuals with brain of syntactic comprehension by positron emission tomography. Brain
damage, and the subsequent behavioral and neural changes and Language 52.3: 45273.
430
Legal Interpretation
Swinney, D., E. Zurif, P. Prather, and T. Love. 1996. Neurological dis- holding to follow deductively from some set of higher or more
tribution of processing operations underlying language comprehen- general principles. he answer to the present problem then could
sion. Journal of Cognitive Neuroscience 8.2: 17484. be deduced from those principles. he deductive process incor-
hompson, C., and L. P. Shapiro. 2007. Complexity in treatment of porates an efort to ensure that the application of a precedent in
sentence deicits in aphasia. American Journal of Speech-Language
the case at hand cohere with the entire set of abstract principles
Pathology 16: 3042.
that, taken together, constitute the common law. (Interpretation
Zurif, E., and A. Caramazza. 1976. Psycholinguistic structures in
aphasia: Studies in syntax and semantics. In Studies in Neurolinguistics,
of the codes in civil law systems is said to follow the same model,
ed. N. Avakian-Whitaker and H. Whitaker, 26092. New York: Academic except that the general principles are found in the codes provi-
Press. sions, rather than extracted by interpretation from prior decided
cases.)
he formalist approach to precedent was subjected to with-
LEGAL INTERPRETATION
ering criticism in the twentieth century, primarily by the rule
he judicial practice of legal interpretation provides the model skeptics associated with American legal realism (Rumble 1968).
for other versions of legal interpretation, such as the interpreta- hey argued, and to much of the legal community demonstrated,
tion of law by practicing lawyers. Interpretation of three types that the purported deductions never satisied minimal standards
of texts prior judicial decisions (which serve as precedents for of deductive reasoning. he legal realists argued that what courts
a present interpretive exercise), statutes, and constitutions actually did was to interpret precedents with an eye to the poli-
provides the model for interpretation of other texts, such as cies advanced by the rules articulated by the courts: he present
regulations and treaties. It has a temporal and an institutional problem would be resolved by determining how the policies
dimension. Decision makers interpret texts written earlier, embodied in the rules articulated in the precedents would best be
sometimes substantially earlier, and produced by institutions advanced. In the nineteenth century, for example, courts barred
diferent from the ones engaged in the interpretive enterprise an employee injured by the negligence of another employee
(if only because of changes in personnel). hese characteristics from recovering damages from their joint employer, in part
generate some problems common to all three forms of legal because the injured employee was said to be in a good position
interpretation. to notice whether the other employee was a careful worker. Later
courts had to decide whether that policy was applicable where
Case Law and the Interpretation of Precedents the negligent worker labored in a diferent department, or was
Prior judicial decisions play an important part in every system the injured workers supervisor. Critics of the strongest versions
of legal interpretation. Civil law systems, such as those in France of legal realism wondered why the policy-oriented approach
and Germany, are committed to a legal ideology in which judges should be described as involving interpretation at all. Policy-
revert to statutory texts directly, without reference to any prior oriented decisions, they argued, were entirely forward looking,
judicial decisions. Judicial decisions in such systems rarely refer and the precedents did no more than provide a convenient heu-
to prior decisions, but even in such systems, precedent plays an ristic to guide thinking about the present problem.
important role before courts issue their opinions (Lasser 2004). he temporal dimension of legal interpretation is apparent
he common law system of unwritten law, the foundation of on the surface when courts interpret prior decisions. he insti-
law in Great Britain and the United States, was developed by the tutional dimension is revealed when we ask, particularly of the
courts themselves to regulate large portions of the law of prop- policy-oriented interpreter, Why should a court today give any
erty, contracts, and torts (accidents, among other topics). he weight to the rules articulated by courts in the past? Answers
common law is unwritten only in the sense that the texts on vary, but most combine a Burkean ideal that judges today should
which it is based are prior judicial decisions, rather than legis- not be overly conident that they know better than their prede-
lative enactments. cessors what good policies are, with a pragmatic sense that some
Interpreting a judicial decision in order to apply it to a new degree of reliance on prior decisions conserves judicial efort.
problem involves several analytic steps. Typically, a decision will
describe a cases facts and articulate several legal rules that the Statutory Interpretation
court says lead it to its conclusion. he interpreter later must dis- Questions of statutory interpretation as such arise only when
tinguish the decisions holding from any obiter dicta found in the someone an enforcement oicial or a judge, for example has
decision. On standard accounts, the holding is the rule or rules some question about what a statutes terms mean. Where statu-
necessary to support the conclusion, dicta any rule or rules that tory terms are thought to be unambiguous, oicials simply apply
could be eliminated from the courts discussion without altering the statutes, an operation that to them seems preinterpretive
its conclusion (Marshall 1997). Later courts do not always disre- (see also philology and hermeneutics). Application rather
gard dicta, however, sometimes inding that they provide useful, than interpretation is likely to be more common soon after a
though not binding, guidance. statutes adoption, because people will generally be familiar with
he prior decisions holding, once identiied, must be applied what the statutes enactors were trying to do unless, as happens
to the new problem. Again typically, that problem will difer in with some frequency, the adopters deliberately left speciic pro-
some respects from that presented by the precedent. Courts apply visions in the new statute unclear.
the precedents rule in two ways. For much of the nineteenth here are three prominent approaches to statutory interpre-
century, and to some extent today, courts applied precedents tation in the United States, with parallels in other legal systems.
formalistically. hey took the rule supporting the precedents (For an overview of the contemporary discussion in the United
431
States, see Vermeule 2006.) he textualist approach interprets a singular when discussing the institutions that adopt statutes, but
statutes terms by asking what the words would mean to an ordi- it cannot be avoided so easily.
nary reader (usually, a reader at the time the statute was enacted) he inal prominent approach, usually called purposivism,
who is reasonably well informed about the meaning of the tech- implicitly shifts the focus from the enacting legislature to the
nical terms and about the entire statutory environment within interpreter. In a classic formulation, the purposivist assumes
which the contested term is located. that the legislature was composed of reasonable people seeking
Textualism is a rather bare-bones interpretive approach, to pursue reasonable purposes reasonably. Ambiguous statu-
which to its critics requires the interpreter to ignore real and tory terms are to be interpreted so that the goals imputed to the
accurate information about what a statute is designed to do. legislature are most likely to be achieved. Purposivism avoids
Proponents of textualism often claim more clarity for the out- most of the problems associated with intentionalism, because,
comes they reach than there actually is. Where ambiguity per- although its proponents ordinarily refer to the legislatures
sists after considering the sources to which textualists limit purposes, they are not truly concerned with a real institution
themselves, some other basis is needed for resolving the contro- stafed by real people. Rather, purposivists construct an ideal-
versy. he most prominent candidate emerges from the assump- ized legislature to which they impute purposes that they then
tion that ambiguous legislation should not be taken to disturb seek to implement. Yet purposivism typically lacks an account
the status quo. his assumption is sometimes expressed as a of whether the interpreter should posit abstract or more con-
canon of construction that ambiguous statutes should not be crete purposes.
construed to change the common law (that is, the background Purposivism makes the institutional dimension of legal inter-
rules that would apply if the legislature took no action). An alter- pretation clear. It allocates efective decision-making authority to
native defense of textualism is comparative: that it reaches better courts, at least once the legislature does something that licenses
outcomes overall than alternative approaches that ask interpret- the courts to engage in the interpretive enterprise. Its proponents
ers to assess information that they are not well equipped to han- believe that purposivism contributes to the good functioning of
dle, even though in particular cases the use of one or the other the government overall, as courts and legislatures collaborate in
approaches might produce a better result than textualism. accomplishing good for the society. Critics respond with some
he intentionalist approach shifts the focus from the reason- skepticism about the very idea of the public good as something
able reader to the enacting legislature. It asks what the legisla- independent of the choices made by legislatures, and with the
ture intended to accomplish by enacting the statute. In its least observation that what the purposivists are doing cannot fairly be
controversial version, the intentionalist approach directs the described as interpretation. Rather, they suggest, the judges are
interpreters attention to the problem the statute was designed reading into the law their own policy preferences and then attrib-
to solve, producing an interpretation that, in the judges view, uting those purposes to the statute.
solves the problem as well as possible within the bounds set by Statutory interpretation also involves the use of canons of
the statutes words as reasonably understood. Intentionalists in statutory construction, which might be thought as well to con-
the United States, more than in the United Kingdom, are will- strain the judges power to interpret statutes merely to advance
ing to consult documents produced as the statute proceeded their policy preferences. One example is the rule of lenity,
through the enactment process (the statutes legislative his- according to which criminal statutes should be construed where
tory), such as reports by the committees that considered the fairly possible to limit the scope of criminal liability. Another
legislation and statements by the statutes supporters and oppo- example, in legal systems with some form of judicial review for
nents, to determine what the legislature meant by the terms it constitutionality, is the canon that statutes should be construed,
used. again where fairly possible, to make them consistent with the
Textualists criticize these more expansive versions of inten- constitution or basic human rights. Scholars divide canons of
tionalism. Most narrowly, they note that materials drawn from construction into two groups, substantive and legislative-intent
the legislative history are readily deployed strategically by advo- canons. Substantive canons embody policies that courts seek
cates, who present only the materials that support the interpreta- to pursue independent of what legislatures actually sought
tion that will yield the result they favor, and selectively by judges, to accomplish in enacting particular statutes. he rule of len-
who refer only to those parts of the legislative history that favor ity and the rule that statutes should be construed to limit their
that result the judges prefer for reasons independent of the inter- impact on background law are examples. Legislative-intent
pretive enterprise. Critics also observe that referring to legisla- canons are rebuttable presumptions about what legislatures
tive history gives some degree of authority to committees and seek to accomplish in enacting particular statutes. he canon
individual members, whereas only the entire legislature has any dealing with avoiding constitutional questions can be justiied
authority to enact law. Finally, critics question the coherence of on the ground that courts should not assume that legislators
invoking intentionalist terms with respect to multimember legis- sought to enact unconstitutional statutes. Karl Llewellyn ofered
lative bodies. Some legislators might have favored the adoption a classic critique of canons of interpretation, suggesting that
of a statutory provision because they thought that it solved an for each thrust built into one canon, there was a parry from
important public policy problem, others because their constit- another equally well-established canon of statutory interpreta-
uents favored it, still others because important contributors to tion (Llewellyn 1950). So, for example, the canon Every word
their campaigns did so. How can these varying states of mind be and clause must be given efect was parried by the canon If
aggregated into an intent of the legislature? Continental legal inadvertently inserted or if repugnant to the rest of the statute,
theorists elide this question by referring to the legislator in the they may be rejected as surplusage.
432
Canons of interpretation can it into each of the interpretive One can rely on concrete intentions to rule out the possibility
approaches. For the textualist, the canons are part of the general that a practice they understood to be constitutionally permis-
background the ordinary reader is assumed to know as he or she sible would later be found to be constitutionally impermissible,
reads a contested statutory text. he intentionalist can defend the but even then the reliance on concrete intentions or understand-
substantive canons on the ground that legislatures typically do ings requires a defense that goes outside the terms set by origi-
not intend to infringe on the policy goals embodied in substan- nalism itself. An alternative, similar to that ofered by canons
tive canons. he purposivist has an easy time with substantive of statutory interpretation, is to hold a practice constitutionally
canons, which directly embody judgments about good policy, permissible unless it is clearly precluded by the constitution
and can treat the other canons as similarly relecting good policy as originally understood. he justiication for this alternative is
judgments, rather than imputations of legislative intent. institutional: he constitution of a liberal democracy taken as a
whole should be understood to commit decision-making author-
Constitutional Interpretation ity to democratically elected legislatures, unless the constitution
Questions of statutory interpretation typically involve speciic clearly takes that authority away from the legislatures and gives
and detailed provisions of complex statutes. Constitutional it to the courts.
interpretation, in contrast, typically involves the application of he second family has no standard name, but probably can be
general and abstract constitutional terms, such as freedom of best described as including varieties of perfectionism. According
speech, due process of law, and equal protection of the laws to this group of approaches, general and abstract constitutional
(to take examples from the U.S. Constitution) to speciic prob- provisions should be interpreted in accordance with some over-
lems. Here, too, there are two primary families of approaches. arching principles of good government and individual liberty.
he irst family includes varieties of originalism. One ver- hese principles can be relatively modest, as in a commitment
sion holds that constitutional provisions should be interpreted to democratic self-governance (Ely 1980), or more robust, as in
to conform to the intent of the constitutions adopters. As with a commitment to justice broadly understood (Dworkin 1996).
intentionalism in statutory interpretation, original-intent Germanys constitutional court inds the perfectionist approach
approaches run into many diiculties, such as the problem of to interpretation embodied in its constitutions commitment
aggregating individuals intentions. In addition, when, as with to what it calls a basic order of values. (he Muslim con-
the U.S. Constitution, major provisions were adopted two centu- cept of ijtiha-d might be thought to have a similar underlying
ries earlier, the task of identifying what any particular individual structure.)
understood a provision to mean is extremely diicult. Perfectionist approaches to constitutional interpretation
Finally, the abstract terms that constitutions use pose an addi- resemble purposivist approaches to statutory interpretation.
tional problem: Should the provisions be interpreted according Interpreters, it appears, are to rely on their own best understand-
to the abstract or the concrete understandings of their adopters? ing of what the basic order of values is. his, some believe, is
Ronald Dworkin uses the term concept to describe the abstract inconsistent with democratic self-governance because it allows
understanding, conception the concrete one (Dworkin, 1977). judges to substitute their judgments about what justice or equal-
Consider, for example, a constitutional provision dealing with ity requires for the judgments made by elected representatives.
equality. Does that provision enact into fundamental law the Critics suggest, in this setting as well, that what perfectionists
particular understandings the adopters had about equality, such do cannot be called interpretation. he sting of that observation
as the understanding that laws could treat men and women dif- might be reduced by responding that a constitutions text as such
ferently while still providing equality, or does it enact equality has no authority anyway; only the long-standing practices that
itself, that is, the best understanding an interpreter can devise at people have come to accept have authority, and perfectionist
the moment of interpretation? practices have been widely accepted for many years.
Some proponents of originalism responded to these and Another response, suggested by Stanley Fish (1994), is that
other problems with an original-intent approach by arguing that perfectionist interpretation is not as unconstrained as its critics
constitutional provisions should be interpreted according to the think. Judges are part of an interpretive community whose
original public meaning of their terms. he interpreter should shared understandings place signiicant limits on what even the
identify what the terms meant to a reasonable and well-informed most willful judge will take as a responsible interpretation of a
member of the public when the provisions were adopted. his constitutional provision. One important version of this view
does not completely eliminate the evidentiary problems associ- describes a common law constitution, in which what judges
ated with original-intent approaches, because one is still search- interpret is not primarily the constitution as written but the prior
ing for what the terms meant to individuals, but it substantially decisions interpreting the constitution (Strauss 1996). Here,
expands the range of relevant materials to include uses of the constitutional interpretation reproduces common law interpre-
terms in newspaper discourse and the like. Similarly, it shifts ter- tation. It is worth noting that these two defenses of perfectionist
minology about abstract versus concrete intentions to references interpretation do not preclude interpretations that are norma-
to abstract versus concrete public meanings. tively unattractive, if the people or the interpretive community
he temporal problem is perhaps the most diicult one fac- settle on unattractive practices.
ing originalist approaches. he problem is that the adopters or Consistent with the idea that judges are members of inter-
the general public years ago could have no intentions about, or pretive communities, judges in diferent nations take diferent
understanding of, what the terms they used meant, in connec- approaches to interpreting their constitutions (Goldsworthy
tion with developments they did not and could not anticipate. 2006). he practice in the United States is quite eclectic, with
433
Lexical Acquisition
judges using originalism and perfectionism relatively unsystem- partonomy, as well as with the meanings of verbs, adjectives,
atically; the practice in Germany is more perfectionist, and that and prepositions. Or, alternatively, they have argued that chil-
in Australia is formalist. dren rely on many of the same pragmatic principles as adults
Legal theorists have recurrently been attracted to the idea in making inferences in context about possible meanings. Under
that law, and legal interpretation, could become a science. he this view, childrens initial inferences are limited only by what
direct invocation of sciences, such as linguistics, psychology, and they know and the words they have already acquired (Bloom
more recently neuroscience, to understand legal interpretation 2000; Clark 1993).
has produced relatively little enlightenment, to the point where How do children assign some meaning to an unfamiliar
it seems more likely than not that whatever science of law there word? Once adult and child are both attending to the same
might eventually be, it will not be a science on the model of the object or action, for example, the child can infer that that object
physical or biological sciences. or actionat their locus of joint attention in the physical con-
textis the adults intended referent. hat is, the child draws on
Mark Tushnet
both physical and conversational context in assigning meanings
to unfamiliar words, regardless of word class (Clark 2009).
Moreover, once an object or action has been labeled, the child
Dworkin, Ronald. 1996. Freedoms Law: he Moral Reading of the can often infer that subsequent utterances are also relevant to
American Constitution. Cambridge: Harvard University Press. the newly identiied referent. And these utterances, in turn, may
. 1977. Taking Rights Seriously. Cambridge: Harvard University supply added information about properties (size, texture; man-
Press. ner of motion), relations (role of the object as agent, location, or
Ely, John Hart. 1980. Democracy and Distrust: A heory of Judicial Review.
entity-afected, say; see thematic roles), function (common
uses, use on that occasion), and so on. he inferences children
Fish, Stanley E. 1994. heres No Such hing as Free Speech: and Its a
make about meanings are guided by adult usage (a way of ind-
Cood hing, Too. New York: Oxford University Press.
Goldsworthy, Jef. 2006. Interpreting Constitutions: A Comparative Study. ing out the conventional way to designate each category) and by
New York: Oxford University Press. the fact that new words must contrast in meaning with whatever
Lasser, Mitchel de S.-O.-LE. 2004. Judicial Deliberations: A Comparative vocabulary they already know.
Analysis of Judicial Transparency and Legitimacy. New York: Oxford As children acquire more vocabulary, they build up seman-
University Press. tic domains words for food, clothing, cars, animals; types of
Llewellyn, Karl. 1950. Remarks on the theory of appellate decision and motion and location; and relations in space, for example and
the rules or canons about how statutes are to be construed. Vanderbilt they organize and reorganize each domain as they add new
Law Review 3: 395406. members. Members of a domain are typically linked by semantic
Marshall, Geofrey. 1997. What is binding in a precedent. In Interpreting
relations like X is a kind of Y, Z is part of A, B is made of C,
Precedents: A Comparative Study, ed. Neil MacCormick and Robert S.
or D is used for E. But not all relations hold in every domain.
Summers, 50317. Brookield, VT.: Ashgate/Dartmouth.
Rumble, Wilfrid E. 1968. American Legal Realism: Skepticism, Reform,
hey depend on the meanings of individual lexical items. Among
and the Judicial Process. Ithaca, NY.: Cornell University Press. verbs, for instance, the relations typically include their argu-
Strauss, David A. 1996. Common law constitutional interpretation. ment roles. A locative verb like put, for instance, is accompanied
University of Chicago Law Review 63: 877935. by three arguments an agent, an object, and a location, as in
Vermeule, Adrian. 2006. Judging Under Uncertainty: An Institutional Miranda put the cup on the shelf. But a verb of motion like run
heory of Legal Interpretation. Cambridge: Harvard University Press. requires only one argument role: the doer or agent, as in Robert
ran fast (Clark 2003).
Building up each semantic domain also involves identifying
LEXICAL ACQUISITION
words that co-occur (dogs bark, but horses neigh), and common
Children start to produce their irst recognizable words between collocations (compare disappearing ink and vanishing cream). It
12 and 18 months of age, and typically understand more than requires working out the semantic relations that link such terms
they produce. his asymmetry is a lifetime efect. he forms of as tiger, predator, and mammal, or tree, aspen, and gingko, on
their earliest words often depart radically from the adult versions the one hand, and throw, toss, twirl or break, tear, and cut, on the
(consider ga for squirrel) and may be hard to understand. But other. It also requires that children learn the terms for parts and
as their pronunciation becomes more skilled (see phonology, wholes (thumb, inger, hand), for groups (lock, pod, herd; crowd,
acquisition of), children add rapidly to the vocabulary at their reunion, meeting), for complex events (circus, opera, play), for
disposal. hey add words for people, animals, everyday objects, cycles (days of the week, months of the year), for relations (in,
toys, food, and various activities and by age two generally pro- above, behind; before, after; if, because), abstractions (justice,
duce between 200 and 800 distinct words. equality, goodness), and much more.
Researchers have taken one of two main approaches to the Learning words is also the irst step in learning constructions.
study of word acquisition in the last two decades: With the irst Many constructions are linked initially to speciic verbs, and only
approach, they have postulated built-in constraints that would later extended to others that can take the same construction.
limit the hypotheses children entertain about possible mean- Children may learn want irst with a direct object, as in I want
ings, typically limited to noun meanings only (Markman 1989), that. hen they start to use nouns in place of demonstrative that
constraints that must later be overridden since they are incom- (I want the ball, I want a spoon), and only sometime later do they
patible with semantic relations, such as inclusion, overlap, and start to use want with a to-complement, as in I want to go out.
434
Lexical-Functional Grammar
hey take even longer to add a subject to the complement, as in I A crucial underpinning idea is that any meaningful linguistic
want Anna to come. Constructions often appear to be built up on element has associated with it diferent types of linguistic infor-
single lexical items on a one-by-one basis. his takes time (Clark mation, for instance, information about prosodic structure, about
and Kelly 2006; Tomasello 2003). category and constituent structure (see phrase structure),
Finally, adult usage plays a crucial role in acquisition. about grammatical relations (also referred to as functions), and
Children track the frequencies of constructions in parental about semantic structure (see semantics). It is furthermore
speech and acquire irst those constructions that occur most assumed that the organizational principles for these dimensions
often. It is adult speakers who model word use, who ofer chil- may vary and that the formalisms used to represent the diferent
dren conventional terms for talking about types of objects, types of information should capture this variation. he informa-
activities, and relations. And it is adults who continually check tion is represented in diferent dimensions, for instance c-struc-
up with young children on just what meanings the children ture (for category and constituent), f-structure (for functional),
intended to convey. a-structure (for argument structure), and i-structure (for infor-
mation structure). Each dimension operates with its own funda-
Eve V. Clark
mental categories and principles. he diferent dimensions are
related through mapping principles.
C-structure is represented in terms of a version of x-bar syn-
Bloom, Paul. 2000. How Children Learn the Meanings of Words. tax, employing both lexical categories, such as noun, adjective,
Cambridge, MA: MIT Press. his book reviews how meaning acquisi- verb, and preposition, and functional categories, such as comple-
tion is linked to speaker intentions and theory of mind. mentizer, inlection, and determiner. Quite a restrictive approach
Clark, Eve V. 1993. he Lexicon in Acquisition. Cambridge: Cambridge
to functional categories tends to be taken; they are used for ele-
University Press. his book examines childrens word learning and
ments expressing crucial functional features whose distribution
their ability to coin words to ill gaps in their current vocabulary.
. 2009. First Language Acquisition. 2nd ed. Cambridge: Cambridge
is limited to a certain position within a phrase. Hence, a category
University Press. A review of irst language acquisition, and how cogni- such as I, assumed in some transformational theories to form a
tive and social factors interact in a usage-based approach. part of every clause in every language, is motivated within LFG
Clark, Eve V., and Barbara F. Kelly, eds. 2006. Constructions in Acquisition. for languages where an inlected verbal element occupies a par-
Stanford, CA: CSLI. his book reports on studies of how children ticular structural position. One example would be verb second
acquire constructions. languages, where the inite verb occurs in the second position in
Markman, Ellen M. 1989. Categorization and Naming in Children. the clause and, hence, initeness can be associated with this posi-
Cambridge, MA: MIT. his book presents a constraints-based account tion. his can then be captured through a functional category IP,
of lexical acquisition. headed by the inite verb in I and the initial element placed in the
Tomasello, Michael. 2003. Constructing a Language. Cambridge: Harvard
specifier position. In English, inite auxiliaries have properties
University Press. his book presents a usage-based approach to early
that motivate the use of a functional category I (see Dalrymple
syntactic acquisition (see syntax, acquisition of).
2001, 534).
A principle of economy of expression applies to constituent
structure to yield trees that look rather unorthodox from a stan-
LEXICAL-FUNCTIONAL GRAMMAR
dard X-bar perspective. his principle states that any constituent
Lexical functional grammar (LFG) is what is known as a con- introduced by a rule is optional unless some separate principle
straint-based parallel correspondence architecture for a theory of requires its presence. his can be illustrated by the analysis of
language (Bresnan 1982, 2001; Dalrymple 2001). It was called lexi- inite auxiliary verbs in English. he category I, to which these
cal because certain relations between elements, like that between verbs belong, is assumed to be introduced by a rule I I VP;
an active and a passive verb, were dealt with in the lexicon, as a however, in sentences that do not contain a inite auxiliary, the I
relation between lexical items. his contrasts with the approach is not present, giving a tree such as that in (1).
in transformational theories. Functional ambiguously refers
to grammatical relations, which are prominent in the theory, and
(1) IP
mathematical functions, which are used in the LFG-formalism.
he LFG-formalism can be mathematically modeled and, hence,
analyses expressed within it are susceptible to computational test-
ing (see Dalrymple et al. 1995). LFG is often referred to as a syn- DP I
tactic theory, but like many other syntactic theories, it is actually
a framework within which theories of language can be expressed.
As one of the founders of LFG puts it: [T]he formal model of LFG VP
is not a syntactic theory in the linguistic sense. Rather, it is an
architecture for syntactic theory. Within this architecture, there here is assumed to be a fair amount of typological
is a wide range of possible syntactic theories and sub-theories, variation in c-structure among languages. A language such as
some of which closely resemble syntactic theories within alterna- Wambaya, which has relatively free word order apart from the
tive architectures, and others of which difer radically from famil- constraint that an auxiliary-like element has to appear in second
iar approaches (Bresnan 2001, 43). position, is assumed to have a functional category IP, where the
435
Lexical-Functional Grammar
I takes an exocentric category S as its complement (for a discus- ill them from those that are restricted in this way; a subject, for
sion of the relevant data and potential constraints on the typo- instance, can be associated with a large number of thematic
logical variation in constituent structure, see Nordlinger 1997). roles, whereas an oblique is restricted as to which thematic
F-structure is assumed to be reasonably invariant among role can ill it. LMT associates feature values to thematic roles
languages. It takes the shape of feature-value matrices, where intrinsically for example, an agent is intrinsically associated
the features capture grammatical relations and functional fea- with [o] and by default for instance, the highest thematic
tures. he simplest features are those with atomic values, such as role according to a typologically motivated thematic hierarchy
[number plu]. Grammatical relations such as subj(ect), obj(ect) is associated with [r]. Grammatical functions are then deined
or adj(unct) are represented as features that take f-structures as in terms of these two features; a subject, for instance, is [-o] in
their values. Each element that has lexical semantic content has not being object-like and [-r] in not being restricted to any par-
associated with it a feature pred, which has a semantic form as ticular thematic role, whereas an oblique is [-o] but [+r]. Two
its value. A verb such as tickle, for instance, has the feature-value well-formedness constraints apply to the mapping; the func-
pair [pred tickle < (subj) (obj) >], that is, in the pred value of this tion-argument bi-uniqueness condition which states that each
verb, tickle requires a subject and an object (for more details thematic role must be associated with exactly one function and
on semantics within LFG, see especially Dalrymple 2001, 21754, each function with exactly one thematic role and the subject
who develops an approach to semantic composition called glue condition which states that every predicate must have a sub-
semantics). he pred feature also captures selectional properties, ject. For examples of how LMT works and how it can analyze
which are based on functions and functional properties, rather constructions such as locative inversion or complex predicates
than on syntactic categories; a transitive verb selects for a subject in interesting ways, see, for instance, Dalrymple (2001, Chap. 8)
and an object, not for two noun phrases. An f-structure for the sen- or Bresnan (2001, Chap. 14).
tence Oscar tickled the cat can be found in (2). The mapping between c-structure and f-structure can be
structurally defined or identified through morphological ele-
PRED

( )(
tickle SUBJ OBJ

) ments. English is an example of a language wherein functions
PRED Oscar are determined through their hierarchical position; the sub-
SUBJ GEND masc
ject appears in the specifier of the IP. This is then captured
(2) PRED cat formally through the phrase structure rule in (3a), where the
NUM sg
OBJ up arrow should be read as the f-structure associated with
my mother node and the down arrow as the f-structure
SPEC def
TENSE associated with this node. The resulting tree can be found in
past
(3b), where indices have been inserted to identify f-structures;
hree well-formedness conditions apply to f-structures. he the f-structure associated with the IP is referred to as f1, and
general uniqueness condition requires each feature to have a so on.
unique value. he completeness and coherence conditions ensure
(3) a.
compatibility between the requirements of a pred feature and its
local f-structure. Completeness requires that all functions spec-
iied by an elements pred feature be present in the f-structure
built up around that element; if, for instance, there had been
no obj in (2), completeness would have been violated. It also b. IP f1
requires those functions to have a semantic value, which prevents
an argument position from being illed by an expletive pronoun,
for instance. he coherence condition requires that all functions DP f2 I' f 3
present in a local f-structure be licensed by another elements (SUBJ)= =
pred feature. If, for instance, there had been an obl(ique) in (2),
coherence would have been violated since no such function is he arrows can now be replaced by the indices, and we get the
licensed by tickle. his is one example of the way that constraints equations in (4).
accounted for in terms of structure in other approaches are
expressed through f-structure in LFG. (4) f 1 SUBJ = f 2
he information captured in the pred feature is not a primi-
f1 = f 3
tive of the theory. he syntactic valency is, in fact, derived from
the semantic roles associated with the verb; hence, this aspect hese equations refer to three f-structures and deine their rela-
of the f-structure is derived from the a-structure of the element. tions; the f-structure f1 contains a feature subj which has as its
he relation between semantic and syntactic valency is speci- value the f-structure f2. he second equation states that f1 is iden-
ied through lexical mapping theory (LMT). LMT works in terms tical to f3, which means that any featurevalue pair associated
of two features [o(bject)] and [r(estrictive)]. he feature [o] with either of the two will also be associated with the other; the
captures the fact that certain thematic roles cannot ill an two nodes IP and I will then be associated with one f-structure.
object function, for instance agents, which can be subjects or In fact, the categories that form the clausal backbone CP, IP
obliques, but not objects. he feature [r] distinguishes those and VP will always share f-structure, so that featurevalue pairs
functions that are not restricted as to which thematic roles can introduced to any of them will also be shared by the others. hese
436
Lexical-Functional Grammar Lexical Learning Hypothesis
categories are referred to as co-heads (Bresnan 2001, 102). he Information about dimensions of information not discussed
equations in (4) give rise to the partial f-structure in (5). here can be found in publications listed in the bibliography on
the LFG Web site. At this site, information can also be found on
(5) SUBJ [ ] f extensions and applications of LFG, such as combining LFG with
2
f f optimality theory (Bresnan 2000), as well as work by Rens
1 3
Bod and Ron Kaplan (2003), which combines linguistic theory
As further elements are added, the information contributed by
and statistical methods to create an exemplar-based theory of
their lexical entries or by functional equations associated with
syntax.
structure will be inserted into the f-structure as dictated by the
functional equations. his mapping procedure from c-structure Kersti Brjars
to f-structure, like all LFG mapping relations, has the property
of monotonicity; information can be added but never deleted, WORKS CITED AND SUGGESTIONS FOR FURTHER READING
moved, or changed.
Bod, Rens, and Ron Kaplan. 2003. DOP model for lexical-functional
For a language like Latin, there are no arguments for an
grammar. In Data-Oriented Parsing, ed. Rens Bod, Remko Scha, and
elaborate hierarchical clause structure, but rather a latter exo-
Khalil Simaan, 21133. Stanford, CA: CSLI Publications.
centric structure is appropriate. In such a language, functions Bresnan, Joan. 2000. Optimal syntax. In Optimality heory: Phonology,
are not deined structurally and there is no structural equation Syntax and Acquisition, ed. Joost Dekkers, Frank van der Leeuw, and
of the kind illustrated in (3a). Instead, functions are identi- Jeroen van de Weijer, 33485. Oxford: Oxford University Press.
ied through case marking, and this is captured directly in LFG . 2001. Lexical-Functional Syntax. Oxford: Blackwell.
through an association between the value for the feature case Bresnan, Joan, ed. 1982. he Mental Representation of Grammatical
and a function. For Latin, there would then be a global equation Relations. Cambridge, MA: MIT Press.
as in (6). Dalrymple, Mary. 2001. Lexical Functional Grammar. San Diego,
CA: Academic Press.
(6) (CASE) = nom (SUBJ) = Dalrymple, Mary, Ronald M. Kaplan, John T. Maxwell III, and Annie
Zaenen. 1995. Formal Issues in Lexical-Functional Grammar. Stanford,
his equation can be inserted at any noun phrase node and is CA: CSLI Publications.
read as if the f-structure associated with this node contains the Falk, Yehuda. 2001. Lexical-Functional Grammar: An Introduction to
featurevalue pair [case = nom], then the f-structure associated Parallel Constraint-Based Syntax. Stanford, CA: CSLI Publications.
with the node above contains the feature subj and the f-structure Nordlinger, Rachel. 1997. Constructive Case: Evidence from Australian
associated with this node is the value of that subj feature. Or in Languages. Stanford, CA: CSLI Publications.
less formal language, if this node is nominative, then it is the sub-
he LFG Web site is located online at: http://www.essex.ac.uk/
ject of the node above. linguistics/external/LFG/.
he mapping principles permit non-one-to-one correspon-
dences between dimensions of information. For instance, an
f-structure can contain a subj function without there being
LEXICAL LEARNING HYPOTHESIS
a noun phrase in the corresponding c-structure. his is how According to this hypothesis, childrens grammatical develop-
constructions generally referred to as pro-drop are analyzed, ment is incremental and driven by the learning of lexical ele-
though in LFG they are more appropriately named pronoun ments (see Pinker 1984; Clahsen 1996; and Eisenbeiss 2000,
incorporation. he verb in the Italian sentence Rido I laugh, 2003, 2009 for overviews and references). he lexical learn-
for instance, is analyzed as consisting of just a verb. his verb ing hypothesis was developed by proponents of generative
contains in its f-structure description the type of informa- grammar in order to address the poverty-of-the-stimulus
tion contributed by the subject pronoun in the corresponding argument: In order to produce and understand new sentences,
English sentence. he verb form rido would be associated with children must generalize beyond individual input utterances.
equations such as those in (7). However, they do not have reliable access to systematic correc-
tions that would allow them to reject incorrect generalizations
about the target language. herefore, generative linguists have
postulated an innate language acquisition device, uni-
versal grammar (UG), that constrains childrens hypothesis
(7) space. According to the principles and parameters the-
ory, UG contains i) principles that constrain all grammatical
representations and ii) open parameters that provide a inite
set of values, that is, options from which learners can choose
(Chomsky 1981). For instance, generative linguists assume
The crucial part of (7) is the equation that introduces a pred that all sentences contain subjects, but that languages may dif-
feature with a pronominal value for its subject. This means fer with respect to the positioning of subjects and their overt
that the principle of completeness is satisfied by the verb realization (e.g., optional subjects in Italian versus obligatory
itself. subjects in English). In such a model, language acquisition
Some of the fundamental properties of LFG have been illus- only involves i) setting parameters to their target values and ii)
trated here mainly through reference to f- and c-structure. acquiring the lexicon.
437
Lexical Learning Hypothesis
If one assumes such a powerful acquisition device, one must but to their individual grammatical features (e.g., tense), which
explain why children need several years to acquire their tar- are stored in lexical entries for grammatical morphemes and
get grammar and initially produce non-target-like sentences project to phrases whenever these morphemes are combined.
for example, subjectless sentences in English. Faced with this According to such models, children should be able to acquire
developmental problem, proponents of the lexical learning individual features independently of one another, integrate them
hypothesis argue that UG is available from the onset of grammat- into lexical entries for individual lexical/morphological elements
ical development, but in order to set parameters, children still in an item-by-item fashion, and project each of these features
need to learn the grammatical properties of the lexical elements into phrases when these elements are combined. hus, whether
associated with these parameters. or not a childs utterance involves a realization of a particular
hese assumptions are in line with lexicalist generative grammatical feature and the corresponding syntactic operations
models: Initially, parameters referred to a heterogeneous set of does not depend on a global parameter value. Rather, it depends
linguistic properties, for example, subject omissions, word order, on the individual lexical items that the child has acquired so far.
or morphological marking. However, cross-linguistic (parametric) Hence, developmental dissociations between individual lexical
variation is closely linked to lexical properties, in particular to items and individual features are expected. For instance, dei-
properties of grammatical morphemes (see, e.g., Manzini and nite and indeinite articles are diferent lexical realizations of the
Wexler 1987). For instance, Germanic languages with postverbal functional category determiner, and German children acquire
negation exhibit a morphological distinction between irst and indeinite articles before deinite articles. Similarly, when they
second person. Proponents of lexicalist models argue that this sug- start producing deinite articles, German children use feminine
gests a relationship between parameter values for word order and forms correctly, but then incorrectly combine masculine forms
the person speciications of subject-verb-agreement markers. of articles with both masculine and neuter nouns. his suggests
In recent generative models, such markers or function words (e.g., that German children acquire the [FEMININE] distinction
auxiliaries) are analyzed as realizations of functional categories before they instantiate the feature [MASCULINE] that distin-
that project to phrases, just like the lexical categories verb and guishes masculines from neuters.
noun. For instance, subject-verb-agreement markers are viewed hus, in sum, the lexical learning hypothesis, that is, the idea
as realizations of the functional category INFL (Chomsky 1986). that syntactic development is driven by lexical development,
Proponents of lexical learning regard functional categories can provide accounts for the incremental nature of syntactic
as the only source of parametric variation (Chomsky 1989), and development, as well as for the observed correlations between
they argue that children should ix parameters and build up pro- lexical and syntactic development and the developmental dis-
jections of functional categories by learning the properties of the sociations that have been observed in childrens grammatical
lexical elements that encode the respective functional categories. development.
Hence, one should ind developmental correlations between the
Sonja Eisenbeiss
acquisitions of lexical items and the acquisition of the syntactic
properties associated with the projections of the corresponding
functional categories. Such correlations have been documented WORKS CITED AND SUGGESTIONS FOR FURTHER READING
for instance, a correlation between the acquisition of the German Chomsky, Noam. 1981. Lectures on Government and Binding. Dordrecht,
subject-verb-agreement paradigm and the target-like ordering the Netherlands: Foris.
of subjects, verbs, and negation (Clahsen 1996). Moreover, if one . 1986. Barriers. Cambridge, MA: MIT Press.
assumes incremental phrase-structure building, one can explain . 1989. Some notes on economy of derivation and representa-
developmental dissociations between realizations of diferent tion. In MIT Working Papers in Linguistics 10, 4374, Massachusetts
Institute of Technology, Cambridge.
functional categories for instance, the observation that German
Clahsen, Harald, ed. 1996. Generative Perspectives on Language
children master the use of agreement markers associated with
Acquisition: Empirical Findings, heoretical Considerations and
INFL before they consistently produce complementizers, that is,
Crosslinguistic Comparisons. Amsterdam: Benjamins. With relevant
realizations of the functional category COMP. contributions by Harald Clahsen, Sonja Eisenbeiss, and Martina
Children show even more complex dissociations, however Penke; Jrgen Meisel and Maria-Jose Ezeizabarrena; Andrew Radford;
(Eisenbeiss 2003): First, they start to realize diferent features of and homas Roeper.
the same category at diferent points. For instance, for the cat- Eisenbeiss, Sonja. 2000. he acquisition of the determiner phrase in
egory case, German children mark the nominative/accusative German child language. In he Acquisition of Syntax: Studies in
distinction before the accusative/dative distinction. Second, chil- Comparative Developmental Linguistics, ed. M.-A. Friedemann and
dren do not acquire all instantiations of the same features simul- L. Rizzi, 2662. London: Longman.
taneously. For example, German children show case distinctions . 2003. Merkmalsgesteuerter Grammatikerwerb. Eine Untersuchung
zum Erwerb der Struktur und Flexion der Nominalphrase. Available
on pronouns earlier than on articles. hird, childrens realiza-
online at: http://www.ub.uni-duesseldorf.de/home/etexte/diss/
tions of functional categories show lexeme-speciic restrictions.
show?dissid=1185.
For instance, German children initially restrict the possessive -s
. 2009. Generative approaches to language learning. Linguistics
to some familiar names (e.g., Mamas mommys). 47.2: 273310.
hese observations can be captured in feature-based, lexi- Manzini, Rita, and K. Wexler. 1987. Parameters, binding theory, and
calist versions of the lexical learning hypothesis (see Eisenbeiss learnability. Linguistic Inquiry 18 (July): 41344.
2003, 2009 for discussion): In these models, cross-linguistic var- Pinker, Steven. 1984. Language Learnability and Language Development.
iation is not so much related to functional categories, as such, Cambridge: Harvard University Press.
438
Lexical Processing, Neurobiology of
LEXICAL PROCESSING, NEUROBIOLOGY OF A broad perspective on the neural correlates of spoken word
recognition is provided by P. Indefrey and A. Cutler (2004), who
he lexicon is the store of words in the mental dictionary.
report a meta-analysis of 55 experiments in which subjects pas-
A typical English-speaking high school graduate knows about
sively listened to tones, pseudowords, words, or sentences. It
60,000 words, a literate adult perhaps twice that number (Miller
was found that all of the diferent types of auditory stimuli reliably
1991, 138). A word can be regarded as a long-term memory
activate overlapping, as well as partially diferentiated, central
association of semantic, syntactic, phonological, and
and posterior regions of the superior temporal gyri in both hemi-
orthographic structures. For example, the lexical entry for rose
spheres. In addition, the following hierarchical organization was
includes the following components, with the semantic compo-
observed: As the linguistic complexity of the stimuli increases,
nent symbolized by a picture for convenience:
there is recruitment of progressively more anterior regions of the
rose left superior temporal sulcus. hus, moving anteriorly, there is
meaning: irst an area responsive to pseudowords but not tones, then an
part of speech: noun area responsive to words but not pseudowords, and inally an
phonology: /roz/ area responsive to sentences but not words. he anterior area
orthography: ROSE that is selectively activated by words may contribute to the res-
olution of the lexical competition process described here; how-
During the past two decades, there has been remarkable prog-
ever, it is also conceivable that this operation is subserved by one
ress in understanding the neural substrates of lexical processing,
of the more posterior word-speciic areas (Orfanidou, Marslen-
mainly because of advances in two complementary approaches for
Wilson, and Davis 2006). After the phonological form of a word
investigating the functions of speciic brain structures: 1) the lesion
has been recognized, its semantic and syntactic components are
method, which, when used with ample numbers of patients who
retrieved. As summarized by Indefrey and Cutler (2004), these
are carefully studied both neuropsychologically and neuroanatomi-
processes may be executed by a wide distribution of predomi-
cally, can yield indispensable insights about the neural systems that
nantly left hemisphere brain regions, including most notably
are necessary for particular abilities; and 2) functional imaging tech-
the middle and inferior temporal gyri and the posterior inferior
niques, such as fMRI, which allow researchers to identify with more
frontal gyrus.
ine-grained spatial resolution the brain structures that are engaged
Turning to spoken word production, one of the most inlu-
during the normal performance of certain tasks (see neuroimag-
ential theories is that proposed by W. J. M. Levelt, A. Roelofs,
ing). Much more has been learned about the neural substrates of and A. S. Meyer (1999). According to their model, the produc-
lexical processing than can be summarized here, and so this review
tion of spoken content words depends on multiple processing
concentrates on cortical regions that have been linked with the rec-
stages, each of which generates its own characteristic output
ognition and production of spoken and written word forms.
representation (Figure 1). First, conceptual preparation involves
identifying the meaning of the word to be produced. Second,
Neural Substrates of Spoken Word Recognition and lexical selection involves activating the lemma for the word
Production that is, a unit that intervenes between semantics and phonology
It is well established that the sensorimotor aspects of spoken and that serves as the gateway to syntactic features (e.g., gram-
word processing depend on the left perisylvian cortex, and matical category, number, tense, etc.; these features are not
there is growing evidence that both the posterior superior tem- shown in Figure 1). hird, form retrieval involves calling up the
poral (auditory-related) and the posterior inferior frontal phonological code for the word. Fourth, syllabiication involves
(motor-related) sectors of this large anatomical territory contrib- determining segmental clusters and metrical assignments. Fifth,
ute to both speech perception and speech production (Imada phonetic encoding involves transforming syllabic units into
et al. 2006; Okada and Hickok 2006; Pulvermller et al. 2006; motor instructions. And sixth, articulation involves the inal pro-
Skipper et al. 2008). hese two regions interact not only via direct gramming of overt speech.
connections but also via an indirect pathway mediated by the he neural correlates of the irst stage, conceptual prepara-
inferior parietal lobule (Catani, Jones, and Ffytche 2005). tion, remain mysterious, largely because this stage constitutes
To understand spoken words, listeners must irst use the audi- the complex interface between language and thought and is also
tory input to activate stored representations of lexical-phono- heavily inluenced by social-cognitive perspective-taking abili-
logical form. It is only after this process of lexical access has been ties for example, the same piece of real estate can be called the
achieved that the semantic and syntactic properties of words can coast, the shore, or the beach, depending on ones communicative
be activated and used to construct higher-level representations of goals (Tomasello 1999, 119). Future research may show that con-
the utterance. Numerous behavioral studies suggest that speech ceptual preparation is subserved by widespread cortical struc-
information is continuously projected to the lexicon, so that an tures that underlie semantic processing (Kemmerer 2010; see
initial sequence like bla will activate all the words in the listen- semantics, neurobiology of). he next two stages, lemma
ers lexicon that begin with those sounds (black, bland, blanket, selection and phonological form retrieval, both involve core
etc.); as the input accumulates, the set of activated words dimin- lexical processes, and their neural correlates are beginning to be
ishes until only a single one matches the input, at which point understood. In a meta-analysis of 58 functional imaging studies
recognition can be said to occur (McQueen, Dahan, and Cutler including several studies employing magnetoencephalography,
2003). Pseudowords (e.g., blash) also activate partially matching which has excellent temporal resolution Indefrey and Levelt
candidate words, but ultimately no winner is selected. (2004) found that lemma selection is linked with the midsection
439
Figure 1. The LRM (i.e., Levelt, Roelofs, and Meyer) model of spoken word production. Left column: Word pro-
duction tasks involving lead-in processes that enter the central word production architecture at different stages.
Middle column: Core processes of word production and their characteristic output. Right column: Example fragments
of outputs generated at each stage. Reprinted by permission from Elsevier, copyright 2004, from P. Indefrey and
W. Levelt, The spatial and temporal signatures of word production components, Cognition 92: 10144.
of the left middle temporal gyrus and typically occurs during a related brain structures; however, the exact neural correlates of
time window of 150225 milliseconds (ms) post-stimulus in oral each stage are not yet clear (Bohland and Guenther 2006).
picture-naming tasks (see Color Plate 4). hey also found that Independently of Levelt, Roelog, and Meyers (1999) model,
phonological form retrieval is linked with the posterior portions a great deal of neuroscientiic research has focused on the pro-
of the left middle and superior temporal gyri and occurs during cess of mapping the meanings of words onto their correspond-
a time window of either 200400 or 275400 ms, depending on ing phonological forms during speech production. One
the studies that are considered. he three postlexical stages of important line of work, conducted by Hanna Damasio, Daniel
spoken word production are known to rely on a variety of motor- Tranel, and their colleagues (2004), suggests that this process
440
is subserved by intermediary units that are analogous to lem- the VWFA was reported by R. Gaillard et al. (2006; see also Martin
mas insofar as they function as relays, taking lexical-semantic 2006). In brief, prior to surgery for intractable epilepsy, the patient
structures as input and then pointing to the appropriate lexical- exhibited normal single-word reading, including a lack of increase
phonological structures. It is interesting that these intermediary in reading time for common words varying in length from three to
units may be neurally organized according to both semantic and eight letters; moreover, fMRI revealed his VWFA to have normal
grammatical principles. For example, lesion data suggest that, functional-anatomical characteristics, and local ield potentials
contrary to Indefrey and Levelts (2004) proposal, the retrieval recorded from implanted electrodes showed that this area was
of nouns for diferent categories of concrete entities may hinge sensitive to word frequency but not word length, again within
on intermediary units that do not reside in the left middle tem- normal parameters. After excision of tissue just posterior to the
poral gyrus but, rather, in the left temporal pole (TP) and inf- VWFA, the patients epileptic seizures were successfully elim-
erotemporal (IT) cortices. Speciically, studies in which oral inated, but his reading was markedly slow and inaccurate, with
picture-naming tasks have been administered to large cohorts of reading times increasing linearly with word length (i.e., letter-
brain-damaged patients have shown that 1) impaired access to by-letter reading). In addition, the VWFA no longer responded
proper nouns for unique persons (e.g., Jennifer Aniston) is asso- to printed words, even when they were contrasted with a simple
ciated with left TP lesions, 2) impaired access to common nouns ixation point. his study, therefore, provides powerful new evi-
for animals (e.g., horse) is associated with damage to the anterior dence that the VWFA is in fact necessary for access to the stored
sector of left IT, and 3) impaired access to common nouns for orthographic forms of words during reading.
tools (e.g., hammer) is associated with damage to the posterior Writing also depends on a large network of widely distributed
sector of left IT, a region called IT+ (Damasio et al. 1996; Damasio brain regions (Hillis and Rapp 2004; Rapcsak and Beeson 2002).
et al. 2004). Crucially, the patients have intact object recognition Information about the neural basis of lexical access during writ-
and conceptual knowledge since they can accurately describe ten word production comes primarily from patients with lexical
the entities they cannot name; in other words, the disorders are agraphia, a disorder in which words with regular mappings
purely anomic. Furthermore, functional imaging data indicate between phonology and orthography are spelled correctly, but
that the same cortical regions are activated in normal subjects words with irregular mappings (e.g., choir) are misspelled. he
in the same category-speciic ways when concrete entities are errors are usually phonologically plausible (e.g., circuit serkit)
orally named from either pictures (Damasio et al. 1996, 2004) and afect low-frequency words more than high-frequency
or characteristic sounds (Tranel et al. 2003; Tranel et al 2005). ones. Lexical agraphia is typically caused by damage to the left
here is also increasing evidence from several methodologies temporo-parieto-occipital junction (Brodmann areas 37 and/or
that the process of retrieving action verbs engages a quite difer- 39), although in some cases, there is involvement of the left ven-
ent neural pathway that includes the left ventrolateral premotor/ tral occipitotemporal region, close to, if not encompassing, the
prefrontal cortex (Damasio et al. 2001; Shapiro and Caramazza VWFA. Several functional imaging studies with normal subjects
2004; Tranel et al. 2001; Tranel et al. 2008). his region is reliably provide further support for a role of these cortical regions in writ-
activated when action verbs are accessed, and damage to it fre- ten word production (e.g., Petrides Alvisatos, and Evans 1995;
quently impairs the production of verbs but not nouns. Nakamura et al. 2000).
Neural Substrates of Written Word Recognition and Conclusion

Production When people recognize and produce the spoken and written
Reading and writing are recent inventions in human history forms of words, they usually concentrate on the meanings being
and must be explicitly taught. For literate individuals, how- expressed and remain blithely unaware of the complex compu-
ever, word representations include not just a phonological tations being executed by their brains in order to rapidly and
component but also an orthographic component that is ei- efectively process the lexical structures themselves. Cognitive
ciently processed by neural circuits that are gradually being neuroscience is beginning to reveal the intricasies of these neural
elucidated. systems, and dramatic advances are likely to happen in the com-
he activity of reading recruits numerous brain regions in ing years. Exciting new discoveries are appearing in the literature
the temporal, parietal, and frontal lobes (Hillis and Rapp 2004; almost daily, and this explosion of research will undoubtedly
Hillis and Tuiash 2002). Perhaps the most controversial region, provide fresh insights into the neurobiology of lexical process-
however, has been the visual word form area (VWFA), located ing, with signiicant implications for understanding and treating
in the left occipitotemporal sulcus bordering the fusiform gyrus disorders that result from brain injury.
(McCandliss, Cohen, and Dehaene 2003; Dehaene 2005). his
area responds more strongly to printed words than to other types David Kemmerer
of visually presented objects, such as faces, animals, and tools.
Also, disruption of the input projections to this area can induce WORKS CITED AND SUGGESTIONS FOR FURTHER READING
pure alexia without agraphia, a disorder in which reading can be Bohland, J. W., and F. H. Guenther. 2006. An fMRI investigation of sylla-
accomplished only in a laborious letter-by-letter manner, while ble sequence production. NeuroImage 32: 82141.
writing and all other linguistic skills are unafected. Despite these Catani, M., D. K. Jones, and D. H. Ffytche. 2005. Perisylvian language
indings, the question of whether the VWFA plays a genuine causal networks of the human brain. Annals of Neurology 57: 816.
role in reading has been hotly debated (e.g., Price and Devlin Damasio, H., T. J. Grabowski, D. Tranel, R. D. Hichwa, and A. R. Damasio.
2003). Recently, however, a compelling case study supporting 1996. A neural basis for lexical retrieval. Nature 380: 499505.
441
Lexical Processing, Neurobiology of Lexical Relations
Damasio, H., T. J. Grabowski, D. Tranel, L. L. B. Ponto, R. D. Hichwa, and Price, C. J., and J. T. Devlin. 2003. he myth of the visual word form area.
A. N. Damasio. 2001. Neural correlates of naming actions and of nam- NeuroImage 19: 47381.
ing spatial relations. NeuroImage 13: 105364. Pulvermller, F., M. Huss, F. Kherif, F. M. del Prado Martin, O. Hauk,
Damasio, H., D. Tranel, T. Grabowski, R. Adolphs, and A. N. Damasio. and Y. Shtyrov. 2006. Motor cortex maps articulatory features of
2004. Neural systems behind word and concept retrieval. Cognition speech sounds. Proceedings of the National Academy of Sciences
92: 179229. 103: 786570.
Dehaene, S. 2005. Evolution of human cortical circuits for reading and Rapcsak, S. Z., and P. M. Beeson. 2002. Neuroanatomical corre-
arithmetic: he neuronal recycling hypothesis. In From Monkey lates of spelling and writing. In he Handbook of Adult Language
Brain to Human Brain, ed. S. Dehaene, J.-R. Duhamel, M. Hauser, and Disorders: Integrating Cognitive Neuropsychology, Neurology, and
G. Rizzolatti, 13358. Cambridge, MA: MIT Press. Rehabilitation, ed. A. E. Hillis, 71100. Philadelphia: Psychology
Gaillard, R., L. Naccache, P. Pinel, S. Clemenceau, E. Volle, D. Hasboun, Press.
S. Dupont, M. Maulac, S. Dehaene, C. Adam, and L. Cohen. 2006. Rapp, B., and M. Goldrick. 2006. Speaking words: Contributions of
Direct intracranial, fMRI, and lesion evidence for the causal role of cognitive neuropsychological research. Cognitive Neuropsychology
left inferotemporal cortex in reading. Neuron 50: 91204. 23: 3973.
Hillis, A. E., and B. C. Rapp. 2004. Cognitive and neural substrates of Shapiro, K., and A. Caramazza, A. 2004. he organization of lexical
written language: Comprehension and production. In he Cognitive knowledge in the brain: he grammatical dimension. In he Cognitive
Neurosciences. Vol. 3. Ed. M. Gazzaniga, 77587. Cambridge, MA: MIT Neurosciences. Vol. 3. Ed. M. Gazzaniga, 80314. Cambridge, MA: MIT
Press. Press.
Hillis, A. E., and E. Tuiash. 2002. Neuroanatomical aspects of reading. Skipper, J. I., V. van Wassenhove, H. C. Nusbaum, and S. L. Small. 2008.
In he Handbook of Adult Language Disorders: Integrating Cognitive Hearing lips and seeing voices: How cortical areas supporting speech
Neuropsychology, Neurology, and Rehabilitation, ed. A. E. Hillis, 1526. production mediate audiovisual speech perception. Cerebral Cortex
Philadelphia: Psychology Press. 18: 243949.
Imada, T., Y. Zhang, M. Cheour, S. Taulu, A. Ahonen, and P. K. Tomasello, M. 1999. he Cultural Origins of Human Cognition.
Kuhl. 2006. Infant speech perception activates Brocas area: Cambridge: Harvard University Press.
A developmental magnetoencephalographic study. NeuroReport Tranel, D., R. Adolphs, H. Damasio, and A. R. Damasio. 2001. A neural
17: 95762. basis for the retrieval of words for actions. Cognitive Neuropsychology
Indefrey, P., and A. Cutler. 2004. Prelexical and lexical processing. 18: 65570.
In he Cognitive Neurosciences. Vol. 3. Ed. M. Gazzaniga, 75974. Tranel, D., H. Damasio, G. R. Eichhorn, T. J. Grabowski, L. L. B. Ponto,
Cambridge, MA: MIT Press. and R. D. Hichwa. 2003. Neural correlates of naming animals from
Indefrey, P., and W. J. M. Levelt. 2004. he spatial and temporal signa- their characteristic sounds. Neuropsychologia 41: 84754.
tures of word production components. Cognition 92: 10144. Tranel, D., T. J. Grabowski, J. Lyon, and H. Damasio, H. 2005. Naming
Kemmerer, D. 2010. How words capture visual experience: he per- the same entities from visual or from auditory stimulation engages
spective from cognitive neuroscience. In Words and the Mind: How similar regions of left inferotemporal cortices. Journal of Cognitive
Words Capture Human Experience, ed. B. Malt and P. Wolf, 289329. Neuroscience 17: 12931305.
Oxford: Oxford University Press. Tranel, D., K. Manzel, E. Asp, and D. Kemmerer. 2008. Naming static and
Levelt, W. J. M., A. Roelofs, and A. S. Meyer. 1999. A theory of lexical dynamic actions: Neuropsychological evidence. Journal of Physiology,
access in speech production. Behavioral and Brain Sciences 22: 138. Paris 102: 8094.
Martin, A. 2006. Shades of Dejerine forging a causal link between the
visual word form area and reading. Neuron 50: 1735.
McCandliss, B. D., L. Cohen, and S. Dehaene. 2003. he visual word form LEXICAL RELATIONS
area: Expertise for reading in the fusiform gyrus. Trends in Cognitive Lexical relations are ways in which words, or lexemes, share
Sciences 7: 2939.
something in common. his broad deinition includes relations
McQueen, J. M., D. Dahan, and A. Cutler. 2003. Continuity and
based on phonological relatedness, such as rhyming, and
gradedness in speech processing. In Phonetics and Phonology in
Language Comprehension and Production: Diferences and Similarities,
morphological relatedness, like being the range of tensed
ed. N. Schiller and A. Meyer, 3776. New York: Mouton de Gruyter. forms of a verb. However, in most contexts, the term is used to
Miller, G.A. 1991. he Science of Words. New York: Freeman. refer speciically to semantic relations among words and, most
Miozzo, M., and A. Caramazza, eds. 2008. Lexical processing. Cognitive frequently, to paradigmatic semantic relations among words,
Neuropsychology 25.4 (Special Issue). including synonymy, hyponymy, and antonymy. Such relations
Nakamura, K., M. Honda, T. Okada, T. Hanakawa, K. Toma, H. Fukuyama, are sometimes called sense relations (Lyons 1977), as it is usually
J. Konishi, and H. Shibasaki. 2000. Participation of the left posterior a single denotative sense of a word rather than every sense and
inferior temporal cortex in writing and mental recall of kanji orthogra- every aspect of the lexical item that is relevant to the relation.
phy: A functional MRI study. Brain 123: 95467.
hus, we expect the postal, rubber, and stomp senses of
Okada, K., and Hickok, G. 2006. Left posterior auditory-related cortices
stamp to enter into lexical relations with diferent sets of words.
participate both in speech perception and speech production: Neural
overlap revealed by fMRI. Brain and Language 98: 11217.
Paradigmatic and Syntagmatic Relations
Orfanidou, E., W. D. Marslen-Wilson, and M. H. Davis. 2006. Neural
Semantic relations are generally divided into two types, usu-
response suppression predicts repetition priming of spoken words
and pseudowords. Journal of Cognitive Neuroscience 18: 123752. ally called paradigmatic and syntagmatic. Syntagmatic rela-
Petrides, M., B. Alivisatos, and A. C. Evans. 1995. Functional activation tions are relations of combination that is to say, words that
of the human ventrolateral frontal cortex during mnemonic retrieval of ill diferent slots in a phrase, like book and read or delicious
verbal information. Proceedings of the National Academy of Sciences and food. hese can be grouped into relational types like mod-
92: 58037. iiermodiied or event verbagent. Some theories of lexical
442
Lexical Relations
semantics build such relations into lexical entries, for exam- synonyms, for example, when the same object is named inde-
ple in the selectional restrictions of Jerrold J. Katz and Jerry A. pendently by two subcommunities, either one word falls into dis-
Fodor (1963) and the lexical functions of meaning-text theory use or one or both of the words become specialized to a slightly
(Meluk 1996). diferent sense or context type. English, through its history of
Paradigmatic relations are relations of substitutability; the contact with other languages, has come to have many such
words in a semantic paradigm are diferent options for illing the near-synonyms, such as riseascend, smartintelligent, and dead
same phrasal slot. For example, red/yellow/blue are three options deceased. While these are substitutable for each other in many
for subject position in X is a primary color. Paradigmatic relations contexts, they all difer in meaning and use. For example, while
are studied because of their role in logical relations among sen- a balloon could ascend or rise, a person rises, rather than ascends,
tence meanings, such as entailment, and because of what they from a chair. And while people can be dead or deceased, plants
might tell us about how the mental lexicon is organized. can only be said to be dead. Many other cases of not-quite-synon-
ymy involve words that refer to similar, but not identical, things
Semantic versus Lexical Relations for example, tapas and hors doeuvres or shovel and scoop.
he term lexical relation is ambiguous, in that it can refer either he second problem with a truth-conditional deinition of
a) to [semantic] relations among words or b) to [semantic] rela- synonymy is that it allows as synonyms items that have difer-
tions among words that are represented in the mental lexicon, ent non-truth-conditional content. For example, the nouns dog,
as information in or links between the lexical entries for those doggy, and pooch may all be truthfully applied to a particular
words. Some authors reserve lexical relations for the b mean- animal, but choosing pooch implies diferent things about the
ing and use semantic relations for the a meaning. For exam- animal and the speakers relation to it than dog does. hus, some
ple, Derek Gross, Ute Fischer, and George A. Miller (1989) claim apply a more restrictive substitution test that takes into account
that while large and little are semantically opposite, they are not connotational and social aspects of meaning, as well as the truth
directly related to each other as lexical antonyms, whereas large conditional. To the extent that goodness of synonym relations
and small are both semantically and lexically related. In other can be afected by non-truth-conditional issues like register
words, large and small are not only semantically opposed; we and morphological complexity, synonymy can be considered a
have also learned through linguistic experience that the words lexical relation, as well as a semantic relation.
themselves are opposed. his distinction between lexical and Synonymy currently receives much attention in compu-
nonlexical relations is intended to explain why some word sets tational linguistics, as language generators and machine
are particularly strongly linked to each other, both in terms of co- translators require principled means for selecting the most
occurrence in corpora and psycholinguistic behavior for appropriate word for a context from a lexicon illed with near-
example, in word association experiments. synonyms. Such studies can be particularly concerned with dis-
cerning ways in which synonyms or near-synonyms can difer
Types of Paradigmatic Relations (see, e.g., DiMarco, Hirst, and Stede 1993).
he most studied paradigmatic lexical-semantic relations are
synonymy, hyponymy, and antonymy/contrast, because a) sub- HYPONYMY. Hyponymy, the type of relation, is the relation of
stitution of members of these sets typically results in regular con- sense inclusion, although it is often deined in terms of referen-
sequences for truth conditional semantics, and thus b) tial, or categorial, inclusion. In sense terms, beer is a hyponym
they are central organizational principles in many lexicological of beverage because the sense of beer includes all the informa-
theories (see semantic fields). While each of these relations tion that is in the sense of beverage, plus additional information
is easily exempliied, deinitions and the role of the relations in that identiies beers as special types of beverages. In catego-
the mental lexicon is still the subject of debate. Traditionally, rial terms, everything that beer denotes is included in the set of
deinitions that depend on the logical consequences of substi- everything that beverage denotes. Hyponym relations are, thus,
tution have been used (e.g., Lyons 1977). More recently, D. A. linguistic relexes of categorial inclusion relations, and, as such,
Cruse (1994) has proposed prototype-based deinitions of some theorists consider them to be semantic, but not lexical,
these relations, and M. Lynne Murphy (2003) has proposed a relations.
pragmatic approach. Next, we look at the relations in turn and Unlike synonymy, hyponymy is an asymmetrical relation, in
highlight some research issues associated with each. that beer is a type of beverage but beverage is not a type of beer.
Hyperonymy is the term for the converse relation from beverage
SYNONYMY. Synonymy, or sameness of meaning, is usually to beer. Hyponymy is usually said to be a transitive relation. For
deined in terms of a substitution test. If word X can be substi- example, if beer is a hyponym of beverage and beverage is a hyp-
tuted for a particular sense of word Y in any sentence with no onym of liquid, then beer is a hyponym of liquid. However, tran-
change to the truth conditions of the sentence, then X and Y are sitivity holds only in cases of proper sense inclusion not in all
synonyms. However, this deinition does not include many of the cases that pass the is a type/kind of test (thus proving that the test
things that are called synonyms in thesauruses and everyday can be misleading). For example, speed-reading is a type of read-
discourse, for a few reasons. ing, and reading is a type of leisure activity, but speed-reading is
First, languages generally avoid synonymy since it is eco- not usually considered to be a leisure activity. While reading can
nomical (both in lexical acquisition and in any interaction) to function as a leisure activity, it is not deined in terms of being
assume that diferent forms signal diferent meanings (see, e.g., leisurely, and thus reading is not a hyponym of leisure activity in
Clark 1992). When a language variety chances on a pair of perfect the logical sense of the term.
443
Lexical Relations Lexical Semantics
Inclusion relations, along with contrast relations, are central in all being direct hyponyms of color but difer in the part of
to most approaches to lexical meaning, particularly in the treat- the spectrum they designate. Still, they are not truly incompat-
ment of noun meaning. For other word classes, hyponym rela- ible since they may overlap some shades of turquoise may be
tions are less frequent or less clearly paradigmatic. For example, considered to be both blue and green. he fact that most people
the adjectives happy and sad describe states that belong to the would insist that it must be one or the other, however, indicates
category designated by emotion, but since emotion is a noun, it is our preference for acting as if contrast sets are incompatible.
not substitutable for happy and sad.
OTHER RELATIONS. he foregoing types of paradigmatic rela-
ANTONYMY/CONTRAST. While perfect synonymy involves words tion are generally held to be the most important to lexical and
that always refer to the same thing, logical incompatibility semantic theories, but many more have been noted in the lit-
involves words that never overlap in reference. his logical rela- erature. he most discussed of these is meronymy, the part of
tion contributes to two lexical-semantic relations, antonymy and relation, though this can be thought of as a cover term for many
contrast. other types of relation, such as segmentwhole (slicecake),
Binary opposition seems to have special status in language materialwhole (woodtable), leaderorganization (captain
and conceptualization since even where more than one incom- team), and so on.
patible alternative is available, binary relations may be dis- Whether a precise taxonomy of relations is possible or neces-
cerned. For example, although there are many emotions (happy, sary is an open question. While semantic ield views of the lex-
sad, angry, afraid, disgusted), happy is generally understood as icon rely on a small number of well-deined relations, theories
having a single antonym, sad. employing looser semantic networks or semantic domains might
Logical approaches to antonymy distinguish various types. allow for as many types of lexical relations as there are possible
Complementary (or contradictory) antonyms perfectly bisect relations among entities in the world. As the subtypes of anton-
a semantic domain. For example, in relation to electrical ymy and meronymy show, there is also the question of the level
items, if something is not on, then it is necessarily of and vice of speciicity that needs to be employed in representing these
versa. Contrary antonyms designate the extremes of a scale. relations in linguistic theory.
For instance, something that is large is necessarily not small, M. Lynne Murphy
but something that is not small is not necessarily large. (Some
authors, including Lyons 1977 and Cruse 1986, restrict the term WORKS CITED AND SUGGESTIONS FOR FURTHER READING
antonym to just contraries.) Converse antonyms indicate difer-
ent perspectives on the same relationship for example send/ Clark, Eve V. 1992. Conventionality and contrast. In Frames, Fields, and
Contrasts, ed. Adrienne Lehrer and Eva Feder Kittay, 17188. Hillsdale,
receive, teacher/student, north/south. So, if X is north of Y, then Y
NJ: Erlbaum.
is south of X. Other types of antonymy have received some atten-
Cruse, D. A. 1986. Lexical Semantics. Cambridge: Cambridge University
tion, but these tend not to show the same kinds of logical rela-
Press.
tions as those mentioned earlier. For example, kinship terms can . 1994. Prototype theory and lexical relations. Rivista di Linguistica
be opposed on the basis of gender (brother/sister) or generation 6: 16788.
(mother/daughter). DiMarco, Crysanne, Graeme Hirst, and Manfred Stede. 1993. he seman-
In the broad sense of the term, antonymy is often deined as tic and stylistic diferentiation of synonyms and near-synonyms.
a relationship of minimal diference; that is, antonymous words Proceedings, AAAI Spring Symposium on Building Lexicons for Machine
share most of their semantic content, but for one diference that Translation, Stanford, CA: 11421.
makes the two terms incompatible. On such a deinition, mother Gross, Derek, Ute Fischer, and George A. Miller. 1989. he organi-
and daughter are opposites because they only difer in the gen- zation of adjectival meanings. Journal of Memory and Language
28: 92106.
eration they refer to, while mother and son are not opposites
Jones, Steven. 2002. Antonymy. London: Routledge.
because they difer in both gender and generation.
Katz, Jerrold J., and Jerry A. Fodor. 1963. he structure of a semantic
As mentioned, the lexical relation antonymy is sometimes theory. Language 39: 170210.
contrasted to the semantic relation opposition, and antonym Lyons, John. 1977. Semantics. 2 vols. Cambridge: Cambridge University
pairs present the best evidence in favor of the position that rela- Press.
tions between lexical items, not just senses, are represented in Meluk, I. A. 1996. Lexical functions: A tool for the description of
the mental lexicon. Antonym pairings are particularly strong in lexical relations in a lexicon. In Lexical Functions in Lexicography
experiments such as word association tasks and lexical prim- and Natural Language Processing, ed. Leo Wanner, 37102.
ing, and antonyms co-occur frequently in text leading some Amsterdam: Benjamins.
(e.g., Jones 2002) to question whether antonymy is also a syntag- Murphy, M. Lynne. 2003. Semantic Relations and the Lexicon.
matic relation. Cambridge: Cambridge University Press.
Larger sets of incompatible items exist, for example solid/
liquid/gas, but more linguistic-semantic attention is usually paid
LEXICAL SEMANTICS
to the not-necessarily-incompatible relation of co-hyponymy
or contrast, which, along with hyponymy, is basic to the taxo- Lexical semantics is often loosely described as the study of word
nomic organization at the basis of semantic ield and network meaning, but both word and meaning require more precise
approaches. Such relations can also be deined in terms of min- deinition in this context. he term is usually used to describe the
imal diference. For example, the basic color terms are similar study of lexical, or content, words or lexemes (including nouns,
444
Lexical Semantics
verbs, adjectives), rather than grammatical words (conjunctions, are a major force in determining the form of a language and that
determiners), which are more usually studied in the context of lexicon and grammar do not constitute completely separate
sentential (or propositional) semantics. Lexical semantics can types of linguistic knowledge. hus, such approaches are usually
also refer to the semantics of non-word lexical items, such as lexico-centric. 4) Lexical concerns have also been at the forefront
idioms. he meaning aspect of lexical semantics most often of computational linguistics, in part because lexically
refers to denotative sense in particular that is, determining driven approaches to grammar have proved most computation-
what such words can and cannot refer to, as opposed to their ally practicable. Furthermore, the goals of most computational
connotation or social import. Some of the main issues that con- linguistics programs involve the creation of systems that can use
cern lexical semanticists are the following: language meaningfully thus requiring models of how meanings
might be represented and means to acquire and use such seman-
How is the meaning of a word best represented in a model of
tic information. 5) Meanwhile, advances in computer hardware
the mental lexicon?
and software led to the growth and development of corpus
Are diferent types of representation required for diferent
linguistics, which is particularly suited to the study of words
kinds of meaning?
and their use and has become one of the major methodological
How should multiple interpretations of a single word be
tools of lexical semantics and lexicography.
described and explained?
his conluence of diverse motivations and assumptions has
How are diferent words meanings related to one another?
contributed to the variety of approaches to lexical semantics and
to contrary positions on major questions in the ield.
Lexical Semantics in Linguistic Theory: Historical Context
Word meaning is a long-standing area of interest in philosophy Major Distinguishers of Theoretical Approaches
and, of course, lexicography, but attention to it in theoretical Any semantic theory must say something about the representa-
linguistics has varied by time and place. In recent decades, lex- tion of lexical meaning, and theories of lexical meaning must it
ical semantics (and lexicology more generally) has experienced into theoretical accounts of the meaning and grammar of larger
revitalization after a slow period in the early to midtwentieth constituents. hus, in a sense, there is no such thing as a free-
century. For instance, Noam Chomsky (1965, 84) described the standing lexical semantic theory, but instead there are theories
lexicon as simply an unordered list of all lexical formatives, of meaning that pay more or less attention to representation at
and Leonard Bloomield claimed that we have no way of dein- the lexical level.
ing most meanings and therefore the linguist cannot deine heories thus difer in the extent to which they take the lexical
meanings, but must appeal for this to students of other sciences or sentential meanings as their starting point. hose that start at
(1933, 1446). Of course, inluential lexical semantic work was the lexical level of analysis, for example, Anna Wierzbickas (e.g.,
pursued in this period, but much of the work in the generative 1996) Natural Semantic Metalanguage approach, are sometimes
tradition (e.g., Katz and Fodor 1963 and generative seman- criticized for lack of attention to the way that word meanings
tics in the 1970s) ran into problems of internal consistency or combine in order to create sentence meanings. On the other
explanatory insuiciency. Other semantic work was pursued in hand, those that start with complete propositions in mind, for
Europe by practitioners of structuralism and functional example, Ray Jackendofs (e.g., 1990) Conceptual Semantics, tend
linguistics or by philosophers of language. to focus on the aspects of word meaning that interact with each
Nowadays, the lexicon is central to most major theories of other in sentential contexts (particularly the relation between
language. here are several reasons (presented here in no par- predicate and argument), but not with more detailed
ticular order) for the (re)emergence of the lexicon and lexical nuances of meaning, as would interest a lexicographer.
semantics in linguistic study: 1) Psychological experimentation One of the most basic issues for a lexical semantic approach
in the 1970s (particularly by Eleanor Rosch) provided evidence is the issue of whether lexical meanings should be distinguished
that word meaning is represented in the mind quite diferently from concepts. hat is to say, is the meaning of a content word,
from the way that it is represented on a dictionary page that like apple, diferent from our conceptualization of the category
some lexical meanings might be based on prototype repre- apple? Is knowing about apples diferent from knowing the
sentations. Since the goals of general linguistics were, by this meaning of apple? hose theorists who think that lexical meaning
time, mostly concerned with the mental representation of is diferent from conceptualization generally make the distinc-
language, such evidence needed to be integrated with linguistic tion between deinitional and encyclopedic aspects of meaning.
theory more generally. 2) Since the 1980s, theories of grammar On this view, only deinitional meaning that which is suicient
have become more lexically driven (e.g., head-driven phrase to identify a referent and allow for grammatical interpretation of
structure grammar, lexical-functional grammar, the sentential context is relevant to lexical semantics. To use
and minimalism). In such theories, the main constraints on a simple example, the deinition of girl would be young female
sentence structure come from the syntactic and semantic human. Encyclopedic information, on the other hand, includes
requirements of the lexemes in the sentence, and lexical struc- other information that comes from our experience of the things
tures and rules account for the types of things that transformations and situations referred to by words. For girl, this might include
did in earlier Chomskyan approaches (cf. transformational information such as may wear pigtails and associated with
grammar). 3) Around the same time, the group of theoreti- the color pink. Jerrold J. Katz and Jerry A. Fodors (1963) desid-
cal approaches called cognitive linguistics was emerging. erata for a theory of linguistic semantics takes the position that
Cognitive linguistic theories hold that semantic considerations encyclopedic information should not be represented as part of
445
Lexical Semantics Lexicography
linguistic (including lexical) meaning. However, many, if not Chomsky, Noam. 1965. Aspects of the heory of Syntax. Cambridge,
most, theories of meaning have moved away from this assump- MA: MIT Press.
tion and treat lexical semantics as involving the interface between Cruse, D. A. 1986. Lexical Semantics. Cambridge: Cambridge University
language and conceptualization. Such approaches are less likely Press.
Fellbaum, Christiane, ed. 1998. WordNet: An Electronic Lexical Database.
to hold that meanings are represented in the mental lexicon (i.e.,
the repository of linguistic information about words), but instead
Jackendof, Ray. 1990. Semantic Structures. Cambridge, MA: MIT Press.
see lexical meaning as represented in the conceptual realm. In Katz, Jerrold J., and Jerry A. Fodor. 1963. he structure of a semantic
this case, lexical semantic theories become intertwined with the- theory. Language 39: 170210.
ories regarding the representation of concepts and must explain Pustejovsky, James. 1995. he Generative Lexicon. Cambridge, MA: MIT
a) whether/how lexicalized concepts difer from nonlexicalized Press.
concepts, and b) how the formal aspects of language interact Ravin, Yael, and Claudia Leacock, eds. 2000. Polysemy: heoretical and
with the conceptual representations of meaning. Computational Approaches. Oxford: Oxford University Press.
Another issue that has divided lexical semanticists (and lexi- Wierzbicka, Anna. 1996. Semantics: Primes and Universals. Oxford: Oxford
cologists) is whether word meanings can be deined on their own University Press.
or whether lexical meaning is derived (at least in part) through
semantic relations (lexical relations) among words. hat
LEXICOGRAPHY
is, does a words meaning emerge (to some degree) through the
relations between words in a languages lexicon? Lexical ield While lexicography is often thought of as a subield of linguis-
theorists and some computational linguists (e.g., the WordNet tics, it is a scholarly discipline in its own right with its own prin-
project Fellbaum 1998) take the position that meaning emerges ciples and practices. his discipline is divided into two subields,
from relations, whereas most working within a componential practical lexicography and theoretical lexicography. Practical
framework presume that lexical semantic relations should be lexicography is concerned with compiling, writing, and editing
explained by a lexical semantic theory, rather than being primi- dictionaries, which serve a double function: as a record of the
tive elements of the theory. vocabulary of the language and as a reference work to meet the
Finally, there is the very big question of how the senses of needs of users for information about words, their usage, and
words are to be represented in a linguistic model. he most com- their spelling. heoretical lexicography is concerned with the
mon approach is to devise a componential semantic metalan- deinition of general principles governing the compilation of
guage, which provides a limited and precise vocabulary for dictionaries.
representing elements of meaning and some form of grammar Dictionaries difer in their selection of vocabulary and other
for combining those elements into more complex meanings. he items that the editors believe merit inclusion, given the size
form that such metalanguages take varies considerably among and purpose of the volume. While most dictionaries use alpha-
theories, and some cognitive linguistics theories eschew meta- betized word lists, certain others such as Rogets hesaurus of
languages as such in favor of representations (for example, the English Words and Phrases are arranged by topic. It lists words
image schema) with more visual-schematic elements than lin- according to the ideas that they express, for example, abstract
guistic ones. What all of these approaches have in common is the relations (existence, relation, quantity, etc.), space (generally,
aim to represent meaning using a restricted set of meaning ele- dimensions, etc.), and intellect (formation of ideas, communica-
ments. hat is, meanings are composed from smaller meaning- tion of ideas, etc.), among others. Words and phrases are listed in
ful parts (often semantic primitives [primes]), and the set of the main body of a thesaurus according to their word class, but
meaningful parts available to a semantic theory is smaller than without a deinition or any information about pronunciation or
the set of lexical items that could be described by such a repre- etymology.
sentational system (cf. language of thought). Dictionaries come in various formats. Besides traditional
print dictionaries, online dictionaries and dictionaries on solid
Conclusion Lexical Semantics Today state media (such as CD or lash memory) have become increas-
he range of lexical semantic research today is extremely varied ingly popular during the last quarter of the twentieth century
in the topics studied, the methodologies applied, and the theo- because they facilitate rapid access to information, cross-refer-
retical assumptions behind them. Unlike some other subdisci- encing, and immediate updates with the latest vocabulary.
plines of linguistics, no particular theoretical approach can be General-purpose monolingual dictionaries are organized
said to be the clear leader in the ield. he advent and develop- alphabetically and use the same language for both the object and
ment of corpus linguistics means that much lexical work is now the means of description. While all dictionaries aim for compre-
based on empirical rather than just introspective evidence, hensiveness, the number and structure of lexical entries depends
and continuing developments in that ield strengthen the value upon the target audience, as well as constraints in funding and
of that evidence. time to complete the dictionary. General-purpose dictionaries
focus on the description of a standard language, aim to provide
M. Lynne Murphy
an exhaustive coverage of the words in a language (abridged
dictionaries focus on somewhat shorter lists), and are typically
more linguistic than encyclopedic.
Bloomield, Leonard. 1933. Language. New York: Holt, Rinehart, and Dictionaries are usually divided into three parts: an intro-
Winston. duction (including instructions on how to use the dictionary),
446
Lexicography Linguistic Relativism
the body (the alphabetically ordered list of entries), and appen- dictionary to get an idea of the microstructure of lexical entries,
dices (other information, such as weights and measures, punc- reviewers generally conduct an arbitrary sampling of diction-
tuation, etc.). he arrangement of the entries of a dictionary ary entries. Depending on the time constraints, this procedure
is referred to as its macrostructure. Dictionaries difer in the may, for example, lead the reviewer to scrutinize every 10th main
placement of homonyms, derived words, compounds, and entry on every 20th page for completeness, clearness, accuracy,
phrases, which can be given independent entries or included simplicity, and modernity. To ensure that the review is based on
in an entry. he layout and organization of the individual entry a representative sample, it is necessary to check that the difer-
is referred to as the microstructure of the dictionary. Each dic- ent parts of speech are adequately represented, that polysemy is
tionary difers in its conventions for structuring lexical entries. taken into account, and that there is a balance between words
Typically, the headword at the beginning of an entry is in bold from both the general and speciic domains. In reviewing dic-
and indented by a few spaces. Bold or italic typeface may be tionaries and devising methods for improving the structure of
used to mark the part of speech at the beginning of a lexical dictionaries, the reviewer notes the presentation of the text on
entry. Some dictionaries include the standard pronunciation of the page, as it plays a signiicant role in inluencing the acces-
headwords and spelling variants. When a word is polysemous, sibility of the content. he range of vocabulary is also important
its senses are often numbered, with the most frequently occur- since users typically expect dictionaries to ofer the latest words
ring sense irst. Similarly, when a sense or a group of senses from the domains of fashion, technology, and business, among
belong to a diferent word class or subclass, the sense(s) are others, while at the same time including variants. Polysemy, the
labeled accordingly, in combination with deinitions, exam- structure of deinitions, usage notes, examples, and etymologi-
ples, and usage notes. More technical, archaic, or obsolete cal information are also important criteria for the evaluation of a
senses and idiomatic phrases usually appear toward the end of dictionarys content.
the lexical entry.
Hans C. Boas
he headword of a lexical entry consists of a lemma (the basic
word form) that conventionally represents all of the inlected WORKS CITED AND SUGGESTIONS FOR FURTHER READING
forms of the unit. Following the headword, lexical entries pro-
vide a deinition of it in the form of a paraphrase (in the same Atkins, Beryl T. S., and A. Zampolli. 1994. Computational Approaches to
the Lexicon. Oxford: Oxford University Press.
language), which is semantically equivalent to it. Lexicographers
Bjoint, Henri. 2000. Modern Lexicography: An Introduction.
aim to ofer deinitions that are simpler than the word itself.
Another goal is to avoid circularity of deinitions, that is, to not Hartmann, R. R. K., and G. James. 1998. Dictionary of Lexicography.
deine two or more lexemes in terms of each other. he most London and New York: Routledge.
common type of deinition is the analytic deinition since it Zgusta, Ladislav. 1971. Manual of Lexicography. he Hague: Mouton.
aims at maximal inclusion and independence from the context.
Synonyms or antonyms are sometimes used as alternatives to
LINGUISTIC RELATIVISM
analytic deinitions because they are short. However, they may
require the dictionary user to look up other deinitions to under- Linguistic relativism refers to the idea that language inluences
stand their meanings. Depending on the scope and aim of the thought and worldview (see also language of thought). In
dictionary, lexical entries also provide examples that illustrate essence, thinking and worldview are relative to the language one
the headwords syntactic behavior or ofer additional semantic learns to speak in childhood. Language, thought, worldview, and
information. inluence are thus key concepts in linguistic relativism.
Metalexicography has the goal to improve the structure Various understandings of language have led to propos-
and content of dictionaries. One way of achieving this goal is als for several levels of relativism, from semiotic relativity when
to be involved in research about lexicography. here are sev- referring to the general faculty of language, to structural relativ-
eral professional organizations and academic journals devoted ity when referring to the grammatical properties of languages,
to metalexicography. Another way is to suggest methods and and to functional relativity when referring to communicational
criteria for reviewing and evaluating dictionaries. Evaluating patterns of interaction within and across languages (Lucy 1997;
and assessing dictionaries, also known as dictionary criticism, Hymes 1966). Most past and current research has concentrated
is diicult because it is not always clear what types of criteria on structural relativity.
should be applied. One solution to this problem has been to Several levels of thought have been posited as potentially
take large sets of dictionary reviews and determine the range under the inluence of language, including the neurologi-
of criteria applied by diferent reviewers in their evaluations. cal, cognitive, and propositional levels. To date, little work has
hese include reversibility, alphabetization, directionality, investigated neurological variation across speakers of diferent
coverage, reliability, currency, redundancy, retrievability, and languages (Gilbert et al. 2006). Most work addresses conceptu-
equivalents. alization by examining cognitive processes, such as memory,
Another diiculty of dictionary criticism is the large number categorization, inference, analogy, and emotions (Lucy
of entries. Reviewers analyze only a small sample of entries or 1992; Levinson 2003; Boroditsky, Schmidt, and Phillips 2003).
focus on particular features of dictionaries. To begin with, they Very few studies, if any, have proposed that language may inlu-
typically focus on the preface of the dictionary that explains how ence the actual content, or propositional level, of thought. his
to use it, who the intended audience is, and what types of infor- idea, known as linguistic determinism, is commonly discredited
mation are included in the main body. After lipping through the as scientiically untenable.
447
Literacy
Concerning the relationship between language and thought, of the code and sociocultural emphases in the cross-linguistic
language has variably been suggested to inluence, impact, study of literacy learning.
shape, mould, condition, limit, or channel thinking. he code approach is the most prominent research frame-
An important issue, then, concerns the scope of these efects. work because of its focus on the universal and language-spe-
Current research has been asking whether language efects are ciic features that can explain the cognitive-linguistic processes
on line in the process of producing and comprehending lan- underlying
guage (see Slobin 1996 on thinking for speaking) or whether
decoding mastering a languages written code relative to its
language efects pervade human cognition (i.e., efects exist
spoken language units. For English, this means that children
whether or not one is engaged in linguistic acts). Most research
must become aware of how letter patterns (graphemes) cor-
has assumed the latter type of language efects on thought.
respond to the smallest segment of their spoken language, the
Finally, note that linguistic relativity was originally formulated
phoneme, as the means for achieving automatic and luent
as a scientiic principle by Benjamin Lee Whorf in 1940 (1956,
word-level recognition.
214, 221). he principle has since then been relabeled the Sapir-
comprehension deriving an overall interpretation of an
Whorf hypothesis, following an article by Harry Hoijer (1954)
authors intended meanings as actively constructed through
referring to Edward Sapir and Whorf, who contributed to the
interactions with the textual medium.
early development of linguistic relativity in the 1920s and1930s
composition the generation and organization of ones own
(Sapir 1985; Whorf 1956).
ideas as expressed through interactions with the textual
Stphanie Pourcel medium.
A contentious debate concerns whether proicient word-decod-

WORKS CITED AND SUGGESTIONS FOR FURTHER READING ing abilities must occur before reading comprehension skills can
develop (known as the simple view of reading) (Vellutino et al.
Boroditsky, Lera, Lauren Schmidt, and Webb Phillips. 2003. Sex, syntax
2007) or whether reading comprehension develops concurrently
and semantics. In Language in Mind, ed. Dedre Gentner and Susan
Goldin-Meadow, 6179. Cambridge, MA: MIT Press. with general spoken language comprehension (Cain and Oakhill
Gilbert, Aubrey, Terry Regier, Paul Kay, and Richard Ivry. 2006. Whorf 2007).
hypothesis is supported in the right visual ield but not the left. A second perspective integrates facets of the code and social-
Proceedings of the National Academy of Sciences 103: 48994. ization frameworks in highlighting purposes and types of literacy,
Hoijer, Harry. 1954. he Sapir-Whorf hypothesis. In Language in especially for alphabetic knowledge. he basic level of literacy is
Culture, ed. Harry Hoijer, 92105. Chicago: University of Chicago alphabetic and functional. Individuals who break the alphabetic
Press. code are able to negotiate daily activities that involve recogniz-
Hymes, Dell. 1966. Two types of linguistic relativity. In Sociolinguistics, ing and accessing known meanings from their spoken language,
ed. William Bright, 11457. he Hague: Mouton.
such as reading street signs or writing familiar food items for a
Levinson, Stephen. 2003. Space in Language and Cognition: Explorations
grocery list. Functional literacy is inadequate for meeting current
in Cognitive Diversity. Cambridge: Cambridge University Press.
standards in either educational or workplace contexts. In con-
Lucy, John. 1992. Grammatical Categories and Cognition.
Cambridge: Cambridge University Press. trast, critical literacy stresses proiciency. Individuals must be
. 1997. Linguistic Relativity. Annual Review of Anthropology capable of using literacy tools competently for learning how to
26: 291312. learn. Proiciency includes knowing how to analyze critical link-
Sapir, Edward. 1985. Selected Writings in Language, Culture and ages among ones prior knowledge the meaning or signiicance
Personality. Berkeley: University of California Press. of a read or written text relative to perspectives expressed and,
Slobin, Dan. 1996. From thought and language to thinking for speak- at the highest level, integrating this information with other texts
ing. In Rethinking Linguistic Relativity, ed. John Gumperz and as the process for generating new questions. his ability to draw
Stephen Levinson, 7096. Cambridge: Cambridge University Press. on and contrast multiple sources of information to formulate
Whorf, Benjamin Lee. 1956. Language, hought, and Reality. Cambridge,
new understandings entails intertexuality.
MA: MIT Press.
A third stance broadens the concept of literacy proiciency
from the traditional code and socialization emphases to mul-
LITERACY tiple literacies. his construct, rooted in the profound sociocul-
tural changes in communication brought about by the digital
Prominent Research Frameworks age, encompasses computer literacy, information literacy, and
Written symbolic codes of languages may be alphabetic, syl- digital media literacy as components.
labic, morphosyllabic, (Perfetti 2003; Ho et al. 2007), or alpha- hese three literacy frameworks are not mutually exclusive.
syllabic (Mishra and Stainthrope 2007). In each case, children Moreover, notions of being literate and their associated stan-
must irst learn how the written code of their language embodies dards will continue to evolve as outcomes of sociocultural inter-
spoken language units (Perfetti 2003, 17). Children must also actions with new technologies.
have language socialization experiences that promote thinking Since literacy knowledge originates from spoken language
and talking in more literate ways if they are to achieve academic knowledge, the study of literacy crosses multiple disciplines and
language proiciency (Wilkinson and Silliman 2000) or linguis- subareas. Language studies range from language-learning
tic literacy (Ravid and Tolchinsky 2002). Contemporary research environment; theory of mind and language acqui-
frameworks and their related studies relect diferent aspects sition; word meaning; the mental lexicon; and acquisition
448
Literacy
of syntax (see syntax, acquisition of), to language varia- their interactions for proicient literacy learning remain uniden-
tion and second language acquisition. Literacy learning tiied (Cain and Oakhill 2007).
processes are also subjects of developmental language study
from difering viewpoints. Subareas include phonological Current State of Research
awareness, reading, composition, spelling, distinctions OVERVIEW OF THEORY. Contemporary research on literacy has
between the spoken and written communication modes, and the been catalyzed by two general theoretical traditions on the
efectiveness of teaching reading and teaching writing human capacity for knowledge: Sociocultural science and cog-
in educational programs that, internationally, span alphabetic nitive science. Each involves an efort to build a comprehensive
and nonalphabetic languages. Furthermore, literacy research and coherent account of human knowledge capacity, but difer
has expanded to incorporate neuroimaging in order to iden- in their view of how knowledge accrues to individuals.
tify neurobiological correlates of dyslexia and efects on brain he sociocultural tradition, which encompasses the language
function of scientiically based reading (SBR) interventions. socialization framework, counts its origins in American prag-
Behavioral studies have examined associations among oral lan- matism, such as that of William James and John Dewey. he
guage impairment, dyslexia, and text comprehension and related key concept is that human knowledge is embedded in the social
disorders of reading and writing (for reviews, see Cain and physical context; socioculturalists view the individual within
and Oakhill 2007; Scarborough 2005). a speciic social context as the fundamental unit of analysis for
studying human learning and development. Diferent strands of
Modern History the sociocultural tradition emphasize alternate ways of social/
Research on literacy learning is relatively new, initiated in cultural analysis (e.g., interpersonal exchanges versus broad cul-
the 1970s. Since its inception, one major principle has guided tural patterns versus local sociopolitical power hierarchies). Two
this work: Children should have signiicant home and school of these variations, social constructivism and participation/prac-
opportunities for the integration of oral and written language tice theory, have played signiicant roles in language and literacy
experiences. hese experiences support the development of lit- research. Sociolinguistic approaches have been prominent in
erate stances in comprehension, speaking, reading, and writing. academic discourse studies, whereas critical theory, which advo-
Becoming literate is a social process, inluenced largely by chil- cates for social change and empowerment, has played a major
drens search for meaning. Prior to school-based reading and role in literacy studies.
writing, childrens engagement in literacy-like actions in play, he cognitive science tradition traces its origins to the nine-
such as scribbling on paper with crayons, and with adults, for teenth-century studies of individual diferences in perceptual
example, storybook reading, forms the foundation for later liter- processing (e.g., Wilhelm Wundt). Modern-day approaches,
acy learning. however, highlight the mechanisms by which people process
Initial research on literacy was strongly shaped by studies of and integrate information, and the unit of analysis tends to be the
classroom language. Nearly four decades ago, sociolinguists individual. he cognitive science tradition also represents a vari-
launched a new direction for inquiry into language and literacy ety of conceptual approaches sharing the premise that the indi-
learning, focusing on oral language use in classrooms. he irst viduals real-time information processing should be the focus
research concentrated on language functions, the communi- of inquiry. Diferent approaches vary from a strong nativist (see
cative demands of classrooms, individual diferences, and the innateness and innatism) and representational perspec-
social basis and social integration necessary for learning. Initial tive to the primacy of the emergence of knowledge from system
reading studies addressed assessments of reading comprehen- experience interactions, as stated by the connectionists. For
sion, whereas current studies emphasize efective instructional example, neuroimaging studies range from information process-
models of decoding and reading comprehension. In the United ing, which emphasizes the processing constraints and multilevel
States with the 2001 passage of the No Child Left Behind Act integration of information, to connectionism, which investigates
(NCLB), federal educational policies for the irst time exerted a parallel processing and the extraction of inherent regularities
profound inluence on the way that beginning reading is taught. from the input.
he expectation was that explicit SBR instruction would guide
reading practices and curriculum development in phonemic ELABORATION OF APPROACHES. he cognitive science approach
awareness, decoding, and luency. in which the code framework is embedded has been most inlu-
A broader array of language-related features is implicated in ential in revealing a) the precursory phonological awareness
more literate spoken and written language uses beyond phone- knowledge needed for beginning reading across alphabetic
mic awareness, however. hese include the scope and density of and nonalphabetic languages and necessary language-speciic
vocabulary knowledge, command of more advanced syntactic knowledge (e.g., how the consonant cluster patterns of spoken
constructions applied to diverse reading and writing purposes, Czech inluence phonemic awareness development) (Caravolas
and familiarity with a variety of narrative and expository dia- and Bruck 1993); b) the instructional design and content that
logue structures and their organization. Non-language factors best facilitates beginning reading in struggling readers; and
also contribute, such as working memory capacity for difer- c) neurobiological signatures of dyslexia. he focus in instructional
ent kinds of information-processing demands, the motivation to studies is on experimental investigations as the scientiic basis for
learn, inferencing and integration, and metacognitive strategies determining the treatment validity of instruction to prevent read-
for the self-regulation of literacy learning. he causal relation- ing problems in grades 1 to 3. hese studies, conducted primarily
ship of these language and non-language-related variables and in the United States, employ randomized controlled ield trials,
449
Literacy Literariness
often using a response to intervention model to determine the Cain, Kate, and Jane Oakhill, eds. 2007. Childrens Comprehension
eicacy of outcomes in alphabetic reading. While diferences exist Problems in Oral and Written Language: A Cognitive Perspective. New
in the form of response to intervention designs, all involve a hier- York: Guilford.
archical process of alphabetic instruction and ongoing reading-re- Caravolas, Markka, and Margaret Bruck. 1993. he efect of oral and
written language input on childrens phonological awareness: A cross-
lated assessments. Minimal responsiveness may mean that a child
linguistic study. Journal of Experimental Child Psychology 55: 130.
requires special education support to be successful.
Ho, Connie, David W. Chan, Kevin H. Chung, Suk-Han Lee, and Suk-Man
he cognitive-experimentalist approach is not without criti- Tsang. 2007. In search of subtypes of Chinese developmental dys-
cism, particularly as this research is relected in the NCLB goal lexia. Journal of Experimental Child Psychology 97: 6183.
that all grade 3 children will read proiciently by 2014. One cri- Mishra, Ranjita, and Rhona Stainthrope. 2007. he relationship between
tique pertains to individual diferences. Given the individual phonological awareness and word reading accuracy in Oriya and
diversity in neurobiological makeup and sociocultural experi- English: A study of Oriya-speaking ifth-graders. Journal of Research
ences, it is not possible to erase normal variation in the distri- in Reading 30: 2337.
bution of reading ability. Instead, normal variation should be Perfetti, Charles. A. 2003. he universal grammar of reading. Scientiic
treated as an asset to build upon and not as a liability (Berninger Studies of Reading 7: 324.
and Richards 2002). Ravid, Dorit., and Liliana Tolchinsky. 2002. Developing linguistic liter-
acy: A comprehensive model. Journal of Child Language 29: 41747.
In contrast, sociocultural approaches converge on the belief
Scarborough, Hollis S. 2005. Developmental relationships between lan-
that literacy learning does not consist exclusively of recruiting
guage and reading. In he Connections between Language and Reading
neurobiological and cognitive events inside the learners head. Disabilities, ed. H. Catts and A. Kamhi, 324. Mahwah, NJ: Lawrence
A signiicant question is how academic discourse serves as the Erlbaum.
social mechanism for children to learn how to do literacy Silliman, Elaine R., Louise C. Wilkinson, and Maria R. Brea-Spahn.
and advance their language learning as members of the larger 2004. Policy and practice imperatives for language and literacy
school literacy culture. Study designs are typically descrip- learning: Who shall be left behind? In Handbook on Language and
tive or quasi-experimental. A limitation of the sociocultural Literacy: Development and disorders, ed. C. Stone, Elaine R. Silliman,
tradition is that descriptive studies function best to generate B. Ehren, and K. Apel, 97129. New York: Guilford.
new hypotheses about causal mechanisms but cannot yield Vellutino, Frank R., William E. Tunmer, James J. Jaccard, and RuSan
Chen. 2007. Components of reading ability: Multivariate evidence for
broader generalizations unlike randomized controlled trials.
a convergent skills model of reading development. Scientiic Studies
However, future literacy research may lead to mixed-methods
of Reading 11: 332.
approaches that combine the tools of the cognitive and socio-
Wilkinson, L.C., and E. R. Silliman 2000. Classroom language and lit-
cultural sciences. eracy learning. In Handbook of Reading Research. Vol. 3 Ed. M. Kamil,
P. Mosenthal, P. Pearson, and R. Barr, 33760. Mahwah, NJ: Lawrence
AN EXAMPLE OF INDIVIDUAL DIFFERENCES. Literacy research in Erlbaum.
both the cognitive and sociocultural sciences examines group
diferences. However, classrooms consist of particularized difer- LITERARINESS
ences. he translation of group-diferences research into every-
day practices that meet the learning needs of individual children his term refers to the perceived distinctive quality of the lan-
is far from an easy task. In any third grade in the United States, guage of literary, as opposed to nonliterary, texts. If the linguis-
some children are still struggling with luent decoding and spell- tic study of literature attempts to understand how linguistic
ing; others have no problems with decoding but face signiicant form is adapted to literary purposes, then identifying a text as
comprehension barriers when reading expository texts; and still literary based on its language is one of the central problems.
others may be exerting great efort to unravel the complexity of here is currently among scholars of literature, both linguists
academic discourse demands, which then impedes their lan- and literary critics alike, little agreement about the status of lit-
guage and literacy learning. No uniform set of reasons accounts erature as language. Despite this lack of consensus, or maybe
for these individual patterns. Some patterns may be grounded because of it, the investigation of literature using linguistic
primarily in sociocultural experience, such as less familiarity models has become a productive ield within applied linguis-
with academic discourse; other patterns may involve complex tics. he question of literariness is closely tied to the modern
interactions of neurobiological, cognitive, social, linguistic, and concept of literature and its history, and so it seems best to pro-
communicative factors. ceed by examining irst the history of the concept of literature
While much has been learned about literacy processes, two before turning to central issues that the question of literariness
signiicant educational challenges persist: understanding varia- has raised.
tions in individual proiles and how to craft evidence-based prac- Although most cultures consider verbal art a separate,
tices that will assist individual children to become full members recognizable class of speech, its rendering into print has sub-
of their larger literacy communities. jected it to the transforming efects that mark the inluence
of print on every aspect of modern culture (see print cul-
Elaine R. Silliman and Louise C. Wilkinson ture ). Literature (from Latin littera letter) in its restric-
tive, modern English sense of imaginative writing in the main
WORKS CITED AND SUGGESTIONS FOR FURTHER READING genres of poetry, prose, and drama which has claim to con-
Berninger, Virginia W., and Todd L. Richards, 2002. Brain Literacy for sideration on the ground of beauty of form or emotional efect
Educators and Psychologists. San Diego, CA: Academic Press. (Oxford English Dictionary), arose in the nineteenth century in
450
Literariness
conjunction with the increasing availability of authorship as a important element in the organicist metaphors of literary struc-
profession. Before the advent of movable type printing, to write ture developed principally by Robert Penn Warren and other
or copy a book required a great investment of time and energy, so-called New Critics in the United States, though by no means
and so only highly valued items would have been widely circu- unique to them. Organicist approaches see the form of a text as
lated. Even after print technology began slowly to difuse into highly responsive to its meaning, and the New Critics especially
the wider culture, it was never doubted that only important valued the ability of a text to reconcile within itself the various
works would ever be printed, distributed, and saved. he rise of strands of meaning that its language evokes. In semiotic terms,
industry and the middle class, with increased literacy rates and an icon is a sign in which the signiier somehow resembles its
an expanded market for writing, saw the normative concept of referent. Iconicity operates to a limited extent in language, for
literature emerge essentially as a means for diferentiating tra- example, in onomatopoeia, where a word sounds like the thing
ditionally sanctioned texts from those of supposedly ephemeral it represents, as is often the case for linguistic representations
quality. he reading and study of the superior texts were proof animal sounds English cows moo and sheep baa (but
moted in secondary and university curricula as proitable for see Haiman 1985 for arguments that iconic signs are widely
cultural improvement and development of national identity exploited in language). In poetry, sounds sometimes mimic the
(see nationalism and language ). thing being described, as in these lines from Tennysons he
Arguments for the suitability of literature for this project Princess:
depended for their success on demonstrating that the privileged
he moan of doves in immemorial elms
texts possessed certain inalienable qualities. Matthew Arnold,
And murmuring of innumerable bees (7.2212)
an English school inspector and poet, argued in 1880 that the
superior character of truth and seriousness in the matter and he repeated nasal consonants suggest the hum of bees on a
substance of the best poetry, is inseparable from the superiority summer afternoon.
of diction and movement marking its style and manner ([1880] Iconicity often contributes to thematization, the acting out
1988, 416). For Arnold, and the liberal humanism he has come to by a texts words or other speech elements of a particular theme
represent, the reading of literature functions to stabilize soci- given semantically in the text. he speaker of George Herberts
ety because it educates citizens in supposedly universal sociobe- Deniall, for example, laments that his prayers are not being
havioral norms. heard by God:
Like Arnold, most inluential theorists in the irst half of the
When my devotions could not pierce
twentieth century never questioned the status of literature as
hy silent eares;
an identiiable and beneicial form of linguistic behavior. hey
hen was my heart broken, as was my verse:
expected that the literary text could be diferentiated from non-
My breast was full of fears
literary texts by some constellation of intrinsic linguistic char-
And disorder:
acteristics, the discovery of which, Roman Jakobson argued in
1921 when he coined the term literariness, should be the goal of he inal line of the stanza does not rhyme, nor does it match the
literary linguistics. But when formalist investigators failed over iambic rhythm established in the irst four lines. he disorder
time to adduce a convincing set of characteristics necessary and spoken of in the line is acted out by the lines lack of sonic it,
suicient for identifying literature, attention turned to the role of implying in the process a parallel between the form of the prayer
extrinsic factors, such as audience and medium, in establishing and its failure to penetrate. In the inal line of the inal stanza the
literary distinctiveness. Many literary critics have resolved the prayer at last comes around:
problem by historicizing the concept of literature itself, arguing
O cheer and tune my heartlesse breast,
that the search for intrinsic features determinate of literariness
Deferre no time;
cannot be successful because wherever or whenever the cate-
hat so thy favours granting my request,
gory of literature arises, it does so from speciic sociocultural
hey and my minde may chime,
forces that situate readers diferently with regard to the purposes
And mend my rime.
attributed to the literary texts within the cultural or theoretical
discourse that promotes the concept. Approaches grounded in When the speakers mind is aligned with Gods wishes, his prayer
linguistics, however, prefer the term verbal art for their subject also becomes formally complete. Iconicity is one way that pat-
because they also question the suiciency of the traditional con- terns of linguistic elements can contribute to larger patterns
cept of literature to account for the variety of genres developed of meaning. Poetic iconicity is a local phenomenon, however,
around the world, by principally oral cultures, that function in lit- which depends on the immediate environment of the signs
erary ways for the culture in question. Most contemporary study involved. he phrase and disorder is not without rhythm, nor is
can thus be characterized as interactionist, refusing to privilege it unrhymable, but in the context Herbert has created, the phrase
intrinsic or extrinsic characteristics but understanding textual stands out for its ill it. Organicist approaches value the extent to
genre categorization as a complex process involving interaction which the local patterns of signiicance can be reconciled to one
among text, reader, performance situation, and various sociocul- another, making each poem a coherent emotional and semantic
tural practices. whole. In the inal stanza of Deniall, the local pattern of end
Some intrinsic characteristics that have been recognized rhyme acts out an additive semantic logic, so that the notions
as occurring across time and languages in texts that become of chime and time together constitute the meaning of the word
literary include iconicity and defamiliarization. Iconicity is an rime, and at the same time the words function sonically within
451
Literariness
the rhyme scheme to act out the metonymy whereby my rime Strauss, he and his co-authors exhaustively catalog patterns of
refers to the poem as a whole. linguistic or structural elements within a poem, often with no
he ideological content for the New Critical emphasis on attempt to discover motivations for individual patterns except to
unity and coherence in literature is illuminated by comparison observe their interaction as formal patterns. hey aim to demon-
to Viktor Shklovskys inluential theory of ostranenie defamiliar- strate that the whole poem is essentially a tissue of many such
ization. Shklovsky was one of several scholars and writers who overlapping and interlocking patterns, a complex and indivis-
met informally in Moscow and St. Petersburg. Often referred to ible totality where a perpetual interplay of sound and mean-
as the Russian Formalists, this group was the irst to mix literary ing establishes an analogy between the two facets, a relationship
analysis with an increasing awareness of the status of the liter- either paronomastic and anagrammatic, or igurative (occasion-
ary text as a linguistic object and, therefore, subject to descrip- ally onomatopoeic) (1980, 23).
tion by speciically linguistic tools (see Erlich 1955). Shklovsky Objections to internalist theories of literariness include skep-
argued that the distinguishing characteristic of the literary text ticism about any readers being able to detect all or even many
was that it defamiliarized items or events the perception of which of a poems linguistic patterns, as well as observations that some
had become automatized by the reader due to familiarity or passages of nonliterary prose contain as many patterns as poetry.
repeated exposure. Leo Tolstoy, as Shklovsky points out in sev- Indeed, the strongest reactions against internalist theories con-
eral examples, makes the familiar seem strange by not naming cern the genre of prose. Internalist theories of literariness often
the familiar object, like the description in War and Peace of an elevate the importance of poetry because as a genre, it is maxi-
opera scene as pieces of painted cardboard, or when the nar- mally distinct from everyday language. As a result, objections
rator of Shame describes the sequence of actions in a logging to these theories are often concerned with accounting for the
without using the word ([1917] 1988, 21). Shklovsky called these characteristics of prose genres. Prose generally has observably
and similar techniques of creative distortion prim devices, fewer sonic foregrounding devices and is more likely than poetry
and Jakobson went so far as to claim in 1921 that if literary his- to make use of devices that also appear in everyday language,
tory wants to become a science, it must recognize the artistic such as irony, metaphor, or repetition. here is some evidence
device as its only concern (quoted in Erlich 1955, 57; see also that these devices occur with greater frequency in literary than in
foregrounding). nonliterary prose, and that there are some linguistic forms, such
he young Jakobson, whose speciically linguistic theory of as free indirect speech, that tend to occur only in literary prose
poetics eventually became widely inluential, believed that he (see Miall 2006 and narrative). On the balance, however, liter-
could justify his admiration for the futurist poetry of Velimir ary prose tends to require some attention to elements of literary
Khlebnikov by explicating in linguistic terms the complex, sug- interaction that are extrinsic to the text in order to account for
gestive, phonemic, and morphemic patterning of poems heavy in literariness.
neologisms, like Kuznechik he Grasshopper (Jakobson 1987, One important class of extrinsic approaches, often termed
252). Khlebnikovs poetry, and that of other so-called Russian reader response theories, focus on the reader as the source of the
Futurists, so clearly eschewed traditional forms of poetic prac- distinctiveness of literature. Perhaps the strongest version of this
tice that there was no question of its being considered canoni- extrinsic theory is ofered by Stanley Fish (1980), who describes
cal. Like Arnold, the Futurists were interested in distinguishing an impromptu experiment (later executed to the same ends with
true literature from the products of the mass marketers, whom readers from three continents) in which he told undergraduates
they called traitors, and they also identiied a strong nation- that a list of linguists on a classroom chalk board was a poem
alistic purpose in reading and writing literature. Unlike Arnold, and asked them to interpret it, which they had no trouble doing.
however, they saw literature as having a revolutionary, rather his demonstrated for Fish that literariness was wholly a func-
than a stabilizing, purpose (inluenced perhaps by the difer- tion of prior reader commitment, rather than of anything within
ing cultural conditions prevailing under the unstable czarist the text. Readers learn interpretive practices from the communi-
regime in Russia and the imperative to govern as a global power ties of which they are members, and so what counts as literary is
in England). what the community has determined to be so. Although Fishs
As Jakobsons poetics matured, he increasingly saw the role description of the interpretive community has not generally been
of the device in terms of a Peircean semiotics. he devices do not retained, his position on literariness became, until very recently,
function alone to interrupt the direct awareness of the iden- the default standard within literary theory. Feminist and postco-
tity between sign and object (1987, 378), but the whole text is lonialist challenges to the hegemony of Western literary cultural
revealed as a system of systems of equivalences, that, through practice, demonstrating that texts created and valued by a domi-
similarities and contrasts at all levels of linguistic organization, nant class will be read quite diferently by less privileged classes,
up to and including the arrangement of the entire text, display helped establish that a precise mode or history of reception can-
the text as primarily interested in the linguistic medium, the not be inferred from the text itself (Harrison 2005, 7). Evacuating
materiality of its linguistic signs. his approach resembles the the text of any determining role in its own reading allows critics
organicist metaphors of the New Critics in that linguistic form to explore how the text participates in the various discourses of
is motivated by poetic meaning, and devices are valued less for power circulating at the time of its writing and reception.
their interruption of usual relationships of signiicance than for Current theories of literariness that utilize linguistic tools for
the surplus of meaning that iconic relationships create. analysis are unwilling to locate the determining characteristics
In Jakobsons best-known literary-critical essays, however, speciically within the text or the reader, but generally see textual
such as Baudelaires Les Chats, written with Claude Levi- genre categorization as a complex process involving interaction
452
Literariness Literary Character and Character Types
among text, reader, and various sociocultural practices. Reuven (2001) have proposed the expansion of the notion of read-
Tsurs sophisticated analysis of religious trance poetry, for exam- ing to include the manipulation of poems to deform them
ple, identiies not only textual devices that contribute to a hypnotic in ways that defamiliarize the poems and prompt new expe-
reading but also diferent types of cognitive styles of individual riences of them, by reversing them (so that they can be read
readers, who are psychologically predilicted to react to certain backwards), rearranging the words or lines, even removing or
environmental variables in diferent ways and so read texts dif- replacing parts of speech. Such formally interactive processes
ferently. Nigel Fabb, on the other hand, adopts a modularist of reading, increasingly built into hypertext, video games, and
approach, identifying two types of structures in poetry, inherent other forms of electronic textuality, continue to obscure the
and communicated structure. Inherent structure is determinate traditional text/reader division that igured so prominently in
and is processed, rather than interpreted, by the appropriate early notions of literariness. Arguments for the literariness of
module (such as syntax or meter). Communicated structure is nontraditional (i.e., nonprint, nonverbal) formats usually rely
implied by textual evidence in the context of the readers knowl- on some form of the strong sociocultural construction argu-
edge, so that being a sonnet holds of a text by inference, rather ment, yet there are also attempts to canonize some digitally
than as an independent fact about the text. he important thing mediated works over others based on intrinsic characteristics,
for literariness is again that the text provides only some of the suggesting that interactive deinitions of literariness continue
cues necessary for categorizing it as literary. Communicated to prevail for the time being.
structure is dependent on individual learning, so that responses
Claiborne Rice
to any given textual characteristic might readily vary by indi-
vidual. Fabb also argues that literary texts communicate more
about their form than verbal behavior generally does, and so in
a Jakobsonian sense draw more attention to form. Form in this Arnold, Matthew. [1880] 1988. he study of poetry. In he Critical
sense takes on the characteristics of meaning (because implied); Tradition, ed. David H. Richter, 41116. Boston: Bedford.
therefore, form is more likely to be ambiguous, indeterminate, Attridge, Derek. 2004. he Singularity of Literature. London: Routledge.
Bauman, Richard. 1984. Verbal Art as Performance. Long Grove,
metaphorical, or ironic in literary texts.
IL: Waveland.
Emotional responses to literature have always been impor-
Erlich, Victor. 1955. Russian Formalism. he Hague: Mouton.
tant for the identiication of literariness. Indian theories of Fabb, Nigel. 2002. Language and Literary Structure. Cambridge:
rasa, for example, categorized texts by the type of emotion they Cambridge University Press.
prompted (see dhvani and rasa). Fresh empirical approaches Fish, Stanley. 1980. Is here a Text in his Class? he Authority of
to the issue of literariness have focused on the role of emotion in Interpretive Communities. Cambridge: Harvard University Press.
literary response. Miall identiies literariness as a combination Haiman, John. 1985. Natural Syntax. Cambridge: Cambridge University
of formal elements in the literary text and the array of responses Press.
these initiate when read (2006, 144). Readers who encounter Harrison, Nicholas. 2005. Who needs an idea of the literary? Paragraph
foregrounding devices experience them as defamiliarizing, a 28.2: 117.
cognitive response that has an associated afective dimension. Herbert, George. [1633] 1974. Deniall. In he English Poems of George
Herbert, ed. C. A. Patrides, 96. London: J. M. Dent & Sons.
Subsequent changes in understanding seem to be guided largely
Jakobson, Roman. 1980. A postscript to the discussion on grammar of
by the feeling evoked by defamiliarization, especially the feeling
poetry. Diacritics 10.1: 2135.
of self-modiication that, Miall argues, accompanies the recon- . 1987. Language in Literature. Ed. Krystyna Pomorska and
textualization of the defamiliarized concept (see literature, Stephen Rudy. Cambridge: Harvard University Press. his text collects
empirical study of). Jakobsons writing on literary topics throughout his career.
In the last decade, literary critics interested in the ethical McGann, Jerome J., and Lisa Samuels. 2001. Deformance and interpre-
dimensions of literary reading have begun to reconsider, to some tation. In Radiant Textuality, 10536. London: Palgrave Macmillan.
degree, the importance of the defamiliarizing efect of literature. Miall, David S. 2006. Literary Reading: Empirical and heoretical Studies.
Derek Attridge (2004) has recuperated the concept of literariness Frankfurt: Peter Lang.
for literary studies by identifying it with the inventiveness that a Shklovsky, Viktor. [1917] 1988. Art as technique. Trans. Lee T. Lemon
text manifests whenever it is enacted by a reader. For Attridge, and Marion J. Reis. In Modern Criticism and heory, ed. David Lodge,
1630. London: Longman.
literariness inheres not in some fundamental unchanging core in
Tompkins, Jane P., ed. 1980. Reader-Response Criticism: From Formalism
the work but in the inventiveness it shows over time, because it
to Post-Structuralism. Baltimore: Johns Hopkins University Press.
remains open to change and porous to new contexts, continuing hough somewhat dated, this remains an excellent introduction to the
to introduce what is unknown into the known. Readers experi- range of ideas usually identiied as reader-response.
ence that inventiveness anew when they perform the work, and Tsur, Reuven. 2003. On the Shore of Nothingness: A Study in Cognitive
they desire to do justice to it by shifting the norms and habits Poetics. Exeter: Imprint Academic.
they use for dealing with the world.
As literature undergoes remediation into digital for-
LITERARY CHARACTER AND CHARACTER TYPES
mats (and it should be noted that the Russian Formalists were
applying Shklovskys concept of ostranenie to ilm analysis Discussions of character types date back to Aristotle and appear
already before 1920), the notion of literariness is undergo- within numerous brands of literary criticism. Within the context
ing concomitant reinement. Motivated in part by new media of the language sciences, however, character types have particu-
forms of verbal art, Jerome J. McGann and Lisa Samuels lar relevance to two strands of narratology.
453
Literary Character and Character Types Literary Universals
he irst strand, which David Herman refers to as classical or narrative universals, which is modeled in part on the
narratology (see also generative poetics and narrative, study of language universals, focuses more on the construction
grammar and), irst lourished in the 1960s. hese linguistic than the reception of texts. Such approaches tend to treat char-
approaches to literature heavily inluenced by Vladimir Propps acter types as narrative components that carry out action neces-
Morphology of the Folktale (originally published in 1928 but not sary for certain prototypical plot structures (see prototype). At
available in translation until some 30 years later), the works of the same time, writers adopting this approach (e.g., Hogan 2003)
Algirdas Julien Greimas, and structuralism in general treat also consider the emotional efects elicited by diferent forms of
character types as functional units of meaning. Propps seven characterization in terms of empathy and identiication. In addi-
classiications of character types were determined by the roles tion, recent work by such writers as Toolan suggests that certain
that characters commonly occupy in Russian folktales: the hero, contemporary grammars might provide better models of char-
the villain, the helper, the donor (provider of magical agents), acter types than the early generative grammar and trans-
the sought-for-person and her father, the dispatcher, and the formational grammar upon which the theories of classical
false hero. Building upon Propps model, Greimas claimed that narratology relied. For example, Toolan examines characters in
characters were signiicant for the actantial roles they perform terms of the meaning-oriented grammar detailed by Michael
within a narrative. In Structural Semantics ([1966] 1983, 198), Halliday, which considers the diferent types of participants
he described actants as embodying a small number of roles in actions. hese participants include a medium (the afected
in the drama of discursive utterances. Greimass three pairs participant), agent (a participant acting intentionally), force
of actants subject/object, sender/receiver, and helper/oppo- (an inanimate agent), instrument (participant controlled by the
nent were intended to correspond to grammatical concepts. agent), beneiciary, and recipient (see also thematic roles).
Subjects (characters who do the action) and objects (characters
Karen Renner
who undergo the action) are clearly related to the equivalently
named sentence constituents, while helper and opponent can be
regarded, in Mieke Bals words, as adverbial adjuncts ([1985]
1997, 201). he category of sender/receiver proved most prob- Bal, Mieke. [1985] 1997. Narratology: Introduction to the heory of
lematic: While it attempted to supplement Propps dispatcher Narrative. Toronto: University of Toronto Press.
with Roman Jakobsons distinction between the initiator of a Culler, Jonathan. 1975. Structuralist Poetics: Structuralism, Linguistics,
and the Study of Literature. Ithaca, NY: Cornell University Press.
communication (e.g., a speaker) and the addressee of that com-
Contains an excellent survey of structuralist approaches to narrative,
munication (the receiver), the relationship between those linguis-
including a section on character, pp. 2308.
tic concepts and the corresponding character types was unclear.
Gerrig, Richard J., and David W. Allbritton. 1990. he construction of lit-
Critics have since expanded on the theoretical foundations laid erary character: A view from cognitive psychology. Style 24: 38092.
by Propp and Greimas, and such work has provided useful gener- Greimas, A. J. [1966] 1983. Structural Semantics: An Attempt at a
alizations about characters as structural units of meaning within Method. Trans. Daniele McDowell, Ronald Schleifer, and Alan Velie.
narratives (see Schleifer and Velie 1987, for instance, for a typol- Lincoln: University of Nebraska Press.
ogy of literary genres based upon Greimass receiver-actant). Herman, David. 1997. Scripts, sequences, and stories: Elements of a
Furthermore, Bals use of semantic axes and Michael Toolans postclassical narratology. PMLA 112: 104659.
semantic feature analysis ofered ways to account for the less Hogan, Patrick. 2003. he Mind and Its Stories: Narrative Universals and
generalizable details of characterization such as speciic physi- Human Emotion. Cambridge: Cambridge University Press.
Propp, Vladimir. [1928] 1968. Morphology of the Folktale. Trans. Laurence
cal and psychological qualities which were outside the scope of
Scott. 2d ed. Austin: University of Texas Press.
story grammars.
Schleifer, Ronald, and Alan Velie. 1987. Genre and structure: Toward an
Within the second strand of narratology, or what Herman
actantial typology of narrative genres and modes. Modern Language
terms postclassical narratology, considerably more attention has Notes 102: 112250.
been devoted to narrative reception. his change in focus has Schneider, Ralf. 2001. Toward a cognitive theory of literary charac-
occurred in tandem with the rise of cognitive linguistics ter: he dynamics of mental-model construction. Style 35: 60740.
in the 1970s and 1980s, which focused more attention on the Toolan, Michael. [1988] 2001. Narrative: A Critical Linguistic Introduction.
mental processes behind grammatical systems; it also undoubt- 2d ed. New York: Routledge.
edly was inluenced by reader response theories that considered
the impact of a readers experience and expertise, literary and
LITERARY UNIVERSALS
otherwise, upon their interpretations of a text (see interpre-
tive community and competence and performance, In parallel with language universals (see laws of language,
literary). Recently, literary characters have been discussed as universal grammar, and universals, nongenetic,
mental models or schemas (see story schemas, scripts, among other entries), literary universals are, generally speaking,
and prototypes) sets of expectations generated by exem- patterns and structures exhibited widely by works of literature
plars, personal experience, stereotypes, and literary knowl- across various familiar literary boundaries, whether national,
edge which undergo continual modiication as the reader generic, or historical. Literary universals apply to various domains
receives more information about a character (see Gerrig and besides world literature as a whole: individual regional and
Allbritton 1990; Schneider 2001). national traditions; discrete literary forms and genres (poetry,
heorists continue to mine the connections between linguis- drama, narrative, etc.); separate literary histories and periods;
tic models and narrative. Work done on literary universals foundational literary concepts (plot, character [see literary
454
Literary Universals
character and character types], etc.); and common liter- three or more, and general or universal literature examined
ary devices (e.g., metaphor), among others. Literary universals all literatures as a whole (see Wellek and Warren 1977, 4653).
may also be present as correlations across these various scalar Such distinctions correlated with the nineteenth-century evolu-
domains. hus, Aristotles observation that tragedies have a tionary paradigm: Comparative literature is coined by analogy
beginning, a middle, and an end applies to literary works more with the sciences of comparative zoology, comparative anat-
broadly. omy, comparative philology, and so on (see philology and
A speciic call for literary universals appeared in Hogan (1994; hermeneutics).
as a subtype of aesthetic universals), but the general concept he nature of literature as verbal art, and thus cognitively
can be traced to Goethe, if not Aristotle. he Aristotelian bina- founded in language, is not the sole spring of its universality
ries of universal/particular and substantial/nonsubstantial, (see verbal art, evolution and, and verbal art, neu-
when intersected, generate a four-category ontology (see Lowe ropsychology of). he roots of literary universals also lie in
2006). Literary universals include both nonsubstantial univer- the common stock of anthropological development and social
sals (abstract properties and relationships) and substantial uni- behavior, or human universals (Brown 1991). Art is one such
versals: literary kinds (genres) and literary morphology, very human universal and, if the prehistoric impulse of making spe-
generally considered. Examples of the latter include metrical cial is the ultimate origin of all kinds of artistic production and
analysis, biblical form criticism, thematics of creation stories, aesthetic appreciation (Dissanayake 1992), it follows that liter-
Freytags pyramid, Proppian functions, Bakhtinian speech ature, as one of the arts, will also exhibit certain universal and
genres, and so on. Substantial universals implies something nontrivial patterns. To the extent that it stems from the mysteri-
very diferent from the oxymoron concrete universals, some- ous biology of play, literature may re-present any or all of the four
times invoked by New Critics and others, to suggest the possibil- fundamental types of human and biological play described by
ity that literature can entirely transcend the universal/particular Roger Caillois (1961): mimesis (dress-up and lets pretend/
dichotomy. Even when substantial (i.e., measurable, able to mimicry and camoulage), alea (games of chance/random varia-
be cataloged), literary universals work outside of any particular tion), agon (sports and contests/survival of the ittest), and ver-
instantiation or touchstone. tigo (swings and slides/light and chasing). It could also be that
his entry briely discusses ive areas of literary universals: their literature somehow recapitulates the Darwinian drama of sur-
rationale and origins, some basic terminology, their relationship vival (Meeker 1997), and the analysis of basic plots in world lit-
to dominant strands of literary theory, various successful ind- erature (e.g., Polti [1921] 1977) suggests that the same social and
ings and limitations, and possible lines of future investigation. sexual competitions of early human life are repeatedly replicated
he focus remains on universals as they apply directly to liter- in literature. On the other hand, given the diiculty that socio-
ary study, as opposed to universalist models conceived as more biology and evolutionary psychology have in explaining
purely linguistic or cognitive, and which merely invoke literary why the human is so diferent from the rest of the natural world,
terms (e.g., story grammar). In contrast, literary universals more promising ways of sourcing literary universals may lie in
seek to explore what uniies the incredible richness, beauty, and the study of creativity as a universal human phenomenon. Only
diversity of global literatures, past and present. the human seems to actualize the original sense of creature, or
the still-becoming-creation.
Rationale and Origins
As with language universals, the key criterion for literary univer- Terminology
sals is not that they occur in all known literatures (though this he basic vocabulary of literary universals stems from paral-
is possible for absolute universals; see absolute and statis- lels in the theory of language universals. Absolute universals
tical universals), but that they are represented, more often are those that apply to all literatures, past and present. As with
than chance alone would suggest, by literatures that are areally absolute universals of language, these may be few in number
and genetically distinct (see areal distinctness and litera- and diicult to substantiate completely since available informa-
ture), that is, free from the kinds of relations and inluences that tion about both the languages and literatures of the world is far
are to be expected when literatures are linked by literary history from complete. Nevertheless, some absolute literary universals
or geography (Hogan 2003, 1719). hus, if epic can be called a do appear to exist. One simple absolute universal is that liter-
universal genre, it is not so because it was written both by Homer ature (including oral literature or orature) occurs in all known
and by Milton, who knew (i.e., read) Homer, but because it is a cultures. Whether this is historically monogenetic, like language
literary form also recognizable in the Mahbhrata in India, the in Homo sapiens sapiens may be, or polygenetic, like the inven-
Tale of the Heike in Japan, and the Popul Vuh in South America tion of writing systems certainly was, remains an open ques-
all traditions that are areally and genetically unrelated. tion since oral literature long precedes the written record (see
One of the earlier precursors of literary universals is the idea oral composition). Another possible absolute is that all
of Goethes world literature or Weltliteratur, by which Goethe literatures (eventually) develop fundamental generic diferen-
seems to have meant a broad cultural unity whose understand- tiations, such as between poetry and prose (see poetic form,
ing could also lead to global social progress. Weltliteratur was universals of). Another content-oriented absolute is the uni-
one of the inspirations for the discipline of comparative litera- versality of myth (stories of creation, lood, etc.) in the earliest
ture. Relecting its scientiic ambitions, this ield was subdeined recorded traditions. Often, there are striking parallels between
so that comparative literature per se examined the relation- very speciic elements among even the most areally and genet-
ships of two national literatures, world literature compared ically distinct myths, which may lend credence to the existence
455
Literary Universals
of a monogenetic mother literature. For instance, it is possible concrete object of the work of art will elude our grasp (1977, 18).
to reconstruct a protoline of Indo-European epic poetry like he In short, universal formulas have little purchase in any individ-
killed the dragon (Watkins 1995), suggesting that heroic tales ual act of literary criticism. Yet, one page later, the fundamental
are a common origin for global literatures. need for a universal theory reasserts itself: Like every human
Of course, common origin may imply prevalent rather than being, each work of literature has its individual characteristics;
across-the-board. If universals are not absolute, they are statis- but it also shares traits with humanity. [hus, the] characteriza-
tical, that is, occurring more often than chance alone would pre- tion [of its individuality] can be accomplished only in universal
dict. he common distinction between poetry, drama, and iction terms, on the basis of literary theory (1977, 19). he universal/
is a statistical universal because these forms are widely but not particular paradox of literary studies was already in place at the
universally distinguished in the literary traditions of the world. ields birth around the turn of the twentieth century, when the
Universals that correlate (in ways also not inluenced genet- nomothetic/idiographic divisions of the German university were
ically or areally) are typological universals (see typology). dominant, and all disciplines were preconceived in the category
One typological universal may be that if a tradition has a cate- of Wissenschaften, sciences.
gory for noniction, it will also (n.b. the awkward non-) have Linguistic approaches to literature, inaugurated by Ferdinand
a category for iction, or that drama presupposes poetry. de Saussure alongside the birth of modern structural lin-
In other words, such literary categories may function like basic guistics, also imply universals. hough an odd literary digression
color terms (Berlin and Kay 1991), with traditions that difer- from his linguistic theory, the mysterious anagrams Saussure
entiate fewer kinds of literature, including the same few kinds. culled from Latin texts might serve to show that literature
Logical universals are typological universals that are logically involves an equally systematic (and compensatorially non-ar-
entailed by the nature of the given literary phenomena: hus, a bitrary?) selection of signs (even if authorially unintended; see
narrative has only two options for recounting a plot sequence, Starobinski 1979). In any case, the literary situation is an instance
either temporally or out of that temporal order (e.g., a lashback). of language, and thus the linguistic universals of the moment of
his suggests, in turn, a less obvious but important statistical uni- communication will apply and will also be available for artis-
versal: very few plots are atemporal, far from the half expected by tic exploitation. Roman Jakobson (1960) famously delineated
random distribution. how each of six components of the communicative situation
Above all, literary universals are empirical they can be (addresser, message, contact, etc.) could be exploited for difer-
(coldly) documented and are not the products of one cultural ent linguistic and literary purposes (emotive, poetic, referential,
point of view imposed upon another. hus, any of the common etc.), with diferent purposes predictably dominant in diferent
usages of universal(ity) in literary studies that imply norma- literary genres. Among other brilliant readings, Jakobsons per-
tive, hegemonic, or totalizing judgments do not pertain to lit- spicacious exploration of the neuropsychology of the metaphoric
erary universals as discussed here but are instances of critical and metonymic poles of linguistic competence in relation to
contamination that parallel genetic and areal inluence. (For the divergent styles of various Russian novelists (1956) also sug-
further discussion of this issue, along with other terminology of gests that literary universals (pace Wellek and Warren) can illu-
literary universals, see Hogan 2006.) minate seemingly idiographic cases in literary criticism.
he coming of various posts in late-twentieth-century lit-
Universals and Literary Theory erary theory (poststructuralism, postmodernism, postcolonial-
One of the irst universalist schemes in Western literary theory is ism) has added new complexity and urgency to the search for
presented in Platos Ion, which probes the inability of oral poets universals in literature (see Carroll 1995). Even while prioritiz-
to account for their activities. Socrates explains poetic divine ing irreducible diferences, polyvalent identities, and the local
possession by imagining a magnetic chain beginning with the particulars of each text and reader, current literary theory none-
Muse, the oracular lodestone, whose power suspends a descend- theless proceeds by placing literature in the context of grand
ing series of iron rings: original composer, intermediate reciter, sociopolitical, economic, or linguistic structures.
and inal audience. hough meant in the Platonic scheme to
belittle all of art as a derivative and irrational form of knowledge, Findings and Limitations
this doctrine of inspiration has remained a fundamental myth of he domains with the most advanced treatment of literary uni-
literary theory through the twentieth century and beyond, as for versals thus far are poetics (i.e., prosody and meter) and nar-
Muse has been substituted a virtual series of pervasively pow- rative. Literary universals in the study of narrative are treated
erful and likewise subconscious lodestones: psyche, economy, thoroughly elsewhere (see narrative universals, narra-
ideology, identity, empire, and so on. tology, and story schemas). One recent discovery by Patrick
Modern literary theory, however, has taken an attitude Colm Hogan is that narratives seem to fall into three prototype
toward universals that is schizophrenic, tacitly assuming that stories, a signiicant improvement over existing, more reductive
universalizing theoretical modes are possible while vociferously approaches (e.g., Joseph Campbells androcentric monomyth
denying that literary works are anything but contingent idio- of the heros journey).
graphic particulars. Two passages from Ren Wellek and Austin Well before literary universals was coined, Paul Kiparsky pre-
Warrens inluential heory of Literature, just a page apart, are sciently argued for a universal metrics: [A] theory of meter
symptomatic. he irst distances itself from scientiic universal- cannot restrict itself to one poetic tradition, any more than a the-
ism of any kind: [N]o general law can be assumed to achieve ory of grammar can restrict itself to one language. We must make
the purpose of literary study: the more general the more the our theory account for metrical systems of other languages, and
456
Literary Universals
begin to construct a universal metrics (1981, 266). Metrical Future Directions

schemes are by nature universal formulas for generating and As the aforementioned paths are furthered, two broad new ave-
judging individual poems: [W]e can only begin to state invari- nues of investigation present themselves. he irst has to do with
ant facts about the iambic pentameter line if we state them in deining and integrating the fundamental disciplinary unit of
terms of an abstract representation of the line, rather than by ref- analysis of literature. In each discipline, the unit of analysis drives
erence to any of the actual performances of the line (Fabb 2002, the research paradigm, such as event and its time-context for
6). Although the stanza has received some attention, the focus of history, or the atom and its forces in physics. Literary studies are
prosody is the verse line (or line group, e.g., the elegiac cou- blessed, or cursed, with an array of merely informative terms,
plet), afording a vast corpus for analysis. such as character, theme, genre, and reading, of which none by
Other suggestive work on literary universals has been done itself provides the overarching anecdote for literary studies. In
on the level of fundamental literary concepts. Evolutionary psy- practice, moreover, the ield is divided among such competing
chology has conlated Darwinian survival with literary conlict intradepartmental interests as criticism, theory, creative writing,
(the primary problem or adversary to be overcome in a given rhetoric and composition, education, ilm studies, and jour-
story), but this is only one of the bases of literature, being limited nalism. here may nonetheless be a grand theory of literariness
primarily to narrative or dramatic works. In poetry, probably the that unites these disparate subields; if so, it would also likely
most ancient literary form, the dominant literary universals seem reveal the fundamental interconnections among all the sister
to be imagery (regarding content) and meter (regarding form). arts (literature, visual art, performing art, and new media). Will
Some early formalists believed that imagistic language was the there be a new grammatike, the ancient word for literary study,
root of all literature. However, literature is animated by both now a more limited purview of linguistics?
imagistic tropes and also schemes (recalling the classical distinc- he second little-explored territory for literary universals
tion within the rhetorical canon of style), that is, the deliberate (even since Carroll 1995) is the area of diachronic universals
patterning of existing linguistic components (i.e., multitudes of (cf. language change, universals of). For instance, do all
parallelism: rhyme, chiasm, ascending cola, etc.). Schemes literatures begin with poetry and then proceed to prose? Does
are, in Jakobsons vocabulary, either poetic or phatic (i.e., myth typically diverge into history (fact) and iction (imag-
at play with the message itself or its instantiation in a linguistic ination)? Are there other universals of the rise and fall of various
code), rather than referential, iconic, or invoking deliberate genres over time? he word tradition, often used in this entry as
visualization. synonymous with a literature, implies an entire literary history,
Tropes have found other universalizing uses since Peter and the historical mode is one of the literary disciplines oldest
Ramus, Giambattista Vico, and other Enlightenment rhetori- and largest strands. Much of this standing evidence might be
cians elevated four of these traditional igures of thought to the mined for diachronic literary universals.
status of master tropes: metaphor (see metaphor, universals
Christopher M. Kuipers
of), metonymy, synecdoche, and irony. Of these, metaphor
has been most ascendant (see, e.g., Kvecses 2005). he cogni-
tivist theory of conceptual metaphor even suggests that the
fundamental metaphoric principle, to understand one thing in Berlin, Brent, and Paul Kay. 1991. Basic Color Terms: heir Universality
terms of another, applies to all concepts, since the mind is uni- and Evolution. 2d ed. Berkeley: University of California Press.
versally embodied through the constant real-world experience Brown, Donald E. 1991. Human Universals. New York: McGraw-Hill.
of comparison (Lakof and Johnson 1999). Kenneth Burke took a Burke, Kenneth. [1945] 1969. A Grammar of Motives. Berkeley: University
of California Press.
speciic form, the drama, as a metaphor of all social action ([1945]
Caillois, Roger. 1961. Man, Play, and Games. Trans. Meyer Barash. New
1969). Going a step beyond both Aristotles triad and Jakobsons
York: Free Press.
six factors of communication, Burkes pentad of dramatism
Carroll, Joseph. 1995. Evolution and Literary heory. New York: Cambridge
(scene, act, agent, agency, purpose; with a sixth term, attitude, University Press.
supplemented later) was conceived synecdochically so that its Dissanayake, Ellen. 1992. Homo Aestheticus: Where Art Comes from and
terms could pair into any of ten ratios (e.g., the pathetic fallacy Why. New York: Free Press.
is an instance of the scene-act ratio). Fabb, Nigel. 2002. Language and Literary Structure: he Linguistic
Even as previously literary things as metaphor and drama Analysis of Form in Verse and Narrative. Cambridge: Cambridge
show great power for comprehending human thought and soci- University Press.
ety, however, there remains the problem of the way that such Hogan, Patrick Colm. 1994. he possibility of aesthetics. British Journal
indings feed back to illuminate literature per se. If we all use met- of Aesthetics 34.4: 33750.
. 1997. Literary universals. Poetics Today 18.2: 22349.
aphor continually in language, what then sets apart the poetic
. 2003. he Mind and Its Stories: Narrative Universals and Human
metaphor of a Shakespeare sonnet? Is all literature ultimately
Emotion. New York: Cambridge University Press.
dramatic? As the study of literary universals moves forward, it
. 2006. What are literary universals? Literary Universals Project.
must be remembered that ever since Saussurean structuralism Available online at: <http://litup.unipa.it/docs/whatr.htm>.
and Russian (and other Eastern European) formalisms, that the Jakobson, Roman. 1956. Two aspects of language and two types of apha-
fundamental theoretical agenda is literariness (literaturn- sia disturbances. In Fundamentals of Language, ed. Roman Jakobson
ost), rather than the particular literary work (which remains the and Morris Halle, 5582. he Hague: Mouton.
realm of criticism) the langue of literature, as it were, rather . 1960. Closing statement: Linguistics and poetics. In Style in
than the parole. Language, ed. homas Sebeok, 35077. Cambridge, MA: MIT Press.
457
Literature, Empirical Study of
Kiparsky, Paul. 1981. Stress, syntax and meter. In Essays in Modern sound (metrical efects or alliteration), syntax (such as
Stylistics, ed. Donald C. Freeman, 22572. New York: Methuen. ellipsis or inversion), or semantics (metaphor, hyperbole,
Kvecses, Zoltn. 2005. Metaphor in Culture: Universality and Variation. etc.). he Russian Formalist critic Victor Shklovsky, commenting
Cambridge: Cambridge University Press. on the purpose of literary devices, argued that literary art exists
Lakof, George, and Mark Johnson. 1999. Philosophy in the Flesh: he
to make one feel things; its purpose is to increase the diiculty
Embodied Mind and Its Challenge to Western hought. New York: Basic
and length of perception ([1917] 1965, 12). he immediate efect
Books.
Lowe, E. J. 2006. he Four Category Ontology: A Metaphysical Foundation
of foregrounding is to make strange (ostranenie), to defamiliar-
for Natural Science. Oxford: Clarendon Press. ize. hese ideas allow the empirical researcher to frame speciic
Meeker, Joseph. 1997. he Comedy of Survival: Literary Ecology and a hypotheses addressing the impact that foregrounded passages
Play Ethic. 3d ed. Tucson: University of Arizona Press. have on readers.
Polti, Georges. [1921] 1977. he hirty-Six Dramatic Situations. Trans. First, as studied by Willie van Peer (1986), readers should ind
Lucille Ray. Boston: he Writer. that passages high in foregrounding are striking when compared
Starobinski, Jean. 1979. Words upon Words: he Anagrams of Ferdinand with passages low in foregrounding. To test this hypothesis, van
de Saussure. Trans. Olivia Emmet. New Haven, CT: Yale University Peer chose four short poems and carried out a comprehensive
Press. analysis of the foregrounding in each line at the three diferent
Watkins, Calvert. 1995. How to Kill a Dragon: Aspects of Indo-European
levels (sound, syntax, semantics). his enumeration enabled
Poetics. New York: Oxford University Press.
the lines of the poems to be rank-ordered for density of fore-
Wellek, Ren, and Austin Warren. 1977. heory of Literature. 3d ed. San
Diego, CA: Harcourt Brace Jovanovich.
grounding. Readers were then asked to respond to the poems by
underlining all words and phrases they found striking. In all of
the poems, the frequency of readers underlinings was found to
LITERATURE, EMPIRICAL STUDY OF correlate highly with the density of foregrounding.
he broad ield of empirical research over the last three decades A similar study by David S. Miall and Don Kuiken (1994) took
has included a range of topics and approaches. Sociologically up the additional suggestions of Shklovsky that art makes us feel
oriented researchers have taken up literary socialization, such as and that perception is lengthened. Working with three modern-
the reputation of authors, and audience research; book histori- ist short stories, they analyzed the presence of foregrounding in
ans have surveyed the experiences of readers, especially those in each sentence. he stories were then presented a sentence at a
the working class over the last two centuries, and the role of read- time on computer; reading times per sentence were recorded
ing clubs; writers interested in aesthetic response have studied while readers undertook a irst reading at their normal reading
individual literary experience and compared literary response to speed; they then read the story a second time while providing
experiences of other media, such as ilm, computer gaming, or a rating of each sentence. For all of the stories, after adjusting
hypertext. At the heart of the empirical endeavor is the formation for sentence length, the speed of reading correlated signiicantly
of theories and narratives about the role and status of literature with foregrounding (highly foregrounded sentences took about
based on actual data, either verbal or numeric: his may consist twice as long to read as sentences without foregrounding); and
of readers memoirs or statistics for library borrowings, the study readers ratings for strikingness and intensity of feeling also cor-
of questionnaires elicited from readers, or evidence of reading related with foregrounding. he readers in both this study and
processes gathered during carefully controlled laboratory exper- that by van Peer were university students from a wide range of
iments. In this entry, the primary focus is on the formal aspects of backgrounds, yet correlations with foregrounding were signii-
literary texts as relected in studies of literary reception. cant regardless of their expertise in literature; thus, response to
Reception studies cover a wide spectrum of topics, includ- foregrounding appears to be independent of literary education.
ing style and narrative; readers feelings and the relation of liter- hese indings suggest a theory of text processing: he encoun-
ary understanding to the self; individual diferences in readers ter with foregrounding is found striking by the reader, who then
preferences or the inluence on reading of personality traits; slows down in order to gain a better apprehension of the unusual
cross-cultural diferences in reading; and the relation of literary textual features; the experience is defamiliarizing and arouses
experiences to other media. Some empirical studies attempt to feeling, and feeling may be the vehicle by which the reader elicits
clarify or improve on the models of reading developed by dis- an alternative framework for reconceptualizing the meaning of
course psychologists; others may represent an attempt to test a the text at that moment. he main indings have been conirmed
particular claim about reading proposed by literary theorists or in several later studies on the efects of foregrounding.
narratologists. Either explicitly or implicitly, a number of Literary reading is often said to be engaging in its power to
studies have raised the question of literariness: whether lit- evoke ideas and feelings about the self. A method for investigat-
erary texts involve response processes that are measurably dis- ing this idea, termed self-probed retrospection, was irst demon-
tinctive in some way. strated in a study by Ufe Seilman and Steen F. Larsen (1989).
hey proposed that a literary text was more likely to evoke per-
Experimental Examples sonal resonance than a nonliterary text. Readers were given either
In this section I discuss three themes that have been pursued a short story or an expository text (on population growth), both
empirically and provide examples of the ways that readers of about 3,000 words. While reading, readers put a pencil mark
responses have been studied. in the margin whenever they were reminded of something from
he term foregrounding refers to stylistically distinctive their own lives; otherwise, reading occurred at a normal pace.
aspects of literary texts. hese may be apparent at the level of he two texts gave rise to the same number of remindings: about
458
Literature, Empirical Study of
13 for each text. After reading, readers reviewed their marks and reading. he authors have examined the relationship with the
completed a short questionnaire on each reminding: its type, narrator both theoretically and empirically in a series of studies
vividness, emotional quality, and the like. he types of remind- involving aspects such as dialogue, plot, point of view, and char-
ing were found to distinguish the two texts: For the literary text, acterization. For instance, in studying the efects of free indirect
twice as many remindings involved a memory of the self as an discourse (where the narrators voice represents the speech or
actor. he expository text, in contrast, invoked more memories of thought of a character without attribution), they found that this
things heard or read about. It was also noticed that remindings style led readers to endow the narrator with the personality of the
in general were more frequent in the opening sections of both character. Taking a story about a husband and wife, Rope, by
texts (the downward trend was more marked in the literary text). Katherine Ann Porter, that is related almost entirely in free indi-
hese indings suggest that readers of a literary text situate them- rect discourse focused on the male character, they constructed
selves by recruiting speciic, self-related information, particularly several other versions of the story in which the character roles
at the beginning of a text where an appropriate schema must be were reversed and the dialogue was rewritten as direct quoted
developed, and that this information refers predominantly to the speech. After reading the story, readers were asked several ques-
active engagement of the reader in the world. tions about their impressions of the narrator and the characters.
Several subsequent studies have built on the remindings It was found, for example, that the rationality of a character was
method. Larsen and Jnos Lszl (1990) studied the cultural rated higher when it was associated with the narrator through
proximity of readers, working with Hungarian and Danish read- free indirect discourse. Judgments about the likely gender of the
ers of a Hungarian story. Of the remindings produced, they narrator were also aligned with the gender of the character rep-
found that markedly more event memories were produced by resented through free indirect discourse.
the Hungarians and, of these, signiicantly more were of expe-
rienced rather than reported events. Lszl and Larsen (1991) Prospects
then extended the method to look at the implications of point Although studies of the kind just described are not yet well
of view in iction. Several passages of the Hungarian story known among literary scholars, they point to a basis for rethink-
were rewritten to change point of view. As before, personal- ing the nature of literary studies and education. In contrast to
event remindings were signiicantly more frequent among the claims voiced by critics from I. A. Richards to E. D. Hirsch that
Hungarian readers; in addition, shifting to the inside point of the responses of ordinary readers are ill-informed or whimsi-
view of a character increased the percentage of such remind- cal, empirical studies demonstrate the presence of signiicant
ings from 55 percent to 75 percent for the Hungarians (but had regularities in readers responses, enabling signiicant conclu-
no efect on Danish readers). here was also some evidence that sions to be drawn about the efects of literary language and form.
inside point of view inluenced readers toward more emotional Empirical studies also shift the focus away from the interpretative
remindings, regardless of their cultural background. issues that have largely preoccupied literary scholars onto the
Another variant of the remindings method was developed by experiential aspects of literary reading. In this respect, they invite
Keith Oatley (1999) to examine gender and personality diferences a reconsideration of the role of literature in human culture.
in readers. Instead of a simple mark, readers were instructed to
David S. Miall
write a letter in the margin: an E for an emotion; M for an autobio-
graphical memory; and T for a train of thought. Among readers of
short stories by Alice Munro and Carson McCullers, female read-
ers produced overall signiicantly more emotions than the male Bortolussi, Marisa, and Peter Dixon. 2003. Psychonarratology: Foundations
readers; in addition, male readers produced fewer emotions when for the Empirical Study of Literary Response. Cambridge: Cambridge
the protagonist was female, whereas gender of protagonist had University Press. An integrated approach to narrative theory and
the empirical study of reading, including exemplary studies by the
no inluence on female readers. In a second study, the method
authors.
was used to examine the aesthetic distance of readers from a
Kuiken, Don, David S. Miall, and Shelley Sikora. 2004. Forms of self-im-
short story and how their responses mirrored their adult attach- plication in literary reading. Poetics Today 25: 171203.
ment styles. Kuiken, Miall, and Shelley Sikora (2004) developed Larsen, Steen F., and Jnos Lszl. 1990. Cultural-historical knowl-
the method of self-probed retrospection to elicit verbal commen- edge and personal experience in appreciation of literature. European
taries by readers. he reader is invited to read a text and make Journal of Social Psychology 20: 42540.
marginal marks whenever a passage seems striking or evocative; Lszl, Jnos, and Steen F. Larsen. 1991. Cultural and text variables in
the reader later returns to the marked passages and chooses (say) processing personal experiences while reading literature. Empirical
ive on which to provide a commentary. Readers are readily able Studies of the Arts 9: 2334.
to recover the thoughts and feelings that occurred during reading, Martindale, Colin. 1990. he Clockwork Muse: he Predictability of Artistic
giving access to at least some of the mental processes that appear Change. New York: Basic Books.
Miall, David S. 2006. Literary Reading: Empirical and heoretical Studies.
to make literary reading distinctive.
New York: Peter Lang. Chapter 3 in this book provides an introduction
A third set of studies is focused on the role of the narrator in
to the methods of empirical study, while Chapter 7 surveys the princi-
iction. Marisa Bortolussi and Peter Dixon (2003) elaborate a pal research topics in empirical reception studies.
theoretical framework that accounts for the readers relation to Miall, David S., and Don Kuiken. 1994. Foregrounding, defamiliariza-
the narrator; in particular, they propose that readers treat the tion, and afect: Response to literary stories. Poetics 22: 389407.
narrator as a conversational participant and make inferences Oatley, Keith. 1999. Meetings of minds: Dialogue, sympathy, and identi-
about the narrators personality and values that inluence their ication, in reading iction. Poetics 26: 43954.
459
Logic and Language
Seilman, Ufe, and Steen F. Larsen. 1989. Personal resonance to litera- context, the latter sentences contain at least one logical connec-
ture. Poetics 18: 16577. tive. In standard approaches, the truth conditions for molecular
Shklovsky, Victor. [1917] 1965. Art as technique. In Russian Formalist sentences are given recursively, piggy-backing, as it were, on
Criticism: Four Essays, ed. and trans. L. T. Lemon and M. J. Reis, 324. the truth conditions for atomics. (An example follows.)
Lincoln: University of Nebraska Press.
For present purposes, a language is a precise syntax (involv-
Steen, Gerard, and Dick Schram, eds. 2001. he Psychology and Sociology
ing, among other things, a precisely deined set of sentences,
of Literature: In Honour of Elrud Ibsch. Amsterdam: John Benjamins.
A wide-ranging collection, mainly illustrating recent empirical studies
some of which are atomics, some molecular) coupled with truth
of literature. conditions, which, as noted, provide truth-in-a-case conditions
van Peer, Willie. 1986. Stylistics and Psychology: Investigations of for all sentences. So, in addition to specifying a syntax, ones
Foregrounding. London: Croom Helm. speciication of a language involves specifying a class of cases in
van Peer, Willie, ed. 2007. Foregrounding. Language and Literature 16.2 terms of which all sentences, provided by ones speciied syntax,
(Special Issue). A recent collection of contributions to foregrounding, enjoy truth-in-a-case conditions. (An example follows.)
including both theoretical and empirical studies.
van Peer, Willie, Frank Hakemulder, and Sonia Zyngier. 2007. Muses Logical Consequence Qua Truth Preservation
and Measures: Empirical Research Methods for the Humanities. he consequence relation of a language (or fragment thereof)
Cambridge: Cambridge Scholars Publishing.
is the chief concern of the ield of logic, broadly understood. In
Zwaan, Rolf. 1993. Aspects of Literary Comprehension: A Cognitive
efect, a consequence relation yields what follows from what,
Approach. Amsterdam and Philadelphia: John Benjamins.
what sentences of the language logically follow from what sen-
tences. Given a language, as understood here, we deine logical
LOGIC AND LANGUAGE consequence or validity (i.e., semantic validity) as follows, where
L is a language, and A and B sentences of L.
Every language, suitably understood, has a logic, suitably under-
stood. he suitable understanding is a common semantic con- Val. B is a consequence of A in L if and only if there is no case in
ception of logic and language. On this conception, the logic of which A is true but B untrue.
a language is the so-called consequence relation, which, on the
If B is a consequence of A in L, we say that the argument from
semantic conception, essentially involves truth preservation. he
A to B is (semantically) valid in L, that B logically follows from A
chief goal of this entry is to briely convey the basic and very
in L, that A (semantically) implies B, with all such terminology
common sense in which every language has a logic. (N.b.: for
being equivalent (for present purposes).
space and simplicity reasons, this essay privileges the so-called
(Val), in turn, may be generalized. We say that a set X of
semantic, or model-theoretic, approach to logic. Moreover, this
L-sentences is veriied-in-a-case just if every member of X is
essay again, for space reasons only aims to convey basic ideas;
true-in-that-case. In turn, we say that the argument from X to A
it doesnt aim to be a history or even survey of the semantic con-
is valid just if theres no case in which X is veriied but A untrue.
ception of logic and language.)
Similarly, we say that a sentence A is logically true in L just if there
is no case in which A is untrue.
Languages and Truth Conditions
In specifying the logic of a language (or some fragment thereof),
one seeks precision. Much as physics idealizes away from the
Sample Language and Logic
Consider an example of the foregoing ideas, in particular, a so-
messiness of physical reality, formal semanticists and logicians
called classical propositional language. (Such languages are ter-
(at least those concerned with natural languages) idealize away
ribly simple; they have no quantiiers. To simplify even more,
from the messiness of linguistic reality. One such idealization is the
our propositional language will contain no names or predicates!)
assumption that all (declarative) sentences of a language have so-
One motivation for the language is that we seem to have so-called
called truth conditions. (Another immediate idealization is that we
truth-functional connectives in English (and natural languages,
can easily, and precisely, specify the target declarative sentences,
generally), and one might be interested in clearly specifying the
the sentences that, in some sense, are used to make assertions. In
logic of that (truth-functional) fragment of our language. For
what follows, sentence will be short for declarative sentence.)
example, there seems to be a truth-functional usage of and, one
For present purposes, such truth conditions are best thought of as
in which and expresses conjunction, where a conjunction is true
truth-in-a-case conditions, that is, conditions that provide, for any
in a given case if and only if both conjuncts are true in the given
relevant case, what it takes for sentences to be true-in-that-case.
case. Similarly, a truth-functional usage of negation in English is
If one thinks of cases as possible worlds, then truth conditions
evident, one in which, for example, negation does no more nor
provide the conditions under which sentences are true-in-w, for
less than toggle truth values.
any possible world w. Similarly, if one thinks of cases as situa-
As noted, we irst need to precisely specify a language. We
tions, then truth conditions provide the conditions under which
need to specify a vocabulary (in efect, the building blocks
sentences are true-in-s, for any situation s. Moreover, if one thinks
of the language) and, in general, a full syntax, which contains a
of cases as Tarskian models, then truth conditions provide the con-
(precisely deined) set of sentences; we then give our truth con-
ditions under which sentences are true-in-M, for any model M.
ditions. We proceed to deine our language L as follows.
In addition to the assumption of truth conditions, another
idealization is that sentences may be cleanly, precisely carved (1) Vocabulary: p, q, and r, with or without subscripts
into the atomics and molecular sentences, where, in the present (numerals for natural numbers), are our atomics. In addition
460
Logic and Language Logical Form
to the atomics, we have a set of punctuation marks, namely, consequence relation, which carries the information about what
( and ). Furthermore, we have a set of connectives: & is a follows from what in the given language.
binary connective (takes two sentences to make a sentence),
and ~ is a unary connective (takes one sentence to make a Artiicial Versus Natural Languages
sentence). Our three given sets of symbols are disjoint. One might agree that every artiicial language, as understood
(2) he set S of L-sentences is deined recursively as here, has a logic, that is, a consequence relation, speciied via
follows. (Val). What, though, of natural languages?
a) All atomics are L-sentences. he question is a good one, but very complex. Natural lan-
b) If A and B are L-sentences, then so too are ~A and (A&B). guages appear to have arguments that are truth preserving
c) Nothing is an L-sentence unless its being so follows from in the strict sense of (Val) arguments such that theres no
(2a) and (2b). case in which the premises are true but the conclusion untrue.
(Consider the limit example: the argument from A to A.) he
(3) L-cases are (total) functions c from S into V = {1,0}, where
trouble, of course, concerns the relevant cases involved in nat-
V is our set of semantic values.
ural languages truth conditions. Assuming (a not insigniicant
(4) Truth conditions: an L-sentence A is true in a case c if assumption) that all sentences of a natural language have truth
c(A) = 1. conditions in the relevant sense, it remains unclear what counts
a) An atomic sentence A is true in a case c if c(A) = 1. (A is as a relevant case in such truth conditions.
false in c otherwise.) For present purposes, the right account of cases for natural
b) A sentence of the form ~A is true in a case c if c(A) = 0. languages is not pressing. he pressing issue is whether, in the
c) A sentence of the form (A&B) is true in case c if c(A) = 1 end, natural languages are suiciently equipped with truth-in-a-
= c(B). case conditions, whatever the cases may be. If so, the chief point
With our language L so given, we can now see the sense in which remains: Any such language, in virtue of (Val), enjoys a logic.
every language at least given suitable idealizations has a logic.
Applying (Val), we immediately see that, for any L-argument, it is Logical Theories and Rivalry
either truth preserving, in which case valid in L, or not (in which A logical theory of a language (or fragment thereof) is a theory of
case, invalid in L). the consequence relation on that language (or fragment). One
way in which logical theories might disagree is on the choice
EXAMPLE. Consider the L-argument from premise (p&q) to of logical connectives, but this is not necessary for disagree-
conclusion q. According to (Val), this argument is valid in L just ment. Two logical theories might agree on the class of (rele-
if theres no case in which (p&q) is true but q untrue. In L, vant) connectives in a language (fragment) while nonetheless
our cases are functions from the L-sentences into {1,0}. Is there a disagreeing about the logical behavior of such connectives a
case in which (p&q) is true but q not true? No. To see this, just disagreement that, in general, will show up in rival truth con-
consider the truth conditions on L-sentences. According to those ditions for the given connectives. (Such disagreement over
conditions, a sentence of the form (A&B) is true in a case just if truth conditions often centers on what counts as a case in the
both A and B are true in the given case. So, for any case c, if c(p&q) relevant truth conditions.) Suice to say that rivalry currently
= 1, then c(p) = 1 and c(q) = 1, in which case c(q) = 1. Hence, there reigns in the ield of logic, at least concerning the right logical
is no case c in which (p&q) is true but q isnt true. theory of natural language (or fragments thereof). Problems
What one also notices or would notice, on relection about of vagueness and consistency, truth, and paradox
L is familiar truth-functional behavior for negation. For exam- are particularly fertile phenomena for contemporary rivalry
ple, given the truth conditions in L, and given (Val), it is easy to among logical theories.
see that p is a consequence of ~~p, and vice versa. In other
J. C. Beall
words, the double negation of a sentence is logically equivalent
to the given sentence.
SUGGESTIONS FOR FURTHER READING
One may proceed (as an exercise) to record the other logical
forms that are valid in our artiicial language L. Once recorded, Any textbook on classical logic (of which there are many) will be
one has a precise account of the logical behavior of & and ~ suitable further reading. From there, one might turn to textbooks
in L. In turn, one may evaluate whether such logical behavior on nonclassical or so-called intensional logics (of which there
accurately captures the behavior of corresponding connectives are many). As a irst step, one might proitably peruse entries
in ones natural language. In this respect, the artiicial L serves under logic in the Stanford Encyclopedia of Philosophy, edited
the role of idealized models in natural sciences: It gives a clear by Edward N. Zalta, available online at: http://plato.stanford.
account of how the target phenomena (in this case, logical con- edu/.
nectives) behave. Rivalry among logical theories (more on which
follows) turns on the extent to which L is an accurate model.
LOGICAL FORM
Of course, L is but one example of a language (i.e., an ideal-
ized, artiicial language), and a very simple one at that. Still, it he construction of systems in which valid inference can be char-
is not hard to see that, provided languages come equipped with acterized has been the central concern of logic since its incep-
precisely deined cases and each sentence enjoys truth-in-a- tion with the ancients. Beginning with Gottlob Freges seminal
case conditions, (Val) quickly yields a logic for the language a insights of the late nineteenth century (Frege 1879, 1893), it has
461
Logical Form
been understood that accomplishing this goal in a manner suf- predecessors, Tarski was skeptical that natural languages are
iciently rigorous that the inferred proposition can be taken to such systems. Indeed, his recommendation was that we eschew
be proven requires strict attention to the structural properties anything other than formal languages when engaging in scien-
of propositions, to their logical form. What was crystalized by tiic discourse. A more moderate view emerged, however, which
Frege was that this form can be revealed only in a language that tried to show that natural languages, at least to a certain extent,
difers in two key respects from the supericial form of natural could be rendered in the logical idiom. Most closely associ-
languages: i) he structure of propositions is function-argument, ated with W. V. O. Quine (1950), the idea was that logic is to
not subject-predicate, and ii) expressions of generality, unlike be understood as schematic. On this view, there are no logical
proper names, do not occur as arguments, but rather bind vari- propositions, per se, but only propositional schema; valid infer-
ables that do. Together, these two diferences aforded the irst ence is characterized with respect to such schemata, and holds
adequate account of multiple generalization; by distinguish- for any instantiation of the schema. Natural languages can
ing xy(P(x,y)) from yx(P(x,y)) in terms of the scopes of then be regimented as instances of the schemata; a sentence
the universal and existential quantiiers, Frege was able to allay then has a certain logical form because it conforms to a prop-
one of the central problems that had plagued traditional logic. ositional schema of a certain form. (A simple example: John
(Cf. Kneale and Kneale 1962, 483f.) came and Mary left is an instance of the schema & , and
Freges insight, that grammatical form does not reliably reveal so has the logical form John came & Mary left, where & is
logical form, was taken up by Bertrand Russell, most notably in the logical symbol for conjunction.) Quines view nevertheless
the theory of descriptions (Russell 1905). Russell proposed that is no departure from the tradition that distinguishes grammat-
deinite descriptions, as in he present King of France is bald, ical from logical form; it does depart, however, in holding that
is not to be understood in the manner of a proper name, that is, systematic associations can be established between grammat-
standing as an argument, but rather as a complex term of gen- ical forms and logical forms, for signiicant aspects of natural
erality. hus, the proper logical form is not B(k), but rather languages (1960).
x(K(x) & y(K(y) x = y)) & B(x); that is, there is one and only Accepting the traditional separation of grammatical and log-
one present King of France and he is bald. By taking this to be ical form is not universal; rejection of it has been central within
the proper logical form, Russell argued that a number of logical linguistic theory since the mid-1970s. On this latter view, the
issues could be directly addressed. For example, the ambiguity of derivation of a sentences logical form is an aspect of its syntac-
he present King of France is not bald could be accounted for tic derivation, hence, an aspect of its grammatical form. Again,
by taking the negation as having scope either inside or outside the central case is quantiication; in pivotal work, Robert May
the existential quantiier; negation having broader scope brings (1977) showed that the scope of quantiiers, including multi-
the case into conformance with the law of the excluded middle, ple quantiiers, can be represented, as in the manner noted,
as Russell observed. by assuming that there is a transformational rule that moves
Both Frege and Russell realized not only that the insights quantiier phrases, leaving a trace, interpreted as a variable
about logical form being surveyed clariied the formal nature of bound by the moved phrase. By hypothesis, the formulation
inference, but that these aspects of form also allowed for seman- of Mays rule QR requires theoretical resources no greater
tic elucidation, being directly related to an account of the con- than those independently needed within linguistic theory to
ditions for the truth of propositions. For instance, for Russell, otherwise express transformational mappings, for instance,
a substantial virtue of the account of descriptions was that the wh-movement. he class of syntactic representations gener-
intuition that he present King of France is bald is false can be ated by transformational mappings, including those efected
directly accommodated. But that we can proceed beyond intui- by QR, is known as LF. hus, sentences with multiple quanti-
tive elucidation to a formally and materially adequate deinition iers, such as Everyone loves someone will have two distinct
of truth, based on the sort of conception of logical form pioneered LF-representations, roughly:
by Frege and Russell, is due to Alfred Tarski ([1936] 1956, 1944).
[everyonei [someonej [ti loves tj]]]
In the case at hand of quantiication, Tarskis semantic clauses
run as follows: With respect to a universe of objects U, x(P(x)) and
is true just in case every sequence of objects of U satisies P(x);
[someonej [everyonei [ti loves tj]]]
x(P(x)) is true if this is so for some sequence. (A sequence S
satisies an open formula P(x) if there is an assignment of a which can be deined as representing the difering scope
value a of S to the variable x such that a is P.) Because Tarskis orderings of the quantiiers, the traces of QR being interpreted
method iterates, it extends to multiple generalization, distin- as variables bound by the quantiiers, so that in this regard,
guishing the truth conditions of xy(P(x,y)) from those of LF-representations constitute logical forms. hat grammars
yx(P(x,y)), thus providing semantic foundation for the syn- of natural languages have the rule QR is now a widely (if not
tactical insights of Frege. universally) accepted assumption within linguistics (Fiengo
Central to the importance of Tarskis formalization of seman- and May 1994; Fox 2000; Hornstein 1984, 1995; Larson and
tics is that it paved the way to metalogic, the study of the prop- Segal 1995; May 1985; Reinhart 1997); among the most well-
erties of logical systems, centrally their consistency, soundness, known independent arguments are those from weak crossover
and completeness. Tarski was clear to maintain that the dei- (Chomsky 1976) and anaphoric binding more generally, and
nition of truth could be reliably applied only to those systems antecedent contained deletion (May 1985). It has also become
whose propositions have the requisite logical form; like his a commonly accepted assumption within recent thinking in
462
Logical Positivism Mapping
philosophy of language (King 2001; Neale 1990; Stanley 2000; Firstly, formal logic as it has been developed by Gottlob Frege,
Ludlow 2002). is seen both as an instrument for analysis and as an ideal language
wherein all scientiic knowledge is expressable. Many logical posi-
Robert May
tivists also accepted Freges logicism, namely, the view that math-
WORKS CITED AND SUGGESTIONS FOR FURTHER READING ematics is reducible to logic. Hence, they endorsed the view that
mathematics is a language, not a science like, for example, physics.
Chomsky, Noam. 1976. Conditions on rules of grammar. Linguistic Secondly, it follows that a clear distinction can be made
Analysis 2: 30351.
between analytic and synthetic sentences. he former consist
Fiengo, Robert, and Robert May. 1994. Indices and Identity. Cambridge,
of logical and mathematical tautologies, whereas the latter can
MA: MIT Press.
Fox, Danny. 2000. Economy and Semantic Interpretation. Cambridge,
be either true or false and are therefore dependent on the way
MA: MIT Press. things are; that is, they are empirical.
Frege, Gottlob. [1879] 1967. Begrifsschrift: A Formula Language Modeled hirdly, this leads to the principle of verifiability: If a sen-
Upon hat of Aithmetic, for Pure hought. Trans. Stefan Bauer- tence is meaningful, then it should be possible to determine its
Mengelberg. In From Frege to Gdel, ed. Jean van Heijenoort, 582. truth value.
Cambridge: Harvard University Press. Straightforward consequences of these principles are, irstly,
. [1893] 1964. he Basic Laws of Arithmetic. Trans. Montgomery that metaphysical statements are neither analytic, as they are not
Furth. Berkeley and Los Angeles: University of California Press. tautologies, nor synthetic, as they do not refer to the empirically
Hornstein, Norbert. 1984. Logic as Grammar. Cambridge, MA: MIT accessible world; hence, they are meaningless. And, secondly, if all
Press.
of the sciences can be expressed in one and the same language, that
. 1995. Logical Form: From GB to Minimalism. Oxford: Blackwell.
is, the language of mathematics, all sciences can be uniied into a
King, Jefrey C. 2001. Complex Demonstratives: A Quantiicational
Account. Cambridge, MA: MIT Press.
single framework. Hence, the unity of science is a reachable goal.
Kneale, William, and Martha Kneale. 1962. he Development of Logic. It should be noted that although there are connections with
Oxford: Oxford University Press. Ludwig Wittgensteins views, as expressed in the Tractatus
Larson, Richard, and Gabriel Segal. 1995. Knowledge of Meaning. Logico-Philosophicus, and although Wittgenstein attended some
Cambridge, MA: MIT Press. of the meetings of the Vienna Circle, it would not be correct to
Ludlow, Peter. 2002. LF and natural logic, In Logical Form and Language, label him a logical positivist.
ed. Gerhard Preyer and Georg Peter. Oxford: Oxford University Press. On a more reined account, qualiication is needed. It suf-
May, Robert. 1977. he Grammar of Quantiication. Ph.D. diss., ices to look at the original manifesto of the Vienna Circle, the
Massachusetts Institute of Technology. Wissenschaftliche Weltaufassung (the Scientiic Worldview), to
. 1985. Logical Form: Its Structure and Derivation. Cambridge,
notice that the logical positivist program also included ethical-
MA: MIT Press.
societal views. In recent years, many authors have made a strong
. 1999. Logical form in linguistics. In he MIT Encyclopedia of
the Cognitive Sciences, ed Robert A. Wilson and Frank C. Keil, 4867.
case to have a second and historically more nuanced look at logi-
Cambridge, MA: MIT Press. cal positivism (see, e.g., Michael Friedman 1999).
Neale, Stephen. 1990. Descriptions. Cambridge, MA: MIT Press It is generally accepted that both Karl R. Popper, the found-
Quine, W. V. O. 1950. Methods of Logic. New York: Henry Holt. ing father of falsiicationism, and Willard Van Orman Quine, the
. 1960. Word and Object. Cambridge, MA: Technology Press. founding father of naturalized epistemology, have been the most
Reinhart, Tanya. 1997. Quantiier scope: How labor is divided between important critics of logical positivism. he former questioned
QR and choice functions. Linguistics and Philosophy 20: 399467. the veriiability theory; the latter rejected the analytic-synthetic
Russell, Bertrand. 1905. On denoting, Mind 14: 47993. distinction.
Stanley, Jason. 2000. Context and logical form. Linguistics and
Philosophy 23: 391434. Jean Paul Van Bendegem
Tarski, Alfred. [1936] 1956. he concept of truth in formalized lan-
guges. In Logic, Semantics, Metamathematics, trans. J. H. Woodger. WORKS CITED AND SUGGESTIONS FOR FURTHER READING
Ayer, Alfred. 1936. Language, Truth and Logic. London: Victor Gollancz.
. 1944. he semantic conception of truth. Philosophy and
Friedman, Michael. 1999. Reconsidering Logical Positivism.
Phenomenological Research 4: 34175.
Janik, Allan, and Stephen Toulmin. 1973. Wittgensteins Vienna. New
LOGICAL POSITIVISM York: Simon and Schuster. his book outlines the cultural setting
wherein logical positivism could arise.
Also known as logical empiricism, logical positivism was an
important philosophical movement in the irst half of the twenti-
eth century that reached its peak in the interbellum period and is
associated with the Vienna Circle (Wiener Kreis) and the Berlin
M
Circle (Berliner Kreis). he most prominent members of the
former were Moritz Schlick, Rudolf Carnap, Otto Neurath, Hans
MAPPING
Hahn, and Friedrich Waismann; of the latter, Hans Reichenbach,
Kurt Grelling, Carl Gustav Hempel, and Richard von Mises. In what follows, mapping is used in the general mathematical
On the standard account (see, e.g., Alfred Ayer 1936), logical sense of a partial or total correspondence between elements,
positivism is committed to the following principles: relations, and/or structures in two sets.
463
Mapping
Much of the theoretical thinking in modern linguistics has Creative metaphors are often elaborations of conventional ones,
been strongly linked to the development since the 1950s of cog- as in the following typical literary example:
nitive science, artiicial intelligence, and neuroscience. he irst
Perhaps time is lowing faster up there in the attic. Perhaps the
wave of cognitive science looked upon the brain as a sophisti-
accumulated mass of the past gathered there is pulling time out
cated symbol-processing digital computer, and linguistic models
of the future faster, like a weight on a line. (McDonald 1992,
in the ifties and sixties took a largely algorithmic approach, with
8283)
a strong focus on syntax and logic.
In the 1970s and 1980s, there was a sharply diferent second hought and language are embodied. Conceptual structure
wave of thinking that launched a rigorous, empirically based arises from our sensorimotor experience and the neural struc-
study of conceptual mappings: analogy, frames, meta- tures that give rise to it. he properties of grammars are the prop-
phor, metonymy, grammatical constructions, and men- erties of humanly embodied neural systems. Inference inherently
tal space projections. his original and ambitious research built into a source domain will be transferred by projection to an
program revisited from a modern point of view some funda- abstract domain. For example, the conventional metaphors of
mental issues that have been known since antiquity. It drew SEEING as TOUCHING (e.g., I couldnt take my eyes of her) and
on a powerful multidisciplinary mix of psychology, linguistics, KNOWING as SEEING (e.g., I see what youre saying) combine
computational modeling, and philosophy. Names associated with one schema for the English preposition over to motivate
with pioneering eforts in the new ield of conceptual map- overlook: he line of sight travels over (i.e., above) the object;
pings include Douglas Hofstadter, Melanie Mitchell, Dedre hence, there is no contact; hence, it is not seen; hence, it is not
Gentner, and Keith Holyoak, for analogy; George Lakof, Mark noticed or taken into account. In contrast, look over (she looked
Johnson, and Mark Turner, for metaphor and image schemas; over the draft) uses a related but diferent schema for over, a path
Erving Gofman and Charles Fillmore, for frames and frame covering much of a surface, as in she wandered over the entire
semantics; Ronald Langacker, Charles Fillmore, and Adele ield. his sense combines with the same mappings to produce
Goldberg, for cognitive and construction grammars; Gilles a very diferent abstract meaning the object this time is seen
Fauconnier, Eve Sweetser, and John Dinsmore, for mental space and noticed.
projections; and Geofrey Nunberg for metonymic mappings Metonymic mappings link two relevant domains, which may
(pragmatic functions). be set up locally. hey typically correspond to two categories of
In the 1990s and up to the time of writing, there was substan- entities, which are mapped onto each other by a pragmatic func-
tial further evolution of our thinking on these issues. he creative tion. For example, authors are matched with the books they write,
dimension of conceptual mappings was explored through the or hospital patients are matched with the illnesses for which they
study of conceptual blending and compression (Fauconnier are being treated. Metonymic mappings allow an entity to be
and Turner 2002; Coulson 2001) and through the modeling of identiied in terms of its counterpart in the projection. So, when
emergent structure in analogy (Hofstadter 1995; Hummel and a nurse says he gastric ulcer in room 12 would like some cofee,
Holyoak 1997). he role of primary metaphors was discovered by he/she uses the illness (the gastric ulcer) to identify the patient
Joe Grady (1997); constraints on mappings were proposed within who has it. Metonymy allows information to be compressed. If
metaphor theory and within blending theory. Jack is the patient and if the nurse is addressing a physician, his/
Metaphor was once commonly viewed as literary, igurative, her statement simultaneously conveys that Jack wants cofee and
poetic something exotic that we add to ordinary language to that he has a gastric ulcer, which could be further intended as a
make it more colorful, vivid, and emotional. But since the incep- question to ask if cofee is permitted under the circumstances.
tion of conceptual metaphor theory, it is widely acknowl- Im in the phone book uses a metonymic mapping from people
edged that metaphor is, in fact, central to thought and language to names. It says not only that my name is written in the phone
and necessary for human language in its many forms. In order book but also that the number linked to my name is indeed my
to talk and think about some domains (target domains), we use phone number. So it really says something about me, not just
the structure of other domains (source domains) and the corre- about my name: how to reach me, that I dont mind making my
sponding vocabulary (see source and target). Some of these number publicly available, and so on.
mappings are used by all members of a culture, for instance, in Metonymic and metaphoric mappings can combine to pro-
English, TIME as SPACE. We use structure from our everyday con- vide even greater compression, as in Martina is three points away
ception of space and motion to organize our everyday conception from the airport, said by a sports announcer of the tennis star
of time, as when we say Christmas is approaching, he weeks go Martina Navratilova, who was about to lose a tournament match.
by, Summer is around the corner, he long day stretched out with he points stand metonymically for the events of losing a point.
no end in sight. Rather remarkably, although the vocabulary often hree such events would lead to defeat. he events are on a met-
makes the mapping transparent, we are typically not conscious of aphorical spatial scale to which the tennis player gets mapped.
the mapping during use unless it is pointed out to us. hough cog- On that scale, the player is metaphorically at a spatial distance
nitively active, such mappings are opaque: he projection of one of three points from the end of the match which would mean
domain onto another is automatic. Metaphoric mappings may defeat. A metonymic chain takes us from the end of the match to
also be set up locally, in context, in which case they are typically defeat, then to exclusion from the rest of the tournament, then
perceived not to belong to the language but rather to be creative to returning home. he airport (a place) stands metonymically
and part of the ongoing reasoning and discourse construction. for an event (lying home) that starts in that place. hrough the
464
Mapping
metonymic chaining, lying home links to leaving the tourna- multiple modalities (Alac 2006) and anchored by human cul-
ment, which links in turn to losing the match, itself caused by tural artifacts as part of socially distributed cognition
the three lost points. Strikingly, very little of this is indicated (Hutchins 1995).
by the linguistic structure itself. It is constructed by means of the Biologically, it is currently widely assumed that mappings are
cognitive models that we have for games, tennis, tournaments, efected by means of neural binding (Shastri 1996). Computational
and travel and by applying to them the appropriate mappings. models of such binding have been proposed within the neural
he same sentence can take on completely diferent meanings if theory of language (Feldman 2006). Experimental techniques to
we bring in diferent cognitive models. show the psychological reality of various mappings have been
Mental space projections link elements and relations in con- devised by Lera Boroditsky (2000), Ray Gibbs (Gibbs et al. 1997),
nected mental spaces. For instance, in saying Liz thinks her hus- and Seana Coulson (2001), among others
band is tired, we build a mental space for Lizs reported beliefs,
Gilles Fauconnier
with a counterpart for her husband and properties within that
space (tired) that may or may not be satisied in connected
spaces: Liz thinks her husband is tired, but actually hes in great
shape. In saying Last year, Lizs husband was tired, we build a Alac, Morana. 2006. How brain images reveal cognition: An ethno-
mental space for last year, and in saying Liz thinks that last graphic study of meaning-making in brain mapping practice. Ph.D.
year, her husband was tired, we build a space for last year embed- diss., University of California, San Diego.
Boroditsky, L. 2000. Metaphoric structuring: Understanding time
ded in a belief space, itself embedded in a base space. presup-
through spatial metaphors. Cognition 75.1: 128.
positions (such as Lizs having a husband) can spread across
Coulson, Seana. 2001. Semantic Leaps. Cambridge: Cambridge University
spaces: In the last example, we infer that Liz has a husband, that
Press.
she thinks she has a husband, and that last year, she also had Fauconnier, Gilles. [1985] 1994. Mental Spaces. Cambridge: Cambridge
this husband. But any of these presuppositions can be prevented University Press.
from projecting by an explicit overriding entailment. . 1997. Mappings in hought and Language. Cambridge: Cambridge
In mental space projection, the access principle allows a University Press.
description of an element to identify its counterpart in another Fauconnier, Gilles, and Mark Turner. 2002. he Way We hink. New
mental space. For example, if Liz got married to Bob yesterday, York: Basic Books.
we can say Last year, Lizs husband was tired, identifying Bob in . 2008. Rethinking metaphor. In Cambridge Handbook of
the mental space last year by means of his counterpart (Lizs Metaphor and hought, ed. Ray Gibbs, 5366. Cambridge: Cambridge
University Press.
husband) in the mental space now.
Feldman, Jerome. 2006. From Molecule to Metaphor. Cambridge,
Conceptual blending generalizes the notion of conceptual
MA: MIT Press.
mapping to arrays of multiple mental spaces with the creation Gentner, Dedre. 1983. Structure-mapping: A theoretical framework for
of new blended spaces and the emergence of novel structure. analogy. Cognitive Science 7: 15570.
Such arrays of connected spaces are called integration networks. Gentner, Dedre, Keith Holyoak, and Boicho Kokinov, eds. 2001. he
Partial mappings link the mental spaces in such networks, and Analogical Mind: Perspectives from Cognitive Science. Cambridge,
selective projection maps the spaces onto novel blended spaces. MA: MIT Press.
he mappings are supported by a small number of vital rela- Gibbs, R., J. Bogdonovich, J. Sykes, and D. Barr. 1997. Metaphor in idiom
tions, such as analogy, change, identity, rolevalue, causeefect. comprehension. Journal of Memory and Language 37: 14154.
Compression is systematic in integration networks: A vital rela- Gofman, E. 1974. Frame Analysis. New York: Harper and Row.
tion in one part of the network can be compressed into a dif- Grady, J. 1997. Foundations of meaning: Primary metaphor and primary
scenes. Ph.D. diss., University of California, Berkeley.
ferent (or a scaled-down) vital relation in another part of the
Hofstadter, Douglas. 1995. Fluid Concepts and Creative Analogies.
network. Take, for example, My tax bill gets longer every year. he
New York: Basic Books.
inputs are the mental spaces corresponding to diferent years. In Hummel, J., and K. Holyoak. 1997. Distributed representations of struc-
each one, there is a tax bill. hese input spaces are linked by the ture: A theory of analogical access and mapping. Psychological Review
vital relation of analogy: Each one is structured by the frame of 104: 42766.
paying taxes in a particular year, and each tax-paying situation Hutchins, Edwin. 1995. Cognition in the Wild. Cambridge, MA: MIT
is analogous to the others. he inputs are also linked by disanal- Press.
ogy: Each tax bill is diferent (longer than the previous one). Lakof, George, and Mark Johnson. 1980. Metaphors We Live By.
he analogous input spaces are integrated into a single blended Chicago: University of Chicago Press.
space, in which all the tax bills are fused into one: Analogy is Lakof, George, and Rafael Nez. 2000. Where Mathematics Comes
compressed into identity. Disanalogy is compressed into change. From: How the Embodied Mind Brings Mathematics into Being.
New York: Basic Books.
In the blended mental space, there is a single tax bill that changes
Liddell, Scott K. 2003. Grammar, Gesture, and Meaning in American Sign
over time.
Language. Cambridge: Cambridge University Press.
Metaphors typically result from double-scope integration McDonald, Ian. 1992. King of Morning, Queen of Day. New York: Bantam
networks, whereas metonymy turns out to be the compression of Books.
one vital relation into another. Mitchell, M. 1993. Analogy-Making as Perception. Cambridge, MA: MIT
Conceptual mappings are not prompted only by spoken or Press.
signed language. hey are part of human thought, communi- Nunberg, G. 1978. he Pragmatics of Reference. Bloomington: Indiana
cation, and interaction quite generally; they are signaled through University Linguistics Club.
465
Markedness
Nez, Rafael. 2005. Creating mathematical ininities: Metaphor, blend- and leveling toward the unmarked term in pidgins/creoles,
ing, and the beauty of transinite cardinals. Journal of Pragmatics dialects, informal speech, and so forth. A further outgrowth has
37: 171741. to do with markedness hierarchies that are scalar in nature, such
Nez, Rafael, and Eve Sweetser. 2006. Looking ahead to the as the noun phrase accessibility hierarchy for relativiza-
past: Convergent evidence from Aymara language and gesture in the
tion (e.g., from less marked to more marked role of the relative
crosslinguistic comparison of spatial construals of time. Cognitive
pronoun in the relative clause, subject > direct object > indirect
Science 30: 40150.
Shastri, Lokendra. 1996. Temporal synchrony, dynamic bindings, and
object > prepositional object).
SHRUTI a representational but non-classical model of relexive rea- he formal (generative) approaches, especially univer-
soning. Behavioral and Brain Sciences 19.2: 3317. sal grammar (UG), the principles and parameters plus
Sweetser, Eve. 1996. Reasoning, mappings, and meta-metaphorical minimalist approach to syntax (Chomsky 1995), and optimal-
conditionals. In Essays in Semantics and Pragmatics, ed. Masayoshi ity theory in phonology (Prince and Smolensky, 2004), focus
Shibatani and Sandra hompson, 22134. Amsterdam: John on competence and reject criteria for markedness related to
Benjamins. use (performance). Using cross-linguistic details of grammar,
Turner, Mark. 1991. Reading Minds. Princeton, NJ: Princeton University UG has been posited; UG determines a set of possible core gram-
Press. mars for languages by setting parameters, so that systems that
Williams, Robert. 2005. Material anchors and conceptual blends in time-
fall within a core grammar constitute the unmarked phenomena
telling. Ph.D. diss., University of California, San Diego.
and more marked elements are found in the periphery (see core
and periphery) (Chomsky 1981). More recently, work has also
MARKEDNESS focused, for example, on expression of markedness relations by
constraints, on an explanation of markedness asymmetries
he original insight of markedness was that many linguistic phe-
through constraint interaction, and the use of constraint forms
nomena consist of polar opposed pairs for example, the phono-
to express markedness hierarchies.
logical feature unvoicedvoiced and the grammatical relation
In both of these approaches, the unmarked category has also
activepassive and that typically there is an asymmetry, such that
at times been assimilated with the concept of naturalness, as in
one term is more general and thus unmarked (given irst in the
natural phonology and natural morphology, as well as in optimal-
examples) and the other is more constrained and thus marked.
ity theory; some see it as overlapping with normality, regularity,
Markedness was irst developed in phonology as an explanation
generality, and productivity; it has also been used, for example,
for asymmetries in phonological systems based on cross-linguistic
in studies of word order to deine the basic, dominant, or pre-
comparisons, with evidence from typology and universals for
ferred WORD ORDER (e.g., subject-verb-object in English); and
example, more (unmarked) oral consonants than marked nasal
it has certain elements in common with the notion of proto-
ones: Unmarked consonants occur in places of nonconditioned
type. While not everyone uses the term markedness and some
neutralization (e.g., only unmarked voiceless consonants in word-
linguists think that it is an unwieldy cover term with too wide a
inal position in Russian). Later, markedness was used to study
range of application and no central deinition, others in both tra-
grammatical semantics (where the unmarked term has a larger
ditions see it as a major conceptual and explanatory tool that will
semantic range than the marked term), to explain the order of pho-
continue to be of interest and utility for understanding various
nological acquisition in child language (unmarked terms learned
phenomena of language.
before marked terms, e.g., stops before fricatives) and apha-
sia (marked terms lost before unmarked ones), and to identify Linda Waugh
implicational universals, in which the presence of a marked
element implies the presence of the corresponding unmarked WORKS CITED AND SUGGESTIONS FOR FURTHER READING
element, but not vice versa (all of these in Jakobson 1990). Since
Battistella, Edwin. 1990. Markedness: he Evaluative Superstructure of
then, it has developed into an important (though controversial)
Language. Albany: State University of New York Press. Battistellas two
concept in other areas of linguistics, such as morphology, syn-
books are the most accessible long treatments of the topic.
tax, lexical semantics, historical linguistics, second . 1996. he Logic of Markedness. New York: Oxford University Press.
language acquisition, stylistics, and so on. Battistellas two books are the most accessible long treatments of the
Since about the 1960s, two substantially diferent approaches topic.
to markedness have developed and with them diferent types of Chomsky, Noam. 1981. Markedness and core grammar. In heory of
evidence, and explanations, for markedness. he FUNCTIONAL- Markedness in Core Grammar, ed. A. Belletti, L. Branmdi, and I. Riozzi,
(-typological) approaches (e.g., Givon 1990; Croft 2003), based 12346. Pisa: Scuola Normale Superiore di Pisa.
on earlier work in typology universals, depend on diagnostic . 1995. he Minimalist Program. Cambridge, MA: MIT Press.
criteria, not only from linguistic systems but also from language Croft, William. 2003. Typology and Universals. 2d ed.
use, and these are related to functional criteria, communica- Cambridge: Cambridge University Press.
Givon, Talmy. 1990. Syntax: A Functional-Typological Introduction, 2.
tive needs, processing eiciency, learnability, memory, and so
on. Criteria include zero or simple expression for the unmarked
Jakobson, Roman. 1990. On Language. Ed., with an introduction by,
term (isomorphism) unmarked singular with zero expression as
L. Waugh and M. Monville-Burston. Cambridge: Harvard University
in cat versus marked plural with the marker -s in cats; text (token) Press.
frequency (unmarked term more frequent); contextual distribu- Prince, Alan, and Paul Smolensky. 2004. Optimality heory: Constraint
tion (unmarked category has greater freedom of occurrence); Interaction in Generative Grammar. Malden, MA: Blackwell.
466
Market, Linguistic
Waugh, Linda, and Barbara Laford. 2000. Markedness. In language forms. Moreover, this mismatch is itself recognized by
Morphology: An International Handbook on Inlection and Word- individuals, albeit implicitly or unconsciously, who understand
Formation, I: 27281. Berlin: Walter de Gruyter. An accessible treat- the symbolic value of language. Language is, therefore, another
ment of the topic. form of cultural capital in that it is symbolic in the way it both
values and is valued in terms ultimately related to the structure
of the ield. For Bourdieu, the most predominant ield structures
MARKET, LINGUISTIC
are those of social class, which also express the distribution of
Pierre Bourdieu deines a linguistic market as a system of rela- power in society.
tions of force which determine the price of linguistic products and here are relations of linguistic production and authorized
thus helps fashion linguistic production (1989, 47). If linguistic language within the linguistic market. Moreover, everyone
habitus is the subjective element of habitus connected with enters the market in order to compete as a way of gaining and
language use, linguistic market represents the objective field sanctioning social prestige, and, consequently, status and posi-
relations. As always with Bourdieu, the two are in a constant state tion, through the acknowledgment of others in the market. Value
of dynamic interrelationship, as well as evolving dynamically as is ascribed to individuals; it is not within their own capacity to
a part of the transformation of social structures. give it to themselves. here is a kind of anticipation and actu-
In positing a concept such as linguistic market, Bourdieu is alization of proits, much in the same way as in any market (see
targeting traditional linguistics. His quarrel is with all linguis- Bourdieu [1982] 1991, 76 f). Bourdieu refers to many examples
tics going back to the work of Ferdinand de Saussure, which he where the power relation between two or more individuals is
sees as treating language as an object of study rather than as a expressed in the language they use with respect to one another.
practice. he concept thus constitutes language as logos rather For example, in the postcolonial context (see colonialism
than praxis. Bourdieus critique extends to Noam Chomsky and and language), those in a position of dominance sometimes
Chomskyan linguistics, with its discovery a of semibiological abdicate their position of authority by linguistically reaching
language acquisition device, deep syntactical structure down to the interlocutor. However, he sees this as simply a
(see underlying structure and surface structure), strategy of condescension aimed at reasserting their domina-
and universal grammar. Bourdieu cannot accept the tion. Normally, it is the opposite that applies: hose dominated
Chomskyan precepts that linguistics should be concerned with are forced to adopt the language of the dominant. Bourdieu also
an ideal speaker-listener, a homogeneous speech community, contrasts the broken English of the black American vernacu-
and perfect grammatical competence. Bourdieus alternative lar with the air of naturalness of the English (1992, 143). For
can be summed up as follows: Chomskyan linguistics, both are natural and unbroken since
they follow the same complex principles (e.g., binary merge
In place of grammaticalness it puts the notion of acceptability,
and wh-movement). he point is not only that power relations
or, to put it another way, in place of the language (langue), the
are expressed in such linguistic exchanges but that the linguis-
notion of legitimate language. In place of relations of communica-
tic market also deines what is and is not linguistically valued by
tion (or symbolic interaction) it puts relations of symbolic power,
rewarding and sanctioning speciic forms of language. In the-
and it replaces the meaning of speech with the question of the
ory, everything is available to all in the market. However, some
value and power of speech. Lastly, in place of speciically linguis-
already hold speciic forms of linguistic capital, which they have
tic competence, it puts symbolic capital, which is inseparable
obtained from family background, education, and professional
from the speakers position in the social structure. (1977, 646;
trajectory. Moreover, such symbolic value is not only expressed
italics in original)
in language forms but also structurally homologous to other
In other words, Bourdieu is seeking to socialize, or at least soci- forms of cultural capital; indeed, it can be found in physical body
ologicalize, all the major principles of traditional linguistics. gestures, as well as other forms of self-presentation. For those
he linguistic market is, therefore, essentially an expression without this capital, it is almost impossible to catch up.
of linguistic relations. However, like all markets, not everyone Ultimately, such relations are expressed in political relations,
to be found within it is equal, and linguistic knowledge is never where certain individuals and representatives are endowed with
perfect. In reality, some are found to have greater practical mas- the power to sanction. For Bourdieu, these are acts of quasi-
tery (connaissance). his knowledge is itself deined not simply magic as, through this endowment, power is literally invested
in terms of use but as an expression of legitimate language. In in someone by a formal acknowledge of status a form of social
most social contexts, there is a dominant language form. his consecration. A most obvious form of this phenomenon is when a
is most evident at a national level where there is received pro- title is bestowed on an individual: Head of Department, for exam-
nunciation and other standard language forms. However, it can ple. Some who write similarly of the linguistic variation between
extend to sublevels and categories and ield microcosms. In each individuals conclude with a deicit model of language, whereby
case, there is a right way of using language. his rightness is lack of language competence is addressed through complemen-
deined by social common assent or common acknowledg- tary education. Ultimately, this leads to a form of linguistic
ment. he particularity of language is that while orthodox lan- communism where all are linguistically equal (see Bourdieu
guage forms are maintained by this consensus and recognized and Boltanski 1975). However, the logic of the linguistic market
as such reconnaissance not all can use them. here can be a is that such compensatory measures will always give rise to dis-
mismatch between any one individuals connaissance and recon- appointing results in terms of social inclusion since, ultimately,
naissance, resulting from upbringing and proximity to legitimate they go against the logic of practice constituting the ield the
467
Marxism and Language
market in the irst place. Just as communist alternatives to capi- like consciousness, only arises from the need, the necessity, of
talism eventually collapsed, leading to an embracing of liberal intercourse with other men (Marx and Engels 1964, 42).
economics and free-market principles, so linguistic communism
he stress on language as central to human activity, or praxis,
cannot work since it runs counter to the raison dtre of the lin-
indicates the important role that Marx and Engels gave it in their
guistic market, which, in terms of substantive cause and efect,
account of the distinctiveness of human life. Language forms an
is social diferentiation. However, this should not be seen as a
essential part of the evolving process by which human beings in
form of poststructuralist nihilism; rather, Bourdieu is ofering a
social relationships create historical reality through the negotia-
metanoia, a new gaze or way of looking at the world through
tion of material needs and the requirement for self-reproduction.
his epistemological thinking tool (see Grenfell 2004, Chapter 7).
It is important to note, however, that language was not viewed as
Michael Grenfell either primary or derivative; it was not the faculty that enabled
human beings to become social in the irst place, nor was it the
WORKS CITED AND SUGGESTIONS FOR FURTHER READING means by which they could express themselves once they had
been socialized. Instead, it was an aspect of the social, material
Bourdieu, Pierre. 1977. he economics of linguistic exchanges. Social
Science Information 16.6: 64568. activity labor in its general, technical sense by which human
. [1982] 1991. Language and Symbolic Power. Trans. G. Raymond beings were constituted qua human beings and by which they
and M. Adamson. Oxford: Polity Press. acted upon nature and other human beings in order to create
, with Loc Wacquant. 1989. Towards a relexive sociology: A work- history.
shop with Pierre Bourdieu. Sociological heory 7.1: 2663. Within the Marxist tradition, the stress on the constitutive
, with Loc Wacquant. 1992. An Invitation to Relexive Sociology. aspect of language as a form of labor material practice was
Trans. L. Wacquant. Oxford: Polity Press. almost lost, as the term labor itself became narrowly conceived
Bourdieu, Pierre, and Luc Boltanski. 1975 Le ftichisme de la langue. simply to mean certain types of work. As a result, more atten-
Actes de la recherch en sciences sociales 2: 95107.
tion was paid to other statements by Marx and less to his original
Fehlen, Fernand. 2004 Pre-eminent role of linguistic capital in
focus on language as social activity. hese comments included
the reproduction of the social space in Luxembourg. In Pierre
his reference to the existence of a bourgeois form of language
Bourdieu: Language, Culture and Education, ed. M. Grenfell and M.
Kelly, 6172. Bern: Peter Lang. (Marx and Engels 1964, 249), his assertion that ideas do not
Grenfell, Michael. 1993. he linguistic market of Orlans. In France: exist separately from language (1973, 163), and his declaration
Nation and Regions, ed. M. Kelly and R. Bock, 7299. Southampton, UK: that the ideas of the ruling class are in every epoch the ruling
ASM & CF. ideas (1964, 60). Marxs remarks, which amount essentially to
. 2004. Agent Provocateur: Pierre Bourdieu. London: Continuum. the observations that the language in use is afected by the class
Snook, Ivan. 1990. Language, truth and power: Bourdieus Ministerium. relations that hold in a given social formation and that ideology
In An Introduction to the Work of Pierre Bourdieu, ed. R. Harker, is disseminated in language, were again rather narrowly inter-
C. Mahar, and C. Wilkes, 16079. Basingstoke: Macmillan. preted within orthodox Marxism.
In the Soviet Union, in particular, a whole set of somewhat
MARXISM AND LANGUAGE fruitless debates ensued as to whether language belonged to the
base or superstructure of society. For N. S. Marr, for example,
he aim of Marxism is to understand history and society accord-
languages were stratiied in such a way that between communi-
ing to the precepts irst outlined in the works of Karl Marx and
ties employing distinct languages, the speech of the same class
Friedrich Engels, later developed by other thinkers in this tra-
would be closer than the speech of diferent classes using the
dition, in order to efect revolutionary social change. Given the
same language. In this account, language belongs to the social
fact that Marxism is in part a description of the determinants of
superstructure of society, which is simply determined by class;
everyday life as a way of explaining the social order, it is some-
the idea that the unity of a group not based on class (such as the
what surprising, therefore, to note that the Marxist contribution
nation) could be explained by the idea of a common language
to thinking about language has been limited. his omission has
was dismissed. Marrs inluence, which was widespread in the
been unfortunate both for Marxism and for those nonformal-
1930s and 1940s, was ended by Stalins equally dogmatic dec-
ist accounts of language that stress its historicity in a general
laration in Marxism and the problems of linguistics [1950]
sense and its speciic and variable links to particular social
1974) that languages did not have a class character but rather
formations.
a national character and were thus not part of the superstruc-
In he German Ideology, as part of their attack on philosophi-
ture. Despite the title of Stalins piece, and though it was an
cal idealism, Marx and Engels provide a sketch of their material-
important correction to the misleading efect of Marrs theories,
ist conception of history. With regard to the nature and function
it did not represent any sort of breakthrough in the Marxist treat-
of language, they assert:
ment of language.
From the start the spirit [mind] is alicted with the curse of In fact, precisely such an advance had been heralded in the
being burdened with matter, which here makes its appear- writings of a number of linguists in the Soviet Union primarily in
ance in the form of agitated layers of air, sounds, in short, of Vitebsk and Leningrad which, in efect, amounted to a school of
language. Language is as old as consciousness, language is prac- Marxist linguistics. Because of the terror exercised by Stalinism,
tical consciousness that exists also for other men, and for that the exact membership of this group is unknown and the names
reason alone it really exists for me personally as well; language, used for publishing may or may not be those of the authors of
468
Marxism and Language
the works. Nevertheless, the principal texts are recognized as theorists concerned themselves directly with language, and
V. N. Voloinovs Marxism and the Philosophy of Language, pub- when they did, as in the case of Walter Benjamin or Jean-Paul
lished in 1929 and translated in 1973; Mikhail Bakhtins Problems Sartre, it is diicult to see how the work qualiies as Marxist in
in Dostoyevskys Poetics, published irst in 1929 and translated any recognizable sense. Yet a number of Marxist theorists, such
from the second (1964) edition in 1984; and P. N. Medvedevs he as Ferrucio Rossi-Landi (1983), Terry Eagleton (1982), and Jean-
Formal Method in Literary Scholarship: A Critical Introduction Jacques Lecercle (2006), have produced interesting work based
to Sociological Poetics, published in 1928 and translated in 1978. on Voloinovs text. More signiicantly, it was the inspiration for
Despite the fact that the work of Bakhtin is the best known to much of the later work of Raymond Williams, the major British
readers in the West, the most signiicant contribution to a strictly socialist critic of the twentieth century.
Marxist treatment of language was provided by Voloinovs pio- Williamss chapter on language in Marxism and Literature
neering text. (1977) stressed the importance of Voloinovs theory of signii-
he radical thrust of Voloinovs work came in his opposition cation, both in general and for his own original work on histori-
to two key tendencies that he identiied in thinking about lan- cal semiotics in Keywords (1976). Beginning with Voloinovs
guage: individualistic subjectivism and abstract objectivism. argument that signs are neither expressive nor systematic in any
he irst, traced by Voloinov to the German idealist tradition and simple sense but, rather, communicative media deployed in the
articulated most clearly in the work of Wilhelm von Humboldt, social process of making history, Williams stressed that signs are
takes the individual human psyche as the most important site shaped by past use but are engaged at the same time in the cre-
of linguistic production and focuses on the individual creative ative making of the present (and are thus of necessity open to
act of speech. Regarding speech as a type of aesthetic cre- the future). his idea of the historical variability of signs, which
ativity, this approach rejects language, understood as a Voloinov calls their mulitaccentuality, formed the basis of
ixed system, as simply the product of the abstract methods of Williamss investigation of the vocabulary of a number of discur-
linguistics. he second tendency, abstract objectivism, is the sive ields, centrally those that involved discussion of culture and
binary opposite of the irst and is typiied in the model proposed society. In essence, what he provides in Keywords and Marxism
by Ferdinand de Saussure and developed by structuralism. and Literature is a retrospective theoretical account of his work
In this approach, the static and apparently immutable linguistic in Culture and Society (1958), a text that efectively began the
system is divorced from history, is distinguished rigorously from debates that led to the appearance of cultural studies as an aca-
individual instances of language use, and is considered to be demic discipline. hough rarely acknowledged as such, it was an
composed of nothing other than the normatively identical forms historical materialist approach to language that lay at the base of
of lexis, grammar, and phonetics. If the irst focuses on the this important intellectual development.
unceasing process (energeia) of individual linguistic creativity, Marxs comments on the existence of bourgeois language
then the second treats language as a inished product (ergon), and Voloinovs assertion that the sign becomes an arena of
open to the objective gaze of the science of linguistics. the class struggle ([1929] 1973, 23) point to another ield of
For Voloinov, the concentration on individual conscious- research in which Marxist thought has been signiicant: the
ness as the basis of an explanation of linguistic signiication politics of language, with particular regard to the historical
is a mistake. he individual consciousness cannot serve as the construction of national languages, the class-based hier-
foundation of linguistic analysis because it is itself in need of archy of language within education, and the role of language
explication from a social point of view: [C]onsciousness takes in imperialism and colonialism. Important work in this area
shape and being in the material of signs created by an organized was conducted by Antonio Gramsci, the Italian Communist
group in the process of its social intercourse nurtured on signs, Party intellectual and leader, who drew attention to the class
it derives its growth from them; it relects their logic and laws perspective in his discussion of the merits and demerits of the
(Voloinov [1929] 1973, 13). his does not, however, mean that use of dialect versus a national form of language in political
the individual consciousness is formed by and in the normatively struggles in Italy. Other examples include Rene Balibars his-
identical signs of the abstract objectivist system. On the contrary, torical research on the emergence of a standard language in
Voloinovs point is that signs themselves, as dynamic com- France in Les franais ictifs (1974) and Linstitution du franais
plexes of form and meaning, are not simply presented as given, (1985), and Tony Crowleys related work in the British con-
ixed elements of a system but are open products of the activ- text in he Politics of Discourse (1989). Writing from the post-
ity the material practice of language making between socially colonial conjuncture, the Kenyan writer Ngg Wa hiongo
organized individuals. Language, in this sense, is not the middle used a Marxist approach to denounce the colonial linguistic
term that unites the individual and the social, nor is it a medium legacy in his Decolonising the Mind (1986). And in educa-
that relects a preexistent reality. Instead, it is an aspect of the tional debates, Basil Bernsteins theory of restricted and
constitutive social activity labor, in Marxs original sense that elaborated codes attempted to explain the diferential aca-
allows for the very possibility of the individual, the social, demic achievement of children from diferent social classes. In
and reality itself. Reproduction in Education, Society and Culture, written with
Despite the importance of semantic indeterminacy to post- Jean-Claude Passeron (1977), and Language and Symbolic
structuralist literary theory, and the stress on context in linguis- Power (1992), Pierre Bourdieu used a neo-Marxist framework
tic pragmatics, the radical challenge of Voloinovs work has not to account for the same phenomenon.
been taken up widely in twentieth-century thinking on language.
Tony Crowley
Even in the tradition of Western Marxism, few of the major
469
Meaning and Belief
WORKS CITED AND SUGGESTIONS FOR FURTHER READING sum, meaning and belief are diferent (the distinctness compo-
Bakhtin, M. M. [1929] 1984. Problems in Dostoyevskys Poetics. Ed. and
nent of the traditional view), and meaning allows the articulation
trans. Caryl Emerson. Manchester, UK: Manchester University Press. of belief (the expressive component).
Balibar, Rene. 1974. Les franais ictifs. Paris: Hachette. In the last half-century or so, this picture of the relation
. 1985. LInstitution du Franais. Paris: Presses Universitaires de between meaning and belief has been challenged from a num-
France. ber of perspectives. One important challenge concerns the
Balibar, Rene, and Dominique Laporte. 1974. Le Franais National. interaction of meaning and belief, addressing such questions as
Paris: Hachette. whether meaning is a relatively neutral vehicle for expressing
Bernstein, Basil. 1971. Class, Codes and Control. Vol. 1. London: Paladin. belief or something that may afect belief. his is the challenge of
Bourdieu, Pierre. 1992. Language and Symbolic Power. Ed. John linguistic relativism. Another concerns the validity of the
hompson, trans. Gino Raymond and Matthew Adamson. Cambridge,
division between meaning and belief. his is the critique of ana-
UK: Polity.
lyticity. he two challenges point in somewhat contradictory
Bourdieu, Pierre, and Jean-Claude Passeron. 1977. Reproduction in
Education, Society and Culture. London: Sage.
directions. (hese are not by any means the only ways in which
Crowley, Tony. 1989. he Politics of Discourse: he Standard Language meaning and belief have been discussed in recent years. For
Question in British Cultural Debates. Houndmills: Macmillan. example, Akeel Bilgrami [1992] addresses the issue of how to rec-
Eagleton, Terry. 1982. Wittgensteins Friends. New Left Review oncile meaning externalism with certain subjective aspects
1.135: 6490. of belief. Unfortunately, a short entry can only point to a couple of
Gramsci, Antonio. 1985. Selections from Cultural Writings. Ed. David key issues that have arisen in connection with this broad topic.)
Forgacs and William Nowell-Smith, trans. William Boelhower. As to linguistic relativism, a number of writers (most famously
London: Lawrence and Wishart. Edward Sapir and Benjamin Lee Whorf) have argued that mean-
Lecercle, Jean-Jacques. 2006. A Marxist Philosophy of Language.
ing is not simply a means for articulating belief, but a means
Boston: Brill.
of shaping belief (as well as emotion, action, even perception).
Marx, Karl, and Friedrich Engels. 1964. he German Ideology.
A popular version of this view is developed in George Orwells
Moscow: Progress Publishers.
. 1973. Grundrisse. Trans. Martin Nicolaus. Harmondsworth: novel 1984, where the government seeks to control peoples
Penguin. ideas by changing their language. he idea of any conceptual
Medvedev, P. N. [1928] 1978. he Formal Method in Literary Scholarship: A scheme relativism is diicult to sustain in global terms, as writers
Critical Introduction to Sociological Poetics. Trans. Albert J. Wherle. such as Donald Davidson (1984; see Chapter 13) have pointed
London: Johns Hopkins University Press. out. However, it is clear that we do not have at least some beliefs
Ngg Wa hiongo. 1986. Decolonising the Mind: he Politics of Language before we have some categories, and commonly those categor-
in African Literature. London: James Currey. ies go along with words and meanings. For example, small chil-
Rossi-Landi, Ferruccio. 1983. Language as Work and Trade. Trans. dren do not have beliefs about gravitation or terrorism because
Martha Adams. South Hadley, MA: Bergin and Garvey.
they do not have the relevant concepts, and the concepts are
Stalin, Joseph. [1950] 1974. Marxism and the problems of linguistics.
presumably something they acquire by learning the words and
In A Primer of Linguistics, ed. Anne Fremantle, 20318. New York:
their meanings. More importantly, it seems likely that peoples
St. Martins.
Voloinov, V. N. [1929] 1973. Marxism and the Philosophy of Language. beliefs about particular events are afected by the concepts (thus,
Trans. L. Matejka and I. R. Titunik. London: Seminar Press. meanings) available to and salient for them. hus, without the
Williams, Raymond. 1958. Culture and Society 17801950. London: Chatto concept of terrorism, perhaps Americans would have under-
and Windus. stood the events of September 11, 2001, as crimes. his would
. 1976. Keywords: A Vocabulary of Culture and Society. London: have changed their beliefs about the nature of the event, proper
Fontana. responses to the event (e.g., police investigation, extradition,
. 1977. Marxism and Literature. London: Oxford University Press. etc., rather than war), and so on.
his challenge to the expressive component of the common-
sense view seems to preserve the distinctness component. In
MEANING AND BELIEF
other words, it seems to rely on a presumption that meaning and
here is a commonsense view of the relation between meaning belief are diferent. After all, if meaning and belief are not dis-
and belief that has been tacitly presupposed in many philosophi- tinct, then it is diicult to tell exactly how meaning could afect
cal, linguistic, and other treatments of these topics. It runs some- belief. his division between meaning and belief is precisely what
thing like this. words refer to objects and have deinitions. he is challenged by the critique of analyticity.
deinitions represent properties of the objects, provide criteria here are two clear ways in which the meaning/belief divi-
for using the words to refer to objects, and allow understanding sion may be criticized. hey relate to two obvious ways in which
of such uses. hus man means, roughly, adult, human, male. the division itself may be formulated. One way concerns revis-
hat meaning picks out properties of some objects, isolating men ability. We might say that the belief component of our ideas
(rather than women, children, apes, and so on). In keeping with is revisable by reference to experience or facts. In contrast, the
this, the meaning allows us to use the term to refer to certain sets meaning component is steady in the face of new experiences
of objects (men) or particular members of that set (individual or facts. However, W. V. Quine has argued famously that there
men) and to understand what other people refer to when they use is no way of putting some truths into empirical quarantine and
the term. Additionally, meanings allow us to express our beliefs judging the remainder free of infection. hus meaning and
about members of sets of objects (generally or individually). In sensory evidence are inextricably intertwined (1976, 139; see
470
Meaning and Belief
also 1981, 67, 712). his suggests a number of things about the other words, understanding father involves various schemas
relation between meaning and belief. Perhaps most obviously, that cluster information into relations. hese schemas have
it indicates that sentences are not true by their meanings alone. default values (such as father is mothers husband), perhaps
More importantly for our purposes, it suggests that what we con- along with speciied alternatives to defaults (such as father is
sider meanings are open to empirical revision, precisely in the divorced from mother). his information is hierarchized in that
manner of beliefs. For instance, a hundred years ago, one might we generally consider items higher in the hierarchy to be more
have thought that My father is a man was true analytically. criterial for application of the term than items lower in the hier-
However, sex-change operations have shown us that the mean- archy. Put very simply, if we ind out that Peter is Sallys progeni-
ing component male is revisable due to empirical information tor but is not the breadwinner, we are more likely to count him
about the objects to which father refers. If some idea about an as Sallys father than if we ind out that he is the breadwinner but
object or set of objects is revisable due to empirical information not the progenitor. On the other hand, hierarchy efects are not
about referents, then it would seem to count as a belief, not as a absolute. We may be more inclined to apply father to the bread-
meaning. winning, afectionate, live-in husband of Sallys mother than to
here are undoubtedly cases where we would ind it diicult an unknown progenitor. (he last point, if developed further,
to imagine such revision. For example, I have no good idea for would lead us to the place of prototypes in lexical seman-
how I could possibly revise my view that If Jones is currently a tics. However, the inclusion of prototypes or for that matter,
man, then Jones is currently a male, adult, human. (Obviously, exemplars would not afect the main argument as it bears on
there are scenarios where the meaning of man could change, but meaning and belief.)
that is not at issue.) However, it might be argued that this tells us Insofar as this model of meaning is accurate, it suggests,
something about my imagination, not about the facts. Perhaps irst of all, that there is no sharp meaning/belief division. here
it was impossible for people to imagine sex-change operations a does not seem to be any point at which the information associ-
century ago. It may be that what we consider to be meaning is a ated with a given heading stops being semantic and starts being
function of what we can imagine changing. But our imagination empirical. On the other hand, it also suggests that the meaning/
could always be mistaken. belief division is not wholly pointless in that there does appear
On the other hand, perhaps the obvious cases of revision are to be a continuum from more deinition-like information to
not so obvious as they initially appear. For example, people did more observation-like information. But this, too, is not all.
imagine men changing into women and women changing into he hierarchical continuum is not determinative. We may think
men well before sex-change operations. If Tiresias had a child of the hierarchy as a series of weighted properties and/or rela-
as a man, then was transformed into a woman, his child could tions. Although those higher in the hierarchy are more heavily
truthfully say, My father is a woman. So perhaps male was weighted, they may be outweighed by a large enough number of
always only more limitedly part of the meaning of father, closer lower-level properties/relations. Alternatively, in connectionist
to a belief than we recognize. In this way, it may be that the revis- terms, a large number of weak connections may reach some acti-
ability argument is not deinitive. vation threshold that is not reached by a small number of strong
A more productive approach may still be Quinean in orien- connections. his last point suggests that despite the hierarchy,
tation naturalizing our treatment of the topic, as Quine all information associated with lexical items is in some ways
often urged (1969; see Chapter 3) by turning to the natural sci- more akin to belief than to meaning (though perhaps neither
ences. Here, we might consider two sorts of cognitive architec- term is truly adequate here).
ture that are common in discussions of meaning today. (his, As the preceding reference to connectionism suggests, the
of course, is not Quinean as it is mentalistic.) he irst is inten- same conclusions hold for accounts of meaning that rely on neu-
tional/representational; the second relies on neural networks, ral networks. For example, neural accounts treat meaning as a
either artiicial (see connectionism, language science, complex of circuits linking conigurations of neurons in difer-
and meaning ) or natural (see semantics, neurobiology ent areas of the brain insofar as these bear on the sound of the
of ). relevant word, the appearance of the referent, our own actions
A standard intentional/representational account of lexical as they might bear on the referent, and so on (see, for example,
semantics involves headings, some sort of meaning units con- Chapter 4 of Pulvermller 2002). hese circuits are presumably
nected with the headings, and connections across headings. not fully ixed and identical across all uses. Rather, the precise
he connections across headings establish lexical relations coniguration activated at any given moment will vary, depend-
of various sorts, including semantic fields. he semantic ing on what other neural circuits are simultaneously activated.
units themselves are structured into complexes of relations For example, suppose I say squeeze. hat activates circuits that
with default values and are typically hierarchized, such that include neuron populations that govern closing together the in-
some units are more important than others. Consider, for gers of the dominant hand. Suppose I then say ball. he, so to
example, man. his entry is linked to woman for one domain speak, resting circuit for ball includes a range of neuron popu-
(adult human), to boy for another domain (male human), lations, some of which bear on closing together the ingers of the
and so on. It includes a range of information, comprising not dominant hand. Since some part of the latter population was just
only deinitional components, but empirical components as activated by squeeze, it should be more fully activated by ball. he
well. For example, 50 years ago, father included not only male, prior activation due to squeeze will slightly alter the circuit acti-
adult, human, and progenitor of ego but probably hus- vated by ball, perhaps enough to make one think of hand-sized,
band of mother and breadwinner of the family or the like. In rubber balls.
471
Meaning and Stipulation Meaning Externalism and Internalism
Of course, not everything is equally variable. here are some of items to which the noun refers (see intension and exten-
connections in these networks that are stronger than others. sion). One can only adjudicate the deinition of a term by refer-
hese diferences in connection strengths should correspond ence to an extension. For example, consider a deinition of U.S.
roughly with the hierarchy of properties/relations in the inten- state that involves the criterion of continuous land. One can
tional/representational account. Here too, then, we have reason reject this deinition by pointing to Hawaii, which is part of the
to believe that there is some sort of continuum. Not all of our extension of U.S. state. But one can only adjudicate an extension
ideas about a set of objects are equally salient, expected, and so by reference to a deinition. In other words, we rely on a deini-
forth. However, none seems precisely to qualify as a meaning, to tion of U.S. state to judge that Hawaii is a U. S. state. hus, one
be distinguished from a belief and, once again, a greater degree cannot adjudicate a deinition and an extension simultaneously.
of activation bearing on initially weaker connections may have One of the two has to be established arbitrarily. By this argument,
greater efects than a weaker activation bearing on initially strong there is no such thing as the real meaning of any term, includ-
connections. Here too, then, any correlates we may posit for the ing meaning. Meaning may be social, intentional, or whatever,
neuronal circuits seem more like beliefs than like meanings. as we choose in particular contexts. hus, whenever we engage
In conclusion, we might return briely to linguistic relativ- in an interpretive task, the type of meaning at issue should be
ism. If the preceding discussion of meaning/belief (non)distinct- stipulated.
ness is accurate, then it seems that we cannot reasonably say his argument disposes of one problem what meaning
that meanings guide beliefs. We can only say that some beliefs really is. But it leads us to three other concerns.
afect other beliefs. On the other hand, we also cannot say that he irst is ontological just what sorts of meaning actually
meanings simply allow us to express beliefs. Our ideas about the exist. We may, for example, stipulate Platonic meaning as our
world and our production and reception of language are, rather, object of hermeneutic interest (see philology and herme-
dynamic (neurocognitive) processes. hese processes do not trap neutics). But we cannot actually interpret for Platonic mean-
us in a prison house of language (as some writers have put it). ings if they do not exist.
But they also do not allow us some simple freedom to describe he second concern takes up the demarcation of our stipula-
and evaluate the world in abstract removal from the perception, tive categories. hese need to be adequately precise. For exam-
memory, and other circuits that are already in place when we ple, we might stipulate that we are concerned with intentional
come to formulate our descriptions and make our evaluations. meaning. But there are numerous sorts of intentional meaning
that should often be distinguished in the case of legal inter-
Patrick Colm Hogan
pretation, the self-conscious intent of the author who drafted a
piece of legislation, the intents of the legislators who passed it,
the intents of the judges who gave opinions on its constitutional-
Bilgrami, Akeel. 1992. Belief and Meaning: he Unity and Locality of ity, and so on.
Mental Content. Cambridge, MA: Blackwell. he inal concern bears on the particular purposes for which
Davidson, Donald. 1984. Inquiries into Truth and Interpretation. we are interpreting. For example, for any given term in a law,
Oxford: Clarendon Press.
there may be variable social meanings. Ordinary people may use
Pulvermller, Friedemann. 2002. he Neuroscience of Language: On Brain
a term with one meaning; scientists may use it with a slightly dif-
Circuits of Words and Serial Order. Cambridge: Cambridge University
Press.
ferent meaning. In particular cases of interpretation, the meaning
Quine, W. V. 1969. Ontological Relativity and Other Essays. New associated with one or the other group may be more signiicant.
York: Columbia University Press. Note that in these cases, we are not trying to determine the real
. 1976. he Ways of Paradox and Other Essays. Rev. ed. Cambridge: meaning of the law. Rather, we are acknowledging that there are
Harvard University Press. many sorts of meaning and we are trying to determine which is
. 1981. heories and hings. Cambridge: Harvard University Press. the most important in the case at hand.
Patrick Colm Hogan

MEANING AND STIPULATION
Meaning is commonly understood as social, mentalistic, or
abstract. A social account views meaning as existing in social Bundgaard, Peer, and Frederik Stjernfelt. 2009. Patrick Hogan
groups. A mentalistic account places meaning in individual [Interview]. In Signs and Meaning: 5 Questions , 7185. Copenhagen:
minds. An abstract account locates meaning in a Platonic realm. Automatic Press/VIP.
Discussions of meaning often involve debates about which Hogan, Patrick Colm. [1996] 2008. On Interpretation: Meaning
and Inference in Law, Psychoanalysis, and Literature. 2d ed.
of these gives the real meaning of a term or utterance. For
Athens: University of Georgia Press.
example, in legal interpretation, there have been debates
Levinson, Sanford, and Steven Mailloux. 1988. Interpreting Law and
between writers who view the Constitution as an ongoing prod- Literature: A Hermeneutic Reader. Evanston, IL: Northwestern
uct of social developments and those who see it as ixed by the University Press.
Framers intent.
A stipulative account of meaning argues that such debates are
pointless. hey are, in efect, debates over the real meaning of
MEANING EXTERNALISM AND INTERNALISM
the word meaning. Formally, the meaning of any common noun Hilary Putnam (1975) argued for a view now known as mean-
(such as meaning) involves a deinition and an extension or set ing (or semantic) externalism the view that there are terms
472
Meaning Externalism and Internalism
whose meanings are not determined by their users psycholog- precisely the same conditions. So they do not mean the same
ical states. Meaning internalism is simply the denial of meaning thing by their utterances, and this diference in meaning appears
externalism. traceable to the term water. Water in English has a diferent
Putnam qualiies his meaning externalism by explaining that extension from water in Twin Earth English. Hence, since mean-
he intends psychological state to be understood in the nar- ings are (at least) intensions, and intensions determine exten-
row sense, according to which a psychological state implies the sions, water means something diferent in Oscar1s mouth than
existence of nothing but its possessor (1975, 21922). Another it does in Oscar2s. Externalism vindicated.
equally signiicant qualiication is that the argument will show As Colin McGinn (1977) and Tyler Burge (1979) have pointed
that meanings just aint in the head, as Putnam memorably out, the same thought experiment can be used to challenge the
puts it, only if a terms meaning is taken to be (at least) an inten- view that beliefs and other propositional attitudes are
sion, that is, a function from possible circumstances (or worlds) narrow psychological states. When Oscar1 says Water is odor-
to its extension, or the set of objects to which the term applies less, he is expressing one of his beliefs, but this belief is difer-
(Putnam 1975, 227; see intension and extension, referent from the belief Oscar2 expresses via the same sentence. (he
ence and extension, and possible worlds semantics). two beliefs are true under diferent conditions; that, according to
Although there is a good deal of contemporary skepticism most theorists, suices to distinguish them.) Similar points could
about the existence of narrow psychological states (see the fol- be made about the other propositional attitudes. If we continue
lowing), Putnam assumes that at least some psychological states to assume that there are narrow psychological states and that the
are narrow; beliefs, thoughts, feelings, and interior monologue narrow psychologies of Oscar1 and Oscar2 could be identical, it
are all given as examples (1975, 224). will follow that at least some beliefs and other propositional atti-
Putnam argues for meaning externalism with his famous tudes are not narrow.
Twin Earth thought experiment (1975, 2237). Suppose that Under the sway of Putnam (1975), McGinn (1977), and Burge
somewhere in the galaxy there is a planet, Twin Earth, which is (1979), however, many theorists are now skeptical that there
just like Earth save one detail: he liquid that lows from Twin are any narrow psychological states. Even sensations and other
Earthian faucets, falls from Twin Earthian skies, and ills Twin phenomenally conscious psychological states narrow psycho-
Earthian oceans is not water. It is macroscopically identical to logical states if any such there be have recently been argued
water, but, unlike water, it is not the chemical compound H2O. to be examples of wide psychological states. (See Dretske 1996;
Instead, it is some other complicated chemical compound Lycan 2001; and Tye 1995.) Putnam himself (1996) avows skep-
that can be abbreviated XYZ. Twin Earthians speak a dialect of ticism about narrow psychological states. What becomes of the
English, and Earthians and Twin Earthians both use the term thesis of meaning externalism if we suppose that narrow psy-
water, but the extension of water in their respective dialects is chological state is an empty term? If an ordinary human subject,
diferent. In English, water applies to all and only samples of S, lacks narrow psychological states, then S's narrow psychol-
H2O. In Twin Earth English, water applies to all and only sam- ogy determines the meanings of S's terms is true, but vacuously.
ples of XYZ. To avoid this hollow victory for internalism, it is perhaps best to
Now consider two subjects, Oscar1, an Earthian, and Oscar2, recast the distinction between meaning externalism and inter-
a Twin Earthian, both of whom have interacted with, and have nalism in terms of a distinction between intrinsic and extrinsic
beliefs and other psychological attitudes concerning, the water- properties: An intrinsic property is one that an object possesses
like liquid native to their respective planets. Suppose both to be independently of its relations to other objects, whereas an extrin-
living in 1750, before anyone on their planets knew anything sic property is one that an object possesses in virtue of its rela-
about the underlying chemistry of the liquids found thereupon. tions to other objects.
Putnam claims that it is possible for Oscar1 and Oscar2 to be in Given the intrinsic/extrinsic properties distinction, we can
the same narrow psychological state (1975, 224). Since both are reformulate meaning externalism as the view that there are
chemically unsophisticated, neither has beliefs characterizable terms whose meanings are not determined by their users intrin-
with H2O or XYZ that could potentially distinguish their nar- sic properties (regardless of whether there are any intrinsic psy-
row psychologies. Furthermore, given the macroscopic identity chological properties).
between H2O and XYZ, it seems plausible to suppose that all of
Max Deutsch
Oscar1s attitudes, feelings, and sensations about the liquid that
is in fact H2O on his planet could be matched by exactly simi-
lar attitudes, and so on, of Oscar2s toward the liquid that is in
fact XYZ on his planet. Indeed, as Putnam suggests, Oscar1 and Burge, Tyler. 1979. Individualism and the mental. Midwest Studies in
Oscar2 could well be molecule for molecule Doppelgngers, Philosophy 4: 73122.
thus, it would seem, guaranteeing their narrow psychological Dretske, Fred. 1996. Phenomenal externalism. In Philosophical Issues.
Vol. 7. Ed. Enrique Villenueva, 14358. Atascadero, CA: Ridgeview.
identity (1975, 227).
Lycan, William. 2001. he case for phenomenal externalism.
When Oscar1 says Water is odorless, however, does he
Philosophical Perspectives 15: 1735.
mean what Oscar2 means when he says Water is odorless? It McGinn, Colin. 1977. Charity, interpretation, and belief. Journal of
seems not. For what Oscar1 says is true if and only if H2O is odor- Philosophy 74: 52135.
less, while what Oscar2 says is true if and only if XYZ is odorless. Putnam, H. 1975. he meaning of meaning. In Mind, Language,
If, however, Oscar1 and Oscar2 mean precisely the same thing and Reality: Philosophical Papers, II: 21571. Cambridge: Cambridge
by their utterances, then those utterances would be true under University Press.
473
Media of Communication
. 1996. Introduction. In he Twin Earth Chronicles: Twenty Years be lengthy, involving several stages and many revisions. And
of Relection on Hilary Putnams he Meaning of Meaning, ed. writing especially in printed or other permanent forms may
Andrew Pessin and Sanford Goldberg, xvxxii. London: M. E. Sharpe. be received in quite diferent contexts from those in which it was
Tye, Michael. 1995. Ten Problems of Consciousness. Cambridge, produced. he writer must anticipate how the efects of a dis-
MA: Bradford Books, MIT Press.
placed or unknown context might guide interpretation or lead
to misinterpretation. And readers, of course, must typically rely
on the written text alone in arriving at its sense. Writing, conse-
MEDIA OF COMMUNICATION
quently, is forced to be less reliant on its immediate context for
his term refers to the means by which communication its meaning.
takes place or the choice of substance for realizing a commu- Speech is often treated as the primary medium of communica-
nicative act. tion and this for various reasons. In human evolutionary terms, it
It has long been recognized that choosing between alterna- is broadly universal and involves speciic biological adaptation
tive means or media of communication can have consequences unlike writing, which emerges as the product of particular his-
for the linguistic form of the message. hus, the choice of writ- torical societies and is not universal either within or across them.
ing instead of speech as a medium of communication may entail Speech is acquired during a critical language-learning period
particular grammatical, syntactic, or lexical preferences over (see critical periods) very early in life. Writing is acquired
others. Indeed some work suggests that certain kinds of linguistic later and usually as the focus of explicit instruction. Nonetheless,
patterning may be distinctive to particular media (Biber 1988). with the advent of a range of alternatives to speech and writing
From Ferdinand de Saussure (1912) onward, much work as media of communication, it is diicult to insist upon a simple
has been devoted to specifying the diferences between speech dichotomy between oral, situated, face-to-face communication,
and writing. (See also oral composition; oral culture; one the one hand, and visual, decontextualized, noninteractive
writing systems; writing, origin and history of). communication, on the other especially when technological
One approach deals with speech and writing as alternative developments in communication media are considered.
expressions of the same underlying language system, realized We may distinguish broadly among three overlapping phases
in difering ways depending on the medium adopted. hus, in the development of alternatives to speech as media of commu-
the written medium is associated with greater lexical density, nication: mechanical (writing, print), electrical (telegraphy and
a wider range of grammatical structures, a greater degree of wireless telegraphy, radio, and television) and digital (World
embedding, and more varied forms of connectivity between sen- Wide Web and the Internet, cellular phones, and the convergence
tences. Conversely, the spoken medium is associated with lexi- or interaction between these and previous media of communica-
cal repetition, low lexical density, vague or indeinite expressions tion). Developments in communication at a distance for military
(thingymajig), high incidence of coordinated clauses linked by and commercial purposes using semaphore and other lag sig-
common conjunctions (and, but), and selection of the active naling systems are particularly evident in Europe in the late eigh-
rather than the passive voice. he character of these diferences teenth and early nineteenth century. hese were forerunners of
has led one author to characterize speech as a process and writ- the electric telegraph initially designed by Samuel Morse in the
ing as a product (Halliday 1985). 1830s. he use of electrical impulses to make possible commu-
A more radical view suggests that sentence grammars gener- nication at a distance then underpins the development of the
ally have been implicitly biased toward the study of writing, and telephone in the 1870s, and forms of wireless telegraphy in the
grammars would be quite diferent if they were formulated from 1890s, to be followed by radio and television broadcasting in
the outset to take account of speech phenomena. As David Brazil the irst and second half of the twentieth century. In most cases,
observes, if any part of the outcome [of a grammar of speech] each technological development may be seen to favor particular
looks like a sentence, this comes about as an interesting by-prod- linguistic selections over others. he telegraph and subsequent
uct of the processes we are interested in, not as the planned out- telegram, because of the cost of transmission and the premium
come to which these processes owe their deinition (1995, 39). placed on time, tended to favor certain kinds of abbreviation
Certainly there is widespread agreement that the commu- principally the deletion of grammatical function words, such as
nicative potentialities of writing and speech are very diferent. articles, determiners, and verb auxiliaries.
Speech typically takes place between interlocutors who are in he early electrical media of communication at a distance tend
some way copresent to each other, and this enables them to to be dyadic and reciprocal, rather than one-way and noninterac-
adjust their utterance in the light of the apparent reactions of tive. But many of the subsequent and most far-reaching develop-
the other. he process of composing and planning speech goes ments in communication at a distance in the twentieth century
hand in hand with the act of speaking, and speaking, in turn, tend to be one-to-many rather than two-way. Broadcasting is
goes hand in hand with the process of interpretation that must perhaps the best term for these developments which include,
keep pace with it. here is no time lag between production and preeminently, radio and television; and in one form or another
reception. Instead, speech is temporally bound, transient, and these have become ubiquitous forms of communication in the
dynamic, rooted in an unfolding context, with paralinguistic modern era.
behavior providing an important supplementary layer to com- Despite the ubiquity of radio and television, it is diicult
munication (see paralanguage). Conversely, writing as a to characterize the language of broadcasting in any distinctly
semipermanent product enables a gap across time and space to homogenous fashion. Instead, it is best understood as a medley
open up between participants. he process of composition may of distinct genres, including news interviews and reports, comic
474
Media of Communication
monologue, soap opera, various kinds of reality programing, from reading a script to speaking ex tempore, from address to
commercials, commentary on public events including sporting the absent audience to address to a copresent interlocutor, from
occasions, argument, drama, talk shows, and phone-ins (several script written to be read aloud as if unscripted to reading aloud
of which have begun to attract systematic study; see Hutchby an e-mail from the audience.
2005; Tolson 2005). Although there may be generic antecedents In the movement from one phase to another in the devel-
to these in the world of real-time, face-to-face communication, opment of technologies of communication, there are shifts of
certain properties seem to set broadcast genres apart from every- emphasis between one-to-one and one-to-many. he emergence
day nonmediated communication. For one thing, the idealized of writing and print allows communication of the one to the
speaker and hearer of the canonical speech situation, recipro- many. he emergence of wireless telegraphy allows one-to-one
cally exchanging roles and utterances, no longer easily applies but over extreme distances. Broadcasting prioritizes one-to-
except in grossly simpliied ways. many and further collapses both temporal and spatial distances.
Instead, as Erving Gofman (1981) observes, broadcast com- he recently launched digital phase that has followed in the wake
munication takes shape from complex production formats and of broadcasting has allowed the most radical innovations regard-
participation frameworks in which the discourse is sometimes ing the conigurations in time and space of participants to com-
scripted, sometimes relatively spontaneous, sometimes spoken, munication: Instantaneity over distance is possible, and extreme
sometimes written, sometimes written to be spoken, sometimes forms of both one-to-one and one-to-many communication can
single authored, sometimes multiply authored, sometimes dia- become blended in a single message.
logue, and sometimes monologue. Indeed, Gofman suggests Text messaging (SMS) and e-mail, for instance, can be one-
replacing the term speaker with notions of author, animator, to-one or forwarded to a larger audience; are primarily asyn-
and principal. he author is the one who has selected the sen- chronous, but single messages can develop into an extended
timents that are being expressed and the words in which they are dialogue; often assume a fast response, but this may be delayed
encoded (Gofman 1981, 144). he animator is the one who gives if the recipient is oline; and seem transient or ephemeral but
voice to the words that have been selected, sometimes by some- may be archived (sometimes in hard copy, in the case of e-mail)
one else. he principal is whoever is potentially held to account for later use. he language style of such communication may
for the sentiments expressed. In many situations, the three roles well include extreme abbreviation, slang, contractions, phonetic
coalesce, but in broadcast communication in news programs, spelling, erratic punctuation, and short forms, and it seems
for instance the presenter who reads the news from the autocue to operate in an unstable and luctuating zone between speech
may merely be animating a script authored elsewhere, by the on the one hand and writing on the other. his might only be a
editorial team, and the ultimately accountable source for the dis- matter of linguistic curiosity except that variation between styles
course the principal may be the organization itself. hus, in of communication interact with questions of formality and the
the case of a BBC news bulletin, it may be the director general quality of the social relationship. Many commentators have
or members of the board of trustees who resign their positions pointed to growing informality in communication in the modern
should an item be called into question, not necessarily the news era, using such terms to describe the shift as informalization
editor, and certainly not the news presenter. (Elias 1996), the democratisation (or conversationalisation) of
Just as various alignments are possible in terms of the produc- discourse (Fairclough 1992), intimacy at a distance and para-
tion of broadcast communication, important distinctions apply social interaction (Horton and Wohl 1956), synthetic personal-
in its reception, where the potential participation framework isation (Fairclough 2001), and broadcast sociability (Scannell
is equally complex. As Gofman again observes, an utterance 1996). he emphasis in these accounts varies between attention
does not carve up the world beyond the speaker into precisely to forms of the message and attention to forms of the relation-
two parts, recipients and non-recipients, but rather opens up an ship aforded by the message, but what generally seems to be
array of structurally diferentiated possibilities, establishing the at stake is a changing sense of what counts as public space and
participation framework in which the speaker will be guiding what counts as the appropriate linguistic and social demeanor
his delivery (1981, 137). Broadcast communication is quite fre- for it. While larger processes of social change may well underpin
quently oriented to two kinds of recipient. In studio interviews, these shifts, the changing media of communication have clearly
for instance, in chat shows or news programs, there is the imme- contributed to them.
diate recipient of the talk the interviewee or the interviewer
Martin Montgomery
but beyond them is the overhearing audience numbering in size
from thousands to millions. In this way, in posing a question to WORKS CITED AND SUGGESTIONS FOR FURTHER READING
an interviewee, the discourse of the interviewer is bidirectional. It
is oriented in the irst instance to the interviewee, but the design Aitchison, Jean, and D. Lewis. eds. 2003. New Media Language.
London: Routledge.
of the question will also be shaped by the assumed concerns of
Biber, Douglas. 1988. Variation across Speech and Writing.
the broadcast audience beyond. Talk for an overhearing mass
audience in this way assumes characteristics distinctive to the Brazil, David. 1995. A Grammar of Speech. Oxford: Oxford University
medium that are diferent from ordinary talk or conversation. Press.
It should be noted also that in the broadcast media, the Crystal, David. 2001. Language and the Internet. Cambridge: Cambridge
foundational distinction in considering media of communica- University Press.
tion between speech and writing becomes confounded. Within Elias, Norbert. 1996. he Germans. New York: Columbia University
a continuous stretch of discourse, a language user may switch Press.
475
Memes and Language
Fairclough, Norman.1992. Discourse and Social Change. Cambridge, In linguistics, the plausibility of meme-based approaches is
UK: Polity. supported by the increasing productivity of Darwinian thought
. 1995. Media Discourse. London: Arnold. in language studies, which has inspired new eforts to explain
. 2001. Language and Power. 2d ed. London: Pearson Education. the evolution of language (see Hurford 2006) and to understand
Gofman, Erving. 1981. Forms of Talk. Oxford: Basil Blackwell.
the historical development of languages in Darwinian terms
Halliday, Michael Alexander Kirkwood. 1985. Speech and Writing.
(e.g., Croft 2001). Although the Darwinian algorithm depends on
Horton, Donald, and R. Richard Wohl. 1956. Mass communication
the existence of replicating units, the potential of meme-based
and para-social interaction: Observations on intimacy at a distance. approaches to language remains largely to be explored.
Psychiatry 19: 21529. Memetic theories of language need to address at least three
Hutchby, Ian. 2005. Media Talk: Conversation Analysis and the Study of fundamental questions. First, linguistic memes, or replicat-
Broadcasting. Buckingham: Open University Press. ing constituents of linguistic competence, need to be plausi-
Saussure, Ferdinand de. 1916. Cours de linguistique gnrale. Ed. C. Bally bly conceptualized as material patterns with determinable and
and A. Sechehaye, with the collaboration of A. Riedlinger. Lausanne empirically detectable structures. Second, the mechanics by
and Paris: Payot. which memes are replicated need to be determined. hird, the
Scannell, Paddy. 1996. Radio, Television and Modern Life. Oxford: Basil factors that determine the diferential replication of meme vari-
Blackwell.
ants need to be identiied.
Tolson, Andrew. 2005. Media Talk: Spoken Discourse on TV and Radio.
A study that adheres strictly to Dawkinss original pro-
Edinburgh: Edinburgh University Press
posal is that of Nikolaus Ritt 2004. Following connectionist
approaches to competence modeling, he sees language memes
MEMES AND LANGUAGE as patterns acquired by neural networks during language
Memes are information patterns that are culturally transmittable acquisition. hus, a meme representing a phoneme contains
and undergo Darwinian evolution: Variation among meme types a) links to conigurations underlying articulatory gestures and
is created when patterns are altered, recombined, or transmit- b) links to areas that are excited by speciic auditory impres-
ted imperfectly, and selection takes place when more stable or sions, as well as c) links to representations of morphemes for
more easily transmittable meme variants come to oust competi- whose distinction the phone-meme is relevant. herefore,
tors that are less it, that is, less stable or transmittable. As far as phone-memes have both determinable internal structures (i.e.,
their material implementation is concerned, memes are gener- the links between auditory and articulatory conigurations) and
ally thought to exist in brains as constituents of human knowl- determinable positions within the larger networks that imple-
edge. Whether human behavior and artifacts should be regarded ment linguistic competence. Memetic constituents coding for
as external expressions of memes or as alternative ways in which phonotactic regularities, rhythmic conigurations, morphs, or
memetic information can be implemented is still disputed, syntactic categories and constructions are construed in sim-
although the former view seems to be gaining ground. ilar terms.
he term meme was coined by the evolutionary biologist he replication of language memes involves communica-
Richard Dawkins (1976, 192) to denote cultural counterparts tion, acquisition, and accommodation. Since a speakers com-
of genes. Dawkins introduced the concept to support the argu- municative behavior is constrained by his or her competence,
ment that Darwinian evolution is not limited to the biological utterances will automatically express the memetic constituents
domain but represents an algorithmic process that will afect by whose activation they are caused. hen, the mind-brains of
any patterns that are suiciently stable and copied in suicient recipients and those of children in particular will attempt to
numbers with suicient idelity. While in the evolution of species assume organizations by which they can emulate the utterance
those patterns are genes, the historical development of human behavior they are exposed to. hereby, copies of memes that are
cultures might relect the evolution of memes. expressed in utterances get created.
he concept of memes is linked to the idea that human cul- Among possible factors determining the diferential repli-
ture is a Darwinian system that can be understood best on the cation of meme variants, three types are distinguished. First,
level of the elements on which selection operates. A memetic meme replication must be constrained by physiological proper-
view of culture regards humans as physiologically complex, yet ties of their hosts. hus, meme variants that are easy to express in
relatively passive meme hosts. heir instinctive inclination to articulation and whose expression is easy to perceive will be uni-
imitate one another turns them into meme vehicles with lim- versally itter than more costly and less easily perceivable com-
ited control over the memes they acquire, express in behavior, petitors. Second, memes will be sensitive to such social factors
or pass on to other humans. Of course, meme replication will as power relations within and across groups. he more powerful
depend, to a considerable extent, on the physiological makeup, and prestigious that individuals or groups are perceived to be,
the well-being, and the needs of their hosts, and memes that the more often will their behavior be imitated. hird, the repli-
inlict obvious harm on their human carriers are unlikely to cation of any meme will depend on other memes in the system.
thrive. However, the ultimate reason why any Darwinian rep- Since utterances always express many memetic constituents
licator exists is its capacity to get itself transmitted before it simultaneously, stable languages will contain mutually coad-
disintegrates. hus, a memetic approach to human behavior apted memes, which co-express with minimal distortion of one
challenges hermeneutic views (see philology and herme- anothers expressions.
neutics ), which derive it from the irreducibly subjective per- While the predictions derived from physiological and social
spectives of intentional agents. constraints on meme selection seem to mirror those of speaker-
476
Memory and Language
based theories that derive the properties of languages from the represents the occurrence of speciic events, for example, when
needs of their users, the co-adaptive pressures among memes and where you last wrote a letter.
promise new explanations of long-term conspiracies in language Despite the pervasiveness of memory in cognitive functions
change, or the existence of typological classes. hus, most Old such as language, historically memory systems have been inves-
and Middle English sound changes that altered the metrical tigated and theorized as separate and distinct from other cogni-
weight or the syllabic structure of lexemes produced outputs that tive systems. his approach, however, can be diicult to sustain
were more trochee-like than their inputs. From a memetic per- because of the centrality of memory to cognitive functions,
spective, they can be explained as morphotactic adaptations of especially language. Some current models postulate that rep-
lexemes to rules coding for foot isochrony. resentations and processes used for memory and language
Strictly memetic approaches to language are still a minor- are inseparable and part of the same system at both a behavioral
ity program. While adherents regard memes as essential to any and neural level (MacKay et al. 2007; see hippocampus). Here,
truly Darwinian theory of language, even some of the linguists we review the relation between memory and language in behav-
who pursue explicitly Darwinian agendas (e.g., Croft 2001) prefer ior and consider evidence relevant to whether they are distinct
to think of selection as being performed on utterance constitu- systems.
ents and to attribute more active roles to speakers as agents of
change. Skeptics (e.g., Aunger 2001) also emphasize the need to Semantic Encoding in Memory and Language
formalize memetics, the missing evidence of neural replicators, A long-standing feature of models of language processing is that
and the paucity of empirical studies demonstrating the explana- perception of a word activates its lexical representation and that
tory potential of the approach. activation automatically spreads to associated conceptual infor-
mation, including semantic properties of the word that consti-
Nikolaus Ritt
tute its meaning (e.g., Rapp and Goldrick 2000; see spreading
activation). A clear demonstration of this feature is seen in the
Stroop task in which participants are instructed to ignore a word
Aunger, Robert. 2001. Conclusion. In Darwinizing Culture: he Status and simply name its ink color. Despite instructions to ignore the
of Memetics as a Science, ed. Robert Aunger, 20533. Oxford: Oxford word, color naming latency is faster when the base word is the
University Press. same as the ink color than when it is a diferent color, for exam-
Blackmore, Susan. 1999. he Meme Machine. Oxford: Oxford University
ple, the word blue written in red. his diference in latency can
Press.
occur only if the meaning of the base word is accessed, despite
Croft, William. 2001. Explaining Language Change. London: Longman.
Dawkins, Richard. 1976. he Selish Gene. Oxford: Oxford University
instructions to ignore the word. his automatic semantic encod-
Press. ing of a word is clearly a process that is an essential part of a pri-
Dennett, Daniel. 1995. Darwins Dangerous Idea: Evolution and the mary language function, namely, language comprehension, as it
Meanings of Life. New York: Simon and Schuster. is essential to understanding the meaning of a word, sentence
Hurford, James R. 2006. Recent developments in the evolution of lan- and discourse (see discourse analysis [linguistic]).
guage. Cognitive Systems 7 (November): 2332. his language process is also part of encoding in episodic
Ritt, Nikolaus. 2004. Selish Sounds and Linguistic Evolution. memory. For example, in a variant of the Stroop task, color nam-
Cambridge: Cambridge University Press. ing latency was slower for taboo base words (e.g., whore) than
for neutral base words (e.g., wrist). In a subsequent surprise
recall test, memory was better for taboo than neutral base words
MEMORY AND LANGUAGE
(MacKay et al. 2004; see emotion and language and emo-
he study of memory is concerned with the way that information words). his diference in the efects of taboo and neutral
tion is represented and stored over time in the mind and how base words demonstrates that the automatic semantic activation
it is retrieved and inluences behavior. Memory is essential for that occurred during perception of the base word was the basis
all cognitive functions, including those that are intentional and not only for lexical comprehension (which slowed color nam-
under conscious control and those that occur automatically ing latency for taboo base words) but also for the representation
without conscious control. he dominant theoretical approach involved in subsequent episodic memory recall. he strong inlu-
postulates several separate memory systems that vary in the ence of meaning on memory indicates an overlap of comprehen-
nature of their encoding and retrieval, their duration, and their sion and memory representations.
neural substrates, as well as how they are afected by diferent he degree of semantic activation during encoding also has
variables, such as the age of the person or the level of process- a strong efect on how well verbal material is remembered.
ing during encoding (e.g., Tulving and Schacter 1990). For Participants remember more words in a surprise episodic
example, semantic memory stores long-term conceptual memory test after making judgments about word meaning
knowledge including linguistic knowledge, such as words used compared to judgments about phonology or physical form
to express concepts; procedural memory represents learned (Craik and Tulving 1975). he idea that semantic processing is
skills and perceptual-motor routines that can be enacted with a deeper level of processing that improves memory has been
little attentional control, for example, reading; working criticized for being a circular explanation. Nevertheless, partici-
memory represents the current content of consciousness and pants had no advance knowledge that memory would be tested
enables manipulation of this information as when, for exam- and thus they did not engage in mnemonic strategies, and so
ple, a reader develops the meaning of a text; episodic memory the indings demonstrate that semantic processes involved in
477
Memory and Language
understanding language form the basis for representing speciic target word, especially in sound, sometimes persistently come to
occurrences of words in memory. mind, but these alternate words are a consequence, not a cause,
of the gap produced by the TOT (Burke et al. 1991).
Semantic Activation and Memory Errors TOTs are caused by a retrieval failure at the phonological
Semantic activation during language comprehension is also the level of the representation of the word while semantic informa-
basis for memory errors, especially constructive memory errors, tion is available for retrieval. Low-frequency words are more vul-
which occur when there is false memory for material that is con- nerable to TOTs than high-frequency words, and recent use of a
ceptually related to presented material but was not actually pre- word makes it less vulnerable to TOT. Clearly, TOTs represent a
sented. For example, implications of sentences are commonly memory retrieval failure and demonstrate the interdependence
remembered as having been presented when they were not. he of memory and language processes. TOTs can be explained in
target sentence, he hungry python caught the mouse is likely to terms of impaired spreading activation from lexical to phono-
be remembered as he hungry python ate the mouse (Harris and logical representations because of weak connections between
Monaco 1978). he implication is encoded in memory as part these representations, caused by disuse. Consistent with this
of the presented sentence because it is activated during com- explanation, a TOT can be resolved by pronouncing phonolog-
prehension. What is remembered is what is computed by com- ical segments of the word, which increases phonological activa-
prehension processes, not what was actually presented. his tion (James and Burke 2000). Memory and language processes
integration of language comprehension and memory makes it are indistinct here because identical representations and pro-
extremely diicult for people to remember language verbatim cesses (spreading activation) are crucial for both memory and
and makes memory for what people said or wrote notoriously language.
unreliable. Language processes are also closely linked to working mem-
False memories based on semantic activation processes have ory. heories of working memory include a storage component
also been demonstrated in the Deese/Roediger-McDermott and a controlled attention or central executive component that
(DRM) experimental paradigm (e.g., Roediger and McDermott maintains or computes information that is the focus of attention.
1995). In the DRM paradigm, participants are asked to remem- Working memory has a limited capacity that constrains the abil-
ber a list of words (e.g., snooze, wake, dream, blanket, etc.) that ity to perform complex mental computations, including seman-
are associated with an unpresented critical word (e.g., sleep). tic and syntactic computations necessary for constructing
Participants falsely remember the unpresented critical word at linguistic representations that are the basis for language compre-
rates equivalent to the presented items; their conidence in their hension and production (Just and Carpenter 1992; Caplan and
memory accuracy is as high for critical words as for presented Waters 1999). For example, limited capacity causes reading time
words. he high rate of false memory for a critical word in this to increase at points in a sentence where diicult syntactic com-
paradigm has generally been explained by semantic activation putations are required. Moreover, a persons language ability is
of the list words during their presentation that spread to and correlated with the capacity of their working memory: Language
summated at the representation for the critical word. he high comprehension and production are better for people with large
level of activation of the critical word at the test produces a feel- rather than small working memory spans.
ing of familiarity that leads to the false recognition. Consistent he theory that working memory is a separate construct from
with this explanation, increasing the number of related words on linguistic processes that constrains language functions has been
the studied list increases the likelihood that the critical word is challenged recently by a connectionist approach to language.
falsely remembered (Robinson and Roediger 1997). here is also his approach postulates that computational eiciency in lan-
evidence that semantic activation of the critical word may have guage processing is determined by the state of the network rep-
decayed before the test, but the critical word is reactivated at the resenting linguistic knowledge, not by the capacity of a separate
test because of its association with the list (Meade et al. 2007). working memory system (MacDonald and Christiansen 2002).
Knowledge and experience inluence the state of the representa-
Memory Processes and Language tional network. For example, repetition strengthens connections
Frequency and recency of occurrence have strong efects on among representations so that they pass activation more quickly,
memory. Classic forgetting curves show that the more recent the increasing processing eiciency. Consistent with this idea, lan-
presentation of material, the better the memory for it. Frequency guage that is more frequent at either a lexical or syntactic level
or repetition improves both episodic and procedural memory. is easier to process. Within this approach, complex syntax slows
Parallel efects of frequency and recency are seen in language. reading not because it requires more working memory capacity
Words that are repeated frequently or more recently in natural but because such syntax is infrequent, which weakens connec-
language are easier to perceive and to produce. he efect of tions among relevant representations. Similarly, because linguis-
recency and frequency is demonstrated in a dramatic language tic experience increases the processing eiciency of the language
production failure known as the tip-of-the-tongue state (TOT) in network, individuals with greater language experience are pre-
which a person is temporarily unable to produce a well-known dicted to have larger verbal working memory spans than individ-
word. In the throes of a TOT, a person can produce semantic uals with less language experience. hat is, the observed relation
information about the TOT target and sometimes partial infor- between language ability and working memory span is attributed
mation about the phonology of the word, such as number of to a common cause: increased eiciency of the language network
syllables or irst phoneme, but the complete word remains because of increased language experience. his approach elimi-
maddeningly out of reach. Alternate words related to the TOT nates the architectural and computational distinction between
478
Mental Models and Language
working memory and the language system. It views working logician C.S. Peirce formulated an account of verbal reasoning
memory limitations as emerging from the architecture of the based on diagrams that were models of assertions (Peirce
language network, rather than from a ixed capacity. 193158, vol. 4). In the 1970s, cognitive scientists converged
again on the idea of mental models. hey argued that when we
Elizabeth R. Graham and Deborah M. Burke
understand discourse, we use sentence meaning and our
general knowledge in order to construct a mental model of the
situation under description (also known as a situation model).
Burke, Deborah M., Donald G. MacKay, Joanna S. Worthley, and Such a model is as iconic as possible; that is, its structure corre-
Elizabeth Wade. 1991. On the tip of the tongue: What causes word sponds to the structure of the situation it represents. Hence, the
inding failures in young and older adults? Journal of Memory and model represents each referent with a single mental token, the
Language 30: 54279.
properties of referents with properties of the tokens, and the rela-
Caplan, David, and Gloria S. Waters. 1999. Verbal working memory and
tions among referents with relations among the tokens (Johnson-
sentence comprehension. Behavioral and Brain Sciences 22: 77126.
Craik, Fergus I. M., and Endel Tulving. 1975. Depth of processing and
Laird 1983). he model captures what is common to the diferent
the retention of words in episodic memory. Journal of Experimental ways in which a possibility might occur, and so the theory is
Psychology: General 104: 26894. analogous to possible worlds semantics and to discourse
Harris, Richard J., and Gregory E. Monaco. 1978. Psychology of prag- representation theory (Kamp and Reyle 1993). However, these
matic implication: Information processing between the lines. Journal approaches postulate that representations are logically correct,
of Experimental Psychology: General 107: 122. whereas mental models have inbuilt shortcomings as a result of
James, Lori E., and Deborah M. Burke. 2000. Phonological priming the constraints of the human mind (see the following).
efects on word retrieval and tip-of-the-tongue experiences in young As an example of a model, consider a simple spatial descrip-
and older adults. Journal of Experimental Psychology: Learning,
tion (see Byrne and Johnson-Laird 1989):
Memory and Cognition 26: 137891.
Just, Marcel A., and Patricia A. Carpenter. 1992. A capacity theory of com- he oice door is on the left of the elevator. he exit door is on
prehension: Individual diferences in working memory. Psychological the right of the elevator.
Review 99: 12249.
MacDonald, Maryellen C., and Morten H. Christiansen. 2002. Reassessing We can construct a mental model of the spatial layout, which is
working memory: Comment on Just and Carpenter (1992) and Waters analogous to this diagram:
and Caplan (1996). Psychological Review 109: 3554.
MacKay, Donald G., Lori E. James, Jennifer K. Taylor, and Diane E. oice-door elevator exit-door
Marian. 2007. Amnesic H. M. exhibits parallel deicits and sparing
in language and memory: Systems versus binding theory accounts. he diagram is iconic in that its layout corresponds to a plan of
Language and Cognitive Processes 22: 377452. the three entities, but a mental model of the layout is likely to
MacKay, Donald G., Meredith Shafto, Jennifer K. Taylor, Diane E. Marian,
represent the doors rather than to use verbal labels, which occur
Lise Abrams, and Jennifer R. Dyer. 2004. Relations between emotion,
in the diagram for simplicity.
memory, and attention: Evidence from taboo Stroop, lexical decision,
and immediate memory tasks. Memory and Cognition 32: 47488.
Suppose that the discourse continues:
Meade, Michelle L., Jason M. Watson, David A. Balota, and Henry L. A man is standing in front of the oice door. A woman is
Roediger, III. 2007. he roles of spreading activation and retrieval standing in front of the exit door.
mode in producing false recognition in the DRM paradigm. Journal of We incorporate this information in our model:
Memory and Language 56: 30520.
Rapp, Brenda, and Matthew Goldrick. 2000. Discreteness and interac- oice-door elevator exit-door
tivity in spoken word production. Psychological Review 107: 46099.
Robinson, Kerry J., and Henry L. Roediger, III. 1997. Associative proman woman
cesses in false recall and false recognition. Psychological Science
8: 2317.
It follows that the man is on the left of the elevator and the
Roediger, Henry L., III, and Kathleen B. McDermott. 1995. Creating
woman is on the right of the elevator. No alternative model of
false memories: Remembering words not presented in lists. Journal
of Experimental Psychology: Learning, Memory, and Cognition the discourse is a counterexample to this conclusion, and so it
21: 80314. is logically valid; that is, it must be true, given the truth of the
Tulving, Endel, and Daniel L. Schacter. 1990. Priming and human mem- premises.
ory systems. Science 247: 3016. Mental models govern our memory for discourse. Suppose
that this discourse continues:
MENTAL MODELS AND LANGUAGE he man standing in front of the oice door was using a cell phone.
How do we represent discourse, and how do we reason from Later it states:

its contents? One answer to both questions is that we rely on
he man using the cell phone was wearing a suit.
mental models of the situations that discourse describes. he
Scottish psychologist Kenneth Craik (1943, 61) wrote that if Both assertions can be used to update our model. In an unex-
we construct a small-scale model of the world, we can use it pected memory test, as Alan Garnham (1987) showed, we are not
to make sensible decisions about our actions. Several thinkers likely to recall which of the following sentences occurred in the
anticipated him, and the nineteenth-century American discourse:
479
Mental Models and Language
he man standing in front of the oice door was wearing a suit. he mental models of the irst assertion are:
he man using a cell phone was wearing a suit. king ace
not(king) ace
We forget the sentences and recall only the situation represented
in our model. Hence, given an assertion that follows at once from where not represents negation. he second assertion elimi-
our model, we are also prone to suppose that it too occurred in nates the second of these possibilities, and so it seems that there
the discourse (Bransford, Barclay, and Franks 1972). is an ace in the hand. However, the connective or else means that
Models of discourse can be abstract. hey can combine both at the very least, one of the propositions that it connects may be
iconic elements and symbolic elements, such as negation. We false. Given, say, the falsity of if theres a king in the hand then
might translate negation into a set of alternative airmative pos- theres an ace in the hand, we realize that its possible that theres
sibilities (Schroyens, Schaeken, and dYdewalle 2001). We repre- a king in the hand without an ace. So, even though the second
sent the proposition that the man isnt in front of the exit door as assertion tells us that there is a king in the hand, no guarantee
a set of airmative possibilities: He is in front of the elevator, or exists that theres an ace, too. he inference is fallacious. his
he is in front of the oice door, or But this representation calls analysis relies only on two well-attested facts about our under-
for a procedure that interprets a set of models as alternatives. As standing: 1) or else allows that one of the clauses it connects is
Peirce realized, this machinery is not iconic but symbolic. And, false, and 2) the falsity of a conditional allows that its if-clause
often, there are too many airmative possibilities to represent can be true and its then-clause false. A computer program imple-
negation in this way. menting the principle of truth led to the discovery of a variety of
If we have a dynamic model of what happens in a story, then illusions, and subsequent studies have corroborated their occur-
changes in location should afect our ease of accessing referents. rence (Johnson-Laird 2006).
Experiments have shown that if, say, the protagonist in a story When you read the earlier description of the man and woman
walks through a door into another room carrying an object, then standing in front of the doors, you might have formed a visual
it is easier for us to access this object and the entities in the new image of the layout. Spatial relations are usually easy to visualize.
room than those in the room the protagonist has left. It takes us You might, therefore, assume that mental models are nothing
longer to respond to questions or to a probe word; and these more than visual images. his assumption is wrong. Some rela-
efects occur for stories (e.g., Glenberg, Mayer, and Lindem tions, such as Pam is better than Viv, are impossible to visual-
1987; Rinck and Bower 1995), movies (e.g., Magliano, Miller, ize. You can imagine, say, Pam as further up a ladder than Viv is,
and Zwaan 2001), and virtual reality on a computer screen but nothing in your image or in any possible image can make
(Radvansky and Copeland 2006). We therefore maintain a model explicit the meaning of better than. Not all relations are rooted in
of discourse that has perceptual and spatial features that parallel a sensory modality or have a spatial interpretation. Some rela-
those in models that we construct from witnessing events, and tions, such as the cat is cleaner than the dog, are easy to visualize
the model may rely on many of the same brain areas underlying but do not invoke a spatial representation. Reasoning with these
perception. visual relations, which elicit images rather than spatial models,
he principle of truth is an interpretative assumption gov- takes longer than reasoning with other sorts of relation, and it
erning mental models (Johnson-Laird 2006). It postulates that activates a region of the brain underlying vision.
they represent only what is true according to the discourse. As a he hypothesis that we represent discourse in mental models is
consequence, an assertion, such as uncontroversial in psycholinguistics, though not all accounts
stress the iconicity of models (cf. Kintsch 1988; and Gernsbacher,
he man is wearing a suit or else the woman is wearing a suit, 1990). We make a dynamic representation of entities, their prop-
but not both. erties, and the relations among them. he heart of the problem
is represented in separate mental models of the two possibilities, in building a mental model is to recover the appropriate referent
depicted here on separate lines: for each expression. Speakers refer back to entities that they have
already introduced in the discourse, and they can use diferent
man wears suit noun phrases, demonstratives, or pronouns to do so. he correct
woman wears suit interpretation of such anaphora depends on many factors. Given
a sentence like he man confessed to the priest because he wanted
Again, these sentences stand in for mental models. What the absolution, we understand that he refers to the man rather than
models do not represent, at least explicitly, is the falsity of the the priest. We make this attribution because we know the purpose
woman is wearing a suit in the irst possibility and the falsity of of confession, because we have a preference for locating the ante-
the man is wearing a suit in the second possibility. he principle cedents of pronouns in the subjects of clauses, and because we also
of truth reduces the load on our memory, and it seems innoc- have a preference for a parallel grammatical role of antecedent and
uous. Yet, it can lead us into the illusion that we understand a anaphor (Stevenson, Nelson, and Stenning 1995).
description that, in fact, is beyond us. In computational linguistics, centering theory shows how the
A striking illusion of this sort occurs with the description: focus on a local segment of discourse determines the anteced-
If theres a king in the hand then theres an ace in the hand
ents of anaphora, especially pronouns (e.g., Grosz, Joshi, and
or else theres an ace in the hand if there isnt a king in the hand.
Weinstein 1995; Webber et al. 2003) Another factor is the seman-
tic diference between antecedent and anaphor (Almor 1999).
here is a king in the hand. he information load on interpretation increases when, unlike
480
Mental Models and Language Mental Space
normal cases, the anaphor is more speciic than its antecedent, Rinck, Mike, and G. Bower. 1995. Anaphor resolution and the focus of
for example, He had a beer, and the Guinness tasted good. attention in situation models. Memory & Language 34: 11031.
Within the framework of mental models, the most comprehen- Schroyens, Walter, W. Schaeken, and G. dYdewalle. 2001. he process-
sive account of anaphora is that of Garnham and his colleagues ing of negations in conditional reasoning: A meta-analytic case study
in mental model and/or mental logic theory. hinking & Reasoning
(cf. Cowles and Garnham 2005). his theory takes into account
7: 12172.
all the preceding factors, but also postulates that a crucial factor
Stevenson, Rosemary J., A. W. R. Nelson, and K. Stenning. 1995. he role
is the number of potential antecedents for an anaphor. In looking of parallelism in strategies of pronoun comprehension. Language and
backward, an anaphor should have enough content to pinpoint its Speech 38: 393418.
antecedent among the candidates. But the choice of a particular Webber, Bonnie, M. Stone, A. Joshi, and A Knott. 2003. Anaphora and
anaphor also signals the future direction of the discourse subse- discourse structure. Computational Linguistics 29: 54587.
quent content may provide the information needed to pinpoint
the antecedent. And the content in the anaphor may also signal MENTAL SPACE
a shift in theme. So, the theory is Janus-faced, looking both back-
ward for antecedents and forward for thematic shifts. No current What Is a Mental Space?
theory, however, has led to a computer program that constructs Mental spaces are partial assemblies constructed as we think
models for anything more than a fragment of the language. and talk, for purposes of local understanding and action. hey
contain elements and are structured by frames and cognitive
P. N. Johnson-Laird models. Mental spaces are connected to long-term schematic
knowledge, such as the frame for walking along a path, and to
WORKS CITED AND SUGGESTIONS FOR FURTHER READING long-term speciic knowledge, such as a memory of the time you
Almor, Amit. 1999. Noun-phrase anaphora and focus: he informational climbed Mount Rainier in 2001. he mental space that includes
load hypothesis. Psychological Review 106: 74865. you, Mount Rainier, and your climbing the mountain can be acti-
Bransford, John D., J. R. Barclay, and J. J. Franks. 1972. Sentence vated in many diferent ways and for many diferent purposes.
memory: A constructive versus an interpretive approach. Cognitive You climbed Mount Rainier in 2001 sets up the mental space in
Psychology 3: 193209. order to report a past event. If you had climbed Mount Rainier
Byrne, Ruth M. J., and P. N. Johnson-Laird. 1989. Spatial reasoning. sets up the same mental space in order to examine a counter-
Journal of Memory and Language 28: 56475. factual situation and its consequences. Max believes that you
Cowles, Wind, and A. Garnham. 2005. Antecedent focus and conceptual climbed Mount Rainier sets it up again, but now for the purpose
distance efects in category noun-phrase anaphora. Language and
of stating what Max believes.
Cognitive Processes 20: 72550.
Mental spaces are constructed and modiied as thought and
Craik, Kenneth. 1943. he Nature of Explanation. Cambridge: Cambridge
University Press.
discourse unfold and are connected to each other by mappings,
Garnham, Alan. 1987. Mental Models as Representations of Discourse and such as identity and analogy. It has been hypothesized that
Text. Chichester: Ellis Horwood. at the neural level, mental spaces are sets of activated neuronal
. 2001. Mental Models and the Interpretation of Anaphora. Hove, assemblies and that the connections between elements cor-
East Sussex: Psychology Press. A major statement of the theory of men- respond to coactivation bindings. On this view, mental spaces
tal models for discourse. operate in working memory but are built up partly by activat-
Gernsbacher, Morton A. 1990. Language Comprehension as Structure ing structures available from long-term memory. Connections
Building. Hillsdale, NJ: Erlbaum. link elements across spaces without implying that they have the
Glenberg, Arthur M., M. Meyer, and K. Lindem. 1987. Mental models same features or properties. When I was six, I weighed ifty pounds
contribute to foregrounding during text comprehension. Memory &
prompts us to build an identity connector between the adult and
Language 26: 6983.
the six-year-old despite the manifest and pervasive diferences.
Grosz, Barbara, A. Joshi, and S. Weinstein. 1995. Centering: A frame-
work for modelling the local coherence of discourse. Computational
Mental spaces are built up dynamically in working memory but
Linguistics 21: 20326. can become entrenched in long-term memory.
Johnson-Laird, Philip N. 1983. Mental Models. Cambridge: Harvard An expression that names or describes an element in one
University Press. mental space can be used to access a counterpart of that element
. 2006. How We Reason. Oxford: Oxford University Press. in another mental space (access principle).
Kamp, Hans, and U. Reyle. 1993. From Discourse to Logic. Dordrecht, the
Netherlands: Kluwer. Exploring Mental Spaces
Kintsch, Walter. 1988. he role of knowledge in discourse compre- In the 1970s, it became clear that grammatical and semantic
hension: A construction-integration model. Psychological Review structure provide evidence for general features of human con-
95: 16382. ceptual systems and operations.
Magliano, Joseph P., J. Miller, and R. A. Zwaan. 2001. Indexing space and
Logical phenomena, such as quantiier scope, anaphora,
time in ilm understanding. Applied Cognitive Psychology 15: 53345.
opacity, and presupposition had been largely the province of
Peirce, Charles S. 193158. Collected Papers of Charles Sanders Peirce.
analytic philosophy. Bypassing the mind/brain, semantics was
8 vols. Ed. C. Hartshorne, P. Weiss, and A. Burks. Cambridge: Harvard
University Press. framed in terms of an external theory of truth and reference.
Radvansky, Gabriel A., and D. E. Copeland. 2006. Walking through cognitive linguistics embarked on a diferent course, plac-
doorways causes forgetting: Situation models and experienced space. ing mental constructs at the forefront of the study of language.
Memory & Cognition 34: 11506. he initial motivation for mental space theory (Fauconnier
481
Mental Space Merge
[1985] 1994, 1997) was that it provided simple, elegant, and gen- WORKS CITED AND SUGGESTIONS FOR FURTHER READING
eral solutions to problems such as referential opacity or pre- Coulson, Seana. 2001. Semantic leaps. Cambridge, UK: Cambridge
supposition projection that had baled logicians and formal University Press.
linguists. Opacity results from the application of the access prin- Cutrer, M. 1994. Time and tense in narratives and everyday language.
ciple across mental spaces as discourse unfolds. What emerged Ph.D. diss., University of California, San Diego.
was a uniied cognitively based approach to anaphora, presup- Dancygier, Barbara. 1998. Conditionals and Prediction. Cambridge:
position, conditionals, and counterfactuals. Additionally, the Cambridge University Press.
gestural modality of signed languages revealed other ways in Dancygier, Barbara, and Eve Sweetser. 2005. Mental Spaces in
which mental spaces could be set up and operated on cognitively Grammar: Conditional Constructions. Cambridge: Cambridge
and physically. University Press.
Dinsmore, J. 1991. Partitioned Representations. Dordrecht, the
Shortly thereafter, J. Dinsmore (1991) developed a powerful
Netherlands: Kluwer.
approach to tense and aspect phenomena, based on mental
Epstein, Richard. 2001 he deinite article, accessibility, and the con-
space connections. he approach was pursued and extended struction of discourse referents. Cognitive Linguistics 12: 33378.
in fundamental ways by M. Cutrer (1994), who made it possible Fauconnier, Gilles. [1985] 1994. Mental Spaces. Cambridge: Cambridge
to understand the role of grammatical markers as prompts to University Press.
deploy vast networks of connected mental spaces. Further gen- . 1997. Mappings in hought and Language. Cambridge: Cambridge
eralizations were achieved in areas exempliied by the diverse University Press.
contributions to Spaces, Worlds, and Grammar (Fauconnier and Fauconnier, Gilles, and Eve Sweetser. 1996. Spaces, Worlds, and
Sweetser 1996). Sophisticated research continues to be done in Grammar. Chicago: University of Chicago Press.
all of the areas where mental space theory was irst applied, in Fauconnier, Gilles, and Mark Turner. 2002. he Way we hink. New
York: Basic Books.
particular on conditionals (see Dancygier 1998; Dancygier and
Huumo, Tuomas. 1996. A scoping hierarchy of locatives. Cognitive
Sweetser 1996, 2005), scoping phenomena on locative and tem-
Linguistics 7: 26599.
poral domains (see Huumo 1996), grammar of sign languages
Liddell, Scott K. 2003. Grammar, Gesture, and Meaning in American Sign
(see Liddell 2003), discourse (see Epstein 2001), and frame shift- Language. Cambridge: Cambridge University Press.
ing (see Coulson 2001). But at the same time, there has been an Van Hoek, Karen. 1997. Anaphora and Conceptual Structure. Chicago:
explosion of research triggered by the discovery of wide-ranging University of Chicago Press.
phenomena whereby mental spaces are assembled, connected,
and constructed within networks of conceptual integration (see
conceptual blending). his area of research links linguis- MERGE
tic and nonlinguistic phenomena in systematic ways that begin
Merge is the primitive combinatorial operation in the most recent
to explain how and why there can be imaginative emergent
version of transformational grammar known as minimal-
structure in human thought in its everyday manifestations,
ism. In its most austere variety, merge is a generalized transforma-
as well as in its most original and singular sparks of creativity.
tion that simply turns its input elements into a set with the input
Mental Spaces in Discourse: A Simple Example elements as members (set-merge). Unlike the earlier government
Suppose the current president of our country is Nick, and that and binding model of the principles and parameters the-
someone says: ory, Noam Chomskys (1995) minimalist model does not assume
a deep structure (see underlying structure) representation
hirty years ago, the president was a baby. as a starting point of the derivation; instead, syntactic computation
he base mental space, B, corresponds to the time at which the starts out from individual words. Merge combines words as well as
statement is made and contains an element a which ills the syntactic objects it has already formed in a recursive manner (see
role president in a political frame and has the name Nick. he recursion, iteration, and metarepresentation), gener-
space-builder thirty years ago sets up a new space M relative to ating an ininite array of discrete expressions with a hierarchical
the base (30 years before now); a in B has a counterpart a in constituent structure. In principle, merge can freely apply
the new space M; the president identiies a in B, and can therefore to elements available to it, but its application is constrained by
access its counterpart a in M. he property baby is assigned principles of computational eiciency and by output conditions
to a in M. he sentence is interpreted as saying that Nick was a imposed by external systems of sound/gesture and meaning.
baby 30 years ago. movement is construed as merge of a syntactic object with a
he expression the president, however, can equally well be syntactic object contained in it, whence the term internal merge
construed as directly identifying an element b in M: It ills the (vs. external merge; see Chomsky 2004). Consider, for instance,
role president in the political frame for M. he property baby the derivation of the passive sentence (1a), where the underly-
is now assigned to b. he sentence is now interpreted as saying ing object is moved to the subject position. Here the expression
that a baby was president 30 years ago. (1b), constructed by recursive applications of merge, undergoes
It is an empirical fact that the example sentence does merge with its subset {a, house}, yielding (1c). (2) is a tree dia-
indeed have the two interpretations, and this fact, like many gram representation of (1c).
others, follows from the accessing principles of mental space
(1) a. A house will be built
conigurations.
b. {will, {be, {built, {a, house}}}}
Gilles Fauconnier c. {{a, house}, {will, {be, {built, {a, house}}}}}
482
Merge Metaphor
(2) . 2004. Beyond explanatory adequacy. In Structures and

Beyond: he Cartography of Syntactic Structures, ed. Belletti, 10431.
a house will METALANGUAGE

be
Languages may be used to talk about languages. In the irst sen-
built
tence of this entry, English is used to mention (or talk about) lan-
a house guages. (See use and mention.) In general, a language under
discussion, on a given occasion, is called the object language,
he syntactic object {a, house} has two copies or occurrences, but and the language being used in the discussion (to talk about the
it is realized phonetically only as a member of the (largest) set in object language) is the metalanguage.
(1c). Note that the two occurrences resulting from movement are Alfred Tarski proved that no classical language (language with
not distinct syntactic objects. Rather, the same syntactic object classical logic) can express its own truth predicate. Accordingly,
is a member of two sets, where one set is properly contained in the semantics of a classical language (with standard syntax)
the other. can only be done in a richer metalanguage.
While there is currently no agreement regarding restrictions
J. C. Beall
on merge, or how to deduce them, the mainstream view holds
that merge is binary, always taking exactly two input elements
SUGGESTION FOR FURTHER READING
(entailing strict binary branching in syntactic trees), and it can-
not alter set-membership relations that it has established before. Tarski, Alfred. 1990. Logic, Semantics, Metamathematics: Papers from
It is unresolved whether the output of merge should be enriched 19231938. 2d ed. Ed. J. Corcoran. Indianapolis, IN: Hackett.
to encode any asymmetry between its operands. On Chomskys
(1995) original deinition, merge forms a set with the following
METAPHOR
two members: the set of the input elements and the word func-
tioning as the head of the constituent (also known as label), his term derives from the Greek metapherein, indicating a
thereby representing the asymmetry in the choice of the input transfer of meaning from one linguistic expression to a second,
element that projects (see x-bar theory). Following this for- semantically diferent expression. According to standard dic-
mulation, (1c) can be rewritten as (3). he structure in (3) is rep- tionary deinitions, a metaphor is a igure of speech in which a
resented by the labeled tree diagram in (4). word or phrase that ordinarily designates one thing is used to
designate another to which it is not literally applicable. hus,
(3) {will, {{a, {a, house}}, {will, {will, {be, {be, {built, {built, {a, {a,
in metaphor, there is an implicit comparison between unlike
house}}}}}}}}}}
things, as in the Shakespearean metaphor Juliet is the sun.
(4) will Employing the terminology introduced by I. A. Richards (1936),
the target concept (Juliet) is labeled the topic or tenor, the met-
aphoric source (sun) is the vehicle and the emergent meaning,
a will the ground (see also source and target). Despite this seem-
ingly simple deinition, the task of actually identifying metaphor
a house will be has proven to be a diicult enterprise because metaphors can be
implied (there is a vehicle but no speciied topic), dead (a usage
be built
so conventionalized that the metaphoric transference is no lon-
ger actively recognized), mixed (the conlation of two distinct
built a
metaphors), submerged (in which the vehicle is implied but not
stated), or extended (suggested throughout a text). his diiculty
a house
has been especially apparent in recent attempts with computer
applications, such those based on the identiication of metaphor
Another asymmetry is that between an adjunct (e.g., an adver-
in text corpora or in computer systems capable of generating
bial) and the host it is adjoined to (e.g., a verb phrase). Chomsky
representations of metaphorical meaning.
(2004) suggests that when an adjunct and its host undergo
merge, the result is an ordered pair Adjunct, Host (also known
Metaphor as a Linguistic Phenomenon
as pair-merge).
THE INFLUENCE OF ARISTOTLE. he classic approach, as presented
Balzs Surnyi in the dictionary deinition, treats metaphor as a rhetorical
trope in which language is used in a way other than what may be
WORKS CITED AND SUGGESTIONS FOR FURTHER READING considered normal or literal. One can see in this approach the
Chametzky, Robert A. 2000. Phrase Structure: From GB to Minimalism. inluence of Aristotle. As Umberto Eco put it, [O]f the thousands
Oxford: Blackwell. and thousands of pages written about the metaphor, few added
Chomsky, Noam. 1995. he Minimalist Program. Cambridge, MA: MIT anything of substance to the irst two or three fundamental con-
Press. cepts stated by Aristotle (1984, 88). Whether or not one agrees
483
Metaphor
with Eco, it is clear that interpretations of Aristotelean thoughts which the novel meaning of a metaphor is not based on identi-
have inluenced much of the thinking on the topic for the past fying a shared set of (possibly marginal) meanings of the words
several millennia, most notably a presumed distinction between being compared. Meaning, he argues, is generated by the inter-
literal and nonliteral language and in framing the basic theo- action between a principal subject (the more literal usage of the
retical issues that have guided much of the subsequent schol- word, similar to the topic) and the complex of associations con-
arship: understanding the cognitive process that permits the nected with a subsidiary subject (analogous to the vehicle). he
stretching of meaning from the literal to the metaphoric and in process is interactive inasmuch as reciprocal action between the
determining the pragmatic reasons why metaphor would be principal and subsidiary subject selects, emphasizes, suppresses
employed when literal counterparts could have been used. and organizes features of the principal subject by implying state-
In his Poetics and in Rhetoric, Aristotle situated metaphor ments about it that normally apply to the subsidiary subject
in the realm of language, a position that has been the basis for (Black 1962, 46). he outcome is the creation of novel meaning
subsequent theories but has been contested since 1980 by theo- formed by a parallel implicational complex in which the topic
rists working within cognitive linguistics (described later). can be viewed in a radically diferent light and in which novel
Aristotles basic premise is that with metaphor, one word (or emergent meanings can be created between words. Despite the
expression) is substituted for another. He described several cat- popularity of this general approach, it, too, has been subject to
egories of substitution, though the forms most studied are what various criticisms, notably regarding the ambiguity in deining
we would call today nominal and predicative metaphor, the the theoretical terms employed and in determining which of the
former in which one noun is substituted for another and the lat- terms is the principal and which is the subsidiary subject.
ter in which the substitution is of verbs. Aristotle provided some Subsequent psycholinguistic theories have attempted
explanation of the process involved in metaphor comprehen- to describe cognitive mechanisms that are consistent with
sion, namely, an innate tendency to see likeness in objects and the interactive approach. Salience imbalance theory is a vari-
events that are, on the face of it, dissimilar (or in which the sim- ant of traditional comparison models aimed at describing why
ilarity is not transparent). Moreover, he provided some reasons some statements are seen as literal and others as metaphoric by
why metaphor might be employed, primarily, to serve a stylistic assuming that the features shared by topic and vehicle difer in
and aesthetic function wherein the listener, forced to decode the relative level of salience: Literal statements are those in which
message, experiences a pleasurable reaction; and secondarily, the shared features are salient to both terms, whereas with meta-
to serve the creative cognitive function of providing a name to phor, they are salient to the vehicle but not the topic. Domain
things that do not have proper names of their own. here has interaction theory is an extension of a computational model of
been considerably elaboration of the seminal ideas of Aristotle in analogy and assumes that metaphor involves the inding of
the twentieth century, despite what some would consider a fatal similarity both within and between the conceptual domains
law in the logic of substitution as the basis for metaphor: If, for evoked by words. hus, a metaphor such as George Bush is a
example, with the nominal Shakespearean metaphor Juliet is hawk would be comprehended by inding a spot in semantic
the sun the vehicle, sun is a substitution for another word that space that would be consistent with the analogy George Bush
falls within the same genus= as the topic, Juliet, what could is to world leaders as hawks are to birds. In this model, ease
that word be? of comprehension is a function of the ease of inding a shared
similarity (i.e., in determining ways in which Bush is similar to
TWENTIETH-CENTURY ELABORATIONS. Two interpretations con- a hawk), whereas a sense of metaphor aptness increases as
sistent with Aristotle have been most inluential. According to the distance between conceptual domains, such as leaders and
the substitution position, the transfer from vehicle to topic is an birds, becomes greater. Finally, structure-mapping theory, also
ornamental means of presenting some intended literal meaning, emerging from computational work in analogy, is based on iden-
so that when one states George is a wolf, it is merely an aesthetic tifying a system of shared relations between the target and source
way of saying that George is ierce. he comparison approach is domains and not by merely identifying a feature shared by topic
less ornamental and closer to the second function described by and vehicle. Although there are psycholinguistic studies that
Aristotle in that, here, the listener must construct a way in which support each theory, each is based ultimately on inding similar-
properties of the vehicle are applicable to the topic. here have ity between the words presented as topic and vehicle and, conse-
been several variants of comparison theories proposed in the quently, heir to all of the criticisms of such models (reviewed, for
literature over the past 50 years, but all include the notion that instance, in Glucksberg 2001).
the comprehension process involves the identiication of a rele- Sam Glucksberg proposes a novel solution that rejects simi-
vant set of preexisting features shared by topic and vehicle. hese larity as the basis for metaphor by arguing that in metaphor, one
theories all have shortcomings, including the failure to encom- does not look for a similarity between topic and vehicle (i.e., by
pass the creation (and not mere identiication) of similarity, the treating the comparison as an implicit simile). He avers, rather,
problem in identifying the mechanisms that would select the that metaphor should be understood as a class inclusion state-
features assumed to be important for interpretation, and the fail- ment analogous in how we treat such statements as my dog is a
ure in such theories to explain the asymmetry in meaning that collie. He argues, and has presented convincing evidence, that
occurs when the topic and vehicle are reversed, as occurs when with metaphor, the vehicle has dual reference (both as the lit-
one contrasts my lawyer is a shark with my shark is a lawyer. eral object and as indicative of higher-order categories) and that,
Max Black attempted to address at least some of these short- in comprehension, one classiies the topic to the category sug-
comings by postulating an interactive theoretical perspective in gested by the vehicle. hat is, in a metaphor such as my lawyer is
484
Metaphor
a shark, the vehicle shark stands for or exempliies a category CONCEPTUAL METAPHORS AND THE CONTEMPORARY THEORY OF
to which lawyers could be assigned (such as aggressive, preda- METAPHOR. he research described in the previous sections has
tory, tenacious entities). In more recent expansion of the theory, undercut the diference between literal and nonliteral language
he and his colleagues have indicated how the topic plays a role (see also Gibbs 1994), a challenge extended most notably by
in identifying the appropriate category for which categorization cognitive linguists, especially by George Lakof, starting with the
is appropriate. publication in 1980 of Metaphors We Live By, co-authored with
Mark Johnson. he main thrust of this theory is that metaphors
THE PROCESS OF METAPHOR COMPREHENSION. Much of the the- are matters of thought and not merely of language, thus distin-
ory and research described here is based on oline methodology. guishing the basic mapping of a source conceptual domain to a
Beginning in the late 1970s, researchers started to examine the target conceptual domain (conceptual metaphor) from the
processing of metaphor online, measuring processes that were linguistic expression of this mapping. he true source of meta-
happening during the act of comprehension. Most of the early phor is at the conceptual level. According to this theory, con-
studies were based on the indexing of reading time or some other ceptual metaphors motivate and underlie understanding of the
measure of response latency; lately, studies have also employed world, such that most of what we call literal is, by this theory,
neurocognitive imaging techniques such as EEG and fMRI (see based on underlying metaphorical mappings. hus, conceptual
neuroimaging). Much of the initial theorizing has been based metaphors are the basis for understanding literal and nonlit-
on speech-act theory, especially as espoused in the work of eral, novel and conventional, poetic and mundane language
John Searle. Following from the distinction between literal and alike. Evidence for a conceptual metaphor, such as the mapping
nonliteral language, the assumption has concerned the process- between the conceptual domains of life and journeys (LIFE IS A
ing priority of literal meaning. According to what is now called the JOURNEY), is relected in a set of seemingly unrelated linguistic
standard pragmatic model, the model would be that the default expressions, such as His life is at an important crossroad and
processing of language would be to its literal meaning, and that She knows where she is going. Mappings elucidate the system-
those inferential processes that seek an alternative, nonliteral atic set of correspondences that exist between constituent ele-
interpretations are only triggered if one fails to ind a context- ments of the source and the target domain. For example, with
appropriate literal interpretation. hese assumptions have been the LIFE IS A JOURNEY mapping, the person is analogous to a
tested in psycholinguistic research that has concretized the stan- traveler, purposes are destinations, means are routes, diiculties
dard model: It assumes that priority to a default literal meaning are obstacles, achievements are landmarks, choices are cross-
would be demonstrated by more rapid reading (or other indices roads, and so on, allowing for novel extensions of elements from
of processing) of a metaphor in a discourse context that is consis- the source domain to elements of target concepts.
tent with its literal sense than in a context that is consistent with he theory has had widespread acceptance, and the task of
the nonliteral sense; it also assumes that one should not ind the identifying the presence and force of underlying (and hence
processing of metaphoric meaning in conditions in which the lit- unconscious) cognitive mappings has entered the debates of
eral sense of an utterance is context appropriate. linguistics, cognitive science, philosophy, literary theory, and
More than 20 years of research have failed, in the main, to sup- criticism, among other disciplines. Nonetheless, the claims in
port the predictions arising from the standard pragmatic model, the literature for an ever-increasing number of conceptual meta-
instead showing that in appropriately elaborated contexts, one phors indicate looseness in the theory that may make it incapable
can process the metaphoric sense as rapidly as the literal sense of of being disprovable and, thus, an inadequate scientiic expla-
an utterance and that, using Stroop-like procedures and speed- nation. Moreover, one testable prediction made by the theory,
accuracy analyses, the initiation of metaphoric interpretation namely, that conceptual metaphors are activated on line during
does not depend on a failure to ind an appropriate literal inter- comprehension, has not been supported consistently, with the
pretation. hese indings, though sometimes complicated by the strongest support coming from the examination of orientational
level of conventionality of the metaphoric expression, have led and temporal metaphors (e.g., Boroditsky 2000).
to a set of competing theories, all of which have some support,
including models based on the notion of resolving constraint CONCEPTUAL BLENDING. A more recent framework, proposed by
satisfaction and those that assume that the initial processing of Gilles Fauconnier and by Mark Turner, seeks to explain much
a word is at an underspeciied schematic level. An increasingly of the same linguistic data discussed in the conceptual meta-
popular processing model by Rachel Giora attempts to main- phor literature and shares with that approach the assumption
tain processing priority but places the emphasis not on literal that metaphor is a conceptual, not a linguistic, phenomenon. In
meaning (as Searle had it) but on the saliency of a word (as con- contrast with conceptual metaphor theory, however, concep-
cretized by familiarity, conventionality, and frequency of use). tual blending theory is not limited to entrenched conceptual
According to this theory, one is obligated to process the salient relations or to the unidirectional mapping from source to target
sense of a word (or expression), regardless of context; contextual or the mapping between only two mental domains. Rather, the
constraints can boost the activation and meaning access of less basic units are mental spaces representing particular scenarios
salient meanings but will not do so at a cost to the activation and recruited from the knowledge of speciic domains constructed
access of the more salient sense. he ultimate success and test of while thinking or talking about situations. As such, the theory
these various theories are being contested, more often these days emphasizes blending as an on-line process, which both instanti-
with neuroimaging techniques that give a more ine-grained ates entrenched metaphors and can yield short-lived and novel
analysis of online processing than available in the past. conceptualizations. his theory, too, has entered the literatures
485
Metaphor Metaphor, Acquisition of
of a number of diverse disciplines, and although the on-line pro- Richards, I. A. 1936. he Philosophy of Rhetoric. New York: Oxford
cessing implications of the theory are still ongoing, the tests to University Press.
date have been encouraging, often employing brain neuroimag-
ing techniques such as event-related potential (ERP) measure- METAPHOR, ACQUISITION OF
ment (see Coulson 2001). Nonetheless, this theory has also been
subjected to criticisms that it, too, is incapable of being disproved Metaphor is a pervasive aspect of human language (Lakof and
and that it is too indiscriminate, inasmuch as almost anything Johnson 1997). Metaphor also plays a central role in abstract
that enters working memory can be considered a blend. thought by structuring concepts (Gibbs 1994) and leading to
conceptual change (Gentner and Wolf 2000). As such, its rudi-
Evaluation mentary manifestation at the early ages and its continued growth
Treatments of metaphor as a linguistic and as a cognitive phe- over developmental time has been the focus of scientiic inquiry
nomenon coexist today, in much the same way that two species for several decades. Research on the acquisition of metaphor
of hominid have coexisted in our evolutionary history. It remains followed two main lines of inquiry, each bearing on a diferent
to be seen whether the ofspring of one approach will disap- deinition of the term. One approach deined metaphor as a simi-
pear. Despite the diferences, there is a convergence between larity comparison between the perceptual features of objects or
approaches that should not be undervalued: his convergence actions, and explored how early children would understand and
includes the undercutting of the distinction between literal and produce these so-called perceptual metaphors (e.g., butterly is
nonliteral language; a recognition by both approaches of the (like) rainbow). Another approach deined metaphor as a con-
need to consider the richness of examples coming from liter- ceptual-linguistic mapping between the structural features of two
ary or philosophical analysis, as well as the controlled rigor that disparate knowledge domains a source domain, which serves
comes from experimental studies; an emphasis on the role of as the source of vocabulary and conceptual inferences, and a
cognition and pragmatics (see Carston 2002, for instance, for target domain, to which vocabulary and inferences are extended
an exposition from a relevance theory perspective); and metaphorically, and it examined the age at which children begin
the growing sentiment (however conceptualized) that the con- to develop an integrated understanding of such structural meta-
struction of metaphoric meaning is lexible and involves more phors (e.g., time is motion along a path) as an amalgam of both
of an active on-line interpretive process and less of a mere source and target domain meanings.
arousal of entrenched meaning and that, ultimately, the battle- Following is a brief summary of the developmental changes
ground for theoretical supremacy (or for synthesis of the two in childrens metaphorical ability, from the early onset of simple
approaches) will depend on data generated and based in the perceptual metaphors to the later emergence of more complex
neurosciences. structural metaphors.
Albert N. Katz Metaphor as Similarity: Childrens Early Comprehension

and Production of Simple Perceptual Metaphors
WORKS CITED AND SUGGESTED FURTHER READINGS
Children can spontaneously produce a variety of perceptual
Black, M. 1962. Models and Metaphor. Ithaca, NY: Cornell University metaphors that highlight similarities between objects and events
Press. during preschool years (~ages 2.05.0; e.g., Billow 1981; Gardner
Boroditsky, L. 2000. Metaphoric structuring: Understanding time et al. 1978; Winner, McCarthy, and Gardner 1980; Winner 1979).
through spatial metaphors. Cognition 75: 128. For example, they hold up a half-peeled banana and call it a
Carston, R. 2002. houghts and Utterances: he Pragmatics of Explicit lower (Elbers 1988), place a foot in the wastebasket and call
Communication. Oxford: Blackwell.
it a boot (Winner 1979), point to a mushroom and say like ice
Coulson, S. 2001. Semantic Leaps. New York: Cambridge University
cream cone (zalkan and Goldin-Meadow 2006), or describe
Press.
Eco, U. 1984. Semiotics and the Philosophy of Language.
a ship sailing in the far distance as taking a bath (Chukovsky
Bloomington: Indiana University Press. 1968). hese early perceptual metaphors typically arise in emerg-
Fauconnier G. 1997. Mappings in hought and Language. ing symbolic play contexts, in which children irst engage in
Cambridge: Cambridge University Press. imaginative object substitutions (e.g., using a banana as if it were
Gibbs, R. 1994. he Poetics of Mind. New York: Cambridge University a phone), and later on, they express similarities between such
Press. objects explicitly in speech (banana is like a phone; Gardner
Giora, R. 2003. On Our Mind: Salience, Context and Figurative Language. et al. 1978; see also Sinclair and Stambak 1993 for more informa-
New York: Oxford University Press. tion on early symbolic play).
Giora, R., ed. 2001. Models of igurative language. Metaphor and
Children can use perceptual similarity to sort objects into
Symbol 16.3/4 (Special Issue): 145333.
categories as early as 18 months (e.g., boxes vs. balls; see Oakes
Glucksberg, S. 2001. Understanding Figurative Language. New
and Madole 2000 for a review). By preschool age, they can under-
Katz, A., C. Cacciari, R. Gibbs, and M. Turner. 1998. Figurative Language stand and make comparisons between two categorically diferent
and hought. New York: Oxford University Press. objects based on feature-based similarities (Billow 1975; Epstein
Lakof, G., and M. Johnson. 1980. Metaphors We Live By. Chicago: and Gamlin 1994; Gardner et al. 1975; Mendelsohn et al. 1984;
University of Chicago Press. Vosniadou and Ortony 1983; Winner, McCarthy, and Gardner
Ortony, A., ed. 1993. Metaphor and hought. 2d ed. New York: Cambridge 1980) and between two events based on action-based similarities
University Press. (Dent 1984). For example, when asked to pick two objects that
486
Metaphor, Acquisition of
go together, children were more likely to group a cherry lollipop commonalities: Children, at all ages, have no diiculty under-
with a toy stop sign, which was similar in shape and color, rather standing feature-based similarities between objects, but it is with
than matching it with a dissimilar object in the same category increasing age that they begin to understand cross-domain map-
(Mendelsohn et al. 1984). Similarly, when presented with event pings based on relational structure and, accordingly, produce
triads, children were more likely to pair two events that were explanations that relect this understanding.
alike (ballerina spinningtop spinning) than to match two events Others propose a developmental progression from an under-
that were of diferent types (ballerina leapingtop spinning; Dent standing of metaphor as involving only one domain to a con-
1984). ceptualization of metaphor as involving two domains (Asch
Moreover, ive-year-old children could provide similarity- and Nerlove 1960; Cicone, Gardner, and Winner 1981; Schecter
based explanations when asked about metaphorical expressions and Broughton 1991; Winner, Rosenstiel, and Gardner 1976).
that involve comparisons between objects such as a butterly is a hus, children initially focus only on the source domain of the
lying rainbow, or a cloud is like a sponge (Gardner et al. 1975; metaphorical mapping and gradually develop a more inte-
Gentner 1988; Billow 1975; Malgady 1977). For example, they grated understanding of metaphor as involving both a source
would explain the statement a cloud is like a sponge by say- and a target domain. For example, in explaining the metaphor-
ing that both clouds and sponges are round and lufy (Gentner ical statement the prison guard is a hard rock, children six
1988), or they would complete the statement he looks as gigan- to eight years of age focused exclusively on the source domain
tic as by selecting from among multiple choice alternatives meaning of the mapping and provided literal interpretations for
an ending that draws on a feature-based comparison: he looks metaphorical statements (e.g., he guard has hard muscles),
as gigantic as a double-decker cone in a babys hand (Gardner whereas older children and adults were able to consider both the
et al. 1975). source and target domain meanings of the mapping, thus pro-
hus, preschool children can both understand similarity viding explanations that captured the metaphorical meaning
comparisons between two objects or events that are perceptu- (e.g., he guard was mean and did not care about the feelings
ally alike and spontaneously produce perceptual metaphors and of prisoners; Winner, Rosenstiel, and Gardner 1976). Similarly,
explanations based on such comparisons in their early com- when asked to extend physical sensation terms onto psycholog-
munications. his ability constitutes an important milestone in ical traits (e.g., Can a person be warm/ sweet/ soft?), children
childrens language development. he ability to express simi- three to seven years of age focused only on the source domain
larities between objects and events based on shared perceptual of the metaphorical mapping and provided literal explanations
features is considered the earliest sign of metaphorical ability in for metaphorical statements (e.g., Mommy is sweet because she
young children, and accordingly, children are believed to have a cooks sweet things), whereas older children focused on both
rudimentary level of metaphorical ability as early as preschool domains simultaneously and provided explanations that treated
age (Billow 1981; Gardner et al. 1978; Vosniadou 1987; Winner metaphorical meaning as a diferent but related extension of
1979). the literal meaning (e.g., Hard things and hard people are both
unmanageable; Asch and Nerlove 1960).
Metaphor as Conceptual-Linguistic Mapping: Childrens Yet another group of researchers argue that the ability to
Comprehension and Production of Complex Structural understand structural metaphors is not determined solely by a
Metaphors childs age but by a host of other factors, such as the nature of
Childrens early ability to produce feature-based similarity com- the source or the target domain (Keil 1986), and the familiarity
parisons is considered the irst step in the development of more of the metaphorical mapping or the source domain (zalkan
complex metaphorical abilities, particularly those that involve 2007). For example, ive-year-old children can correctly map
structural comparisons between disparate domains (Gardner animate terms onto cars (e.g., the car is thirsty), but have dii-
et al. 1978; Gentner 1988; Winner 1979). Not surprisingly, chil- culty understanding metaphors that involve mappings between
drens mastery of such structural metaphors takes several more taste terms and people (e.g., she is a bitter person; Keil 1986).
years, extending well into early adolescent years (Asch and Similarly, preschool children can both understand and explain
Nerlove 1960; Vosniadou 1987; Winner, Rosenstiel, and Gardner metaphors that are structured by motion (e.g., Time lies by,
1976), and diferent researchers propose diferent views con- Ideas cross my mind; zalkan 2005, 2007) a domain that
cerning how children make this transition. structures a wide range of abstract concepts across diferent lan-
Some researchers propose a developmental progression guages of the world but have diiculty deciphering the mean-
from mappings based on feature-based similarities to map- ing of metaphors that involve extensions of object properties
pings based on relational structure in childrens metaphorical (e.g., he prison guard is a hard rock; Winner, Rosenstiel, and
abilities (Billow 1975; Gentner 1988, Gentner and Rattermann Gardner 1976). From this perspective, the development of meta-
1991, Vosniadou and Ortony 1983). For example, in explaining phorical ability shows diferent trajectories for diferent con-
the metaphorical statement a cloud is like a sponge, ive-year- ceptual domains and metaphorical mappings, based on ones
old children typically rely on feature-based similarities between knowledge of the source and/or the target domain and the famil-
the two objects (e.g., Both clouds and sponges are round and iarity of the metaphorical mapping.
lufy), while older children and adults opt for more relational In summary, research on childrens metaphor comprehen-
explanations (e.g., Both clouds and sponges contain water; sion and production shows that children can both understand
Gentner 1988). In this view, what drives development is the and spontaneously produce perceptual metaphors that involve
shift in focus from feature-based commonalities to relational similarity comparisons by preschool age. However, the ability
487
Metaphor, Acquisition of Metaphor, Information Transfer in
to understand and explain more complex metaphors, namely, zalkan, S. 2005. On learning to draw the distinction between physi-
those that involve structural mappings between diferent knowl- cal and metaphorical motion: Is metaphor an early emerging cognitive
edge domains, emerges in late childhood, somewhere between and linguistic capacity? Journal of Child Language 32.2: 128.
ages 11.0 to 14.0. Nevertheless, at the same time, childrens early . 2007. Metaphors we move by: Childrens developing under-
standing of metaphorical motion in typologically distinct languages.
metaphorical ability is strongly inluenced by the familiarity of
Metaphor and Symbol 22.2:14768.
the source and target domains of the metaphor, with more famil-
zalkan, S., and S. Goldin-Meadow. 2006. X is like Y: he emer-
iar domains and metaphorical relations leading to earlier onset
gence of similarity mappings in childrens early speech and gesture.
of metaphor comprehension and production. In Cognitive Linguistics: Foundations and Fields of Application, ed. G.
eyda zalkan Kristianssen, M. Achard, R. Dirven, and F. Ruiz de Mendoza, 22962.
Berlin: Mouton de Gruyter.
Schecter, B., and J. Broughton. 1991. Developmental relationships
WORKS CITED AND SUGGESTIONS FOR FURTHER READING between psychological metaphors and concepts of life and conscious-
Asch, S., and H. Nerlove. 1960. he development of double function ness. Metaphor and Symbolic Activity 6.2: 11943.
terms in children. In Perspectives in Psychological heory, ed. B. Kaplan Sinclair, M., and M. Stambak. 1993. Pretend play among three-year-olds.
and S. Wapner, 4760). New York: International Universities Press. Mahwah, NJ: Lawrence Erlbaum.
Billow, R. M. 1975. A cognitive developmental study of metaphor com- Vosniadou, S. 1987. Children and metaphors. Child Development
prehension. Developmental Psychology 11.4: 41523. 58: 87085.
. 1981. Observing spontaneous metaphor in children. Journal of Vosniadou, S., and A. Ortony. 1983. he emergence of the literal-meta-
Experimental Child Psychology 31: 43045. phorical anomolous distinction in young children. Child Development
Chukovsky, K. 1968. From Two to Five. Berkeley: University of California 54: 15461.
Press. Winner, E. 1979. New names for old things: he emergence of meta-
Cicone, M., H. Gardner, and E. Winner. 1981. Understanding the psy- phoric language. Journal of Child Language 6: 46991.
chology in psychological metaphors. Journal of Child Language . 1997. he Point of Words: Childrens Understanding of Metaphor
8: 21316. and Irony. Cambridge: Harvard University Press.
Dent, C. H. 1984. he developmental importance of motion infor- Winner, E., M. McCarthy, and H. Gardner. 1980. he ontogenesis of met-
mation in perceiving and describing metaphoric similarity. Child aphor. In Cognition and Figurative Language, ed. R. P. Honeck and
Development 55: 160713. R. Hofman, 34161. Hillsdale, NJ: Lawrence Erlbaum.
Elbers, L. 1988. New names from old words: Related aspects of chil- Winner, E., A. K. Rosenstiel, and H. Gardner. 1976. he development of
drens metaphors and word compounds. Journal of Child Language metaphoric understanding. Developmental Psychology 12: 28997.
15: 591617.
Epstein, R. L., and P. J. Gamlin. 1994. Young childrens comprehension
of simple and complex metaphors presented in pictures and words. METAPHOR, INFORMATION TRANSFER IN
Metaphor and Symbolic Activity 9.3: 17991.
he study of metaphor is currently dominated by concep-
Gardner, H., M. Kircher, E. Winner, and D. Perkins. 1975. Childrens
tual metaphor theory. One alternative was put forth by Amos
metaphoric productions and preferences. Journal of Child Language
2: 12541. Tversky, then further developed by Andrew Ortony and others.
Gardner, H., E. Winner, R. Bechhofer, and D. Wolf. 1978. he develop- his account begins with the idea that we understand metaphors
ment of igurative language. In Childrens Language. Vol. 1. Ed. K. by scanning entries in our mental lexicon, transferring relevant
Nelson, 138. New York: Gardner Press. features from a source to a target (see source and target). In
Gentner, D. 1988. Metaphor as structure mapping: he relational shift. some versions, the process is viewed as involving a wider range
Child Development 59: 4759. of information and components of cognitive architecture
Gentner, D., and M. J. Rattermann. 1991. Language and the career of beyond semantic features.
similarity. In Perspectives on Language and hought: Interrelations Consider the following situation. Smith monopolizes dis-
in Development, ed. S. A. Gelman and J. P. Byrnes, 22577. New
cussion in a department meeting. Afterward, Doe asks Jones
what she thought of the debate. She replies, Smith is a braying
Gentner, D., and P. Wolf. 2000. Metaphor and knowledge change. In
donkey. Using standard cognitive architecture, we might ana-
Cognitive Dynamics: Conceptual Change in Humans and Machines, ed.
E. Dietrick and A. Markman, 295342. Mahwah, NJ: Lawrence Erlbaum. lyze this as follows: Jones and Doe both have lexical entries for
Gibbs, R. 1994. he Poetics of Mind. Cambridge: Cambridge University donkey, bray, and Smith. hey also have episodic memories of
Press. the recent department meeting. he recent events are primed
Keil, F. C. 1986. Conceptual domains and the acquisition of metaphor. or partially activated (see priming, semantic; spreading
Cognitive Development 1: 7396. activation). he mention of Smith serves to further activate
Lakof, G., and M. Johnson. 1997. Philosophy in the Flesh. New York: Basic the episodic memories of Smith in the meeting. he lexical entry
Books. for bray involves such elements as produce a sound using vocal
Malgady, R. G. 1977. Childrens interpretation and appreciation of simi- chords. his serves to further activate the episodic memories
les. Child Development 48: 17348.
of vocal chord sounds in the meeting. Speciically, in conjunc-
Mendelsohn, E., S. Robinson, H. Gardner, and E. Winner. 1984. Are pre-
tion with Smith, it serves to strongly activate episodic memories
schoolers renamings intentional category violations? Developmental
of Smith speaking. Following principles of conversational
Psychology 20.2: 18792.
Oakes, L. M., and K. L. Madole. 2000. he future of infant categoriza- implicature, Doe assumes that Jones is making some positive
tion research: A process-oriented approach. Child Development contribution to the conversation. hus, Doe looks for new infor-
71.1: 11926. mation in Joness statement. here is no new information in what
488
Metaphor, Information Transfer in Metaphor, Neural Substrates of
we have isolated thus far that Smith used his vocal chords to Indeed, they see many conceptual metaphors as lexicalized. For
make a sound. he new information comes with distinctive fea- example, pass away just has die as one of its literal (lexical-
tures of the metaphorical source. Speciically, braying does not ized) meanings. It does not operate metaphorically.
apply to every use of vocal chords. It applies only to a particu- One obvious advantage of this account is that it explains the
lar sort of nonlinguistic thus, meaningless and inarticulate prominence of mixed metaphors. Consider a sentence such as I
sound. Doe synthesizes this information in working memory. tapped into the good life on the road to acing my degree. Some
He understands, roughly, that (in Joness view) Smiths speech elements are lexicalized here. Others are interpreted metaphori-
was meaningless and inarticulate. cally, but only to the extent required by context. In contrast,
hus far, however, the analysis does not distinguish the conceptual metaphor theory might lead us to expect greater
understanding of metaphor from that of literal statements. In consistency in the use of standard metaphorical mappings. he
both cases, there is a complex synthesis of lexical and episodic present account does have more diiculty explaining consis-
information in working memory; this leads to contextually rel- tency when it does occur, as when someone says I followed the
evant inference. What, then, is the diference between a meta- straight and narrow path to the reach my destination a degree.
phorical statement and a literal one? However, it may be possible to account for such consistency by
One account begins by making metaphor a matter of inter- ordinary processes of priming, both current and historical (see
pretation, rather than a matter of some intrinsic linguistic prop- Hogan 2002).
erty. Speciically, a speaker intends an utterance metaphorically One future task is to develop this account in terms of neural
when he or she intends the addressee to interpret the utter- substrates. Consistent with the preceding analysis, neurosci-
ance metaphorically. What, then, constitutes metaphorical entiic research indicates that there is no sharp metaphorical/
interpretation? literal division. Certain interpretive tasks demand greater activa-
Our mental lexicons are organized into clusters of infor- tion of a broad range of meanings before selection. hese tasks
mation bearing on particular objects and types of objects (see often involve metaphorical interpretation, but not invariably
schema, prototype ). his information is arranged hierar- (see metaphor, neural substrates of). Currently, we are
chically. here are certain things that we take to be more cru- not in a position to examine semantic processing in a suiciently
cial or deinitive features of a given type of object. For example, ine-grained way to consider the processes posited here. We may
being made from milk is a more important property of cheese distinguish diferent categories of information (e.g., perception-
than being white or yellow. Moreover, a range of high-level related versus motor-related; see semantics, neurobiology
properties are default properties. If a default does not apply, of), but not precise features, defaults, and so on. Possibilities for
then we commonly have speciiable alternatives. hus, we future research may be suggested by modeling these processes
assume (as a default) that an unknown person say, Jones in connectionist networks particularly the key diference
has two arms. But if we learn that she does not, we assume that between assuming that defaults apply and assuming that they
she sufered some birth defect or is an amputee, these being the do not.
standard alternatives.
Patrick Colm Hogan
When interpreting a statement literally, we assume that all
default information applies unless it is speciically contradicted.
Moreover, if a default is contradicted, we assume that one of the
standard alternatives applies. In contrast, when interpreting a Hogan, Patrick Colm. 2002. A minimal, lexicalist/constituent transfer
statement metaphorically, we do not assume that default infor- account of metaphor. Style 36.3: 484502.
mation applies. hat is the deinitive diference. In interpreting a Ortony, Andrew. 1988. Are emotion metaphors conceptual or lexical?
Cognition and Emotion 2: 95104.
statement either metaphorically or literally, we scan lexical infor-
. 1993. he role of similarity in similes and metaphors. In
mation to glean what is most relevant to the topic at hand. But
Metaphor and hought (2d ed.), ed. Andrew Ortony, 34256. New York:
when interpreting literally, we assume that unselected, default Cambridge University Press.
information applies as well. We do not assume this when inter- Tversky, Amos. 1977. Features of similarity. Psychological Review
preting metaphorically. 84: 32752.
he basic diference has several consequences. One is worth
mentioning. All interpretation involves drawing on a range of
METAPHOR, NEURAL SUBSTRATES OF
associated information, not only that included in the lexical
entries for the source and target items. In metaphorical interpre- he interest in how the brain processes metaphors traces its
tation, the loosening of hierarchical structures (e.g., through the origins back to a tradition that regarded igurative language as
nonassumption of defaults) may encourage the incorporation poetic and, hence, the opposite of literal language. Despite its
of more distant associations, including primed emotional asso- ubiquity (Lakof and Johnson 1980), the underlying assumption
ciations. For example, when Jones refers to Smith as a braying has been that this diference should be relected both in behav-
donkey, she not only characterizes Smith but also expresses and ioral (Grice 1975; Searle 1979) and brain mechanisms. In this
tries to communicate a certain feeling. entry, we examine this and other long-standing assumptions,
his account is similar to conceptual metaphor theory in suggesting that the interactions of linguistics with empirical,
stressing cognition. However, it suggests that the cognitive neuropsychological, and neuroscientiic research have drawn
efects of metaphors need not be profound. Writers adopting this a far more complex and, arguably, fascinating picture, not only
account commonly view metaphor as operating more locally. about metaphor but also about the brain.
489
Metaphor, Neural Substrates of
Is Metaphor Really So Different? metaphors distinguishes patients with schizophrenia from

Since the 1970s, the assumption that metaphors are processed healthy controls (Kircher et al. 2007).
diferently from literals has come under close scrutiny. For Taken together, these indings suggest that lateralization
example, on the basis of psycholinguistic experiments, it has in the brains hemispheres is contingent upon such factors as
been argued that in the presence of rich and supportive con- novelty, semantic and conceptual mapping complexity, and
text, metaphors and literals are processed along the same routes evoked range of associations, all of which seem to act indepen-
(Gibbs 1994; Ortony et al. 1978). dently of igurativeness, thus challenging as too simplistic the
Although some metaphoric and literal expressions require notion of a preferential RH processing of stimuli solely by vir-
similar processes (Glucksberg 2001), it has also become increas- tue of their metaphoricity. hese factors, however, are in accor-
ingly evident that the categories used are in themselves heteroge- dance with an alternative account the inecoarse semantic
neous. For instance, some literals (the ring was made of tin, with coding hypothesis (Beeman 1998; Jung-Beeman 2005) which
a pebble instead of a gem) require more complex (metaphor-like) views the LH as adept at processing inely tuned semantic rela-
conceptual mapping processes than others (hat stone we saw tions and the RH as specialized in processing distant semantic
in the natural history museum is a gem; Coulson and Van Petten relationships.
2002). Others (curl up and dye) are more appealing though harder
to process than metaphoric equivalents (curl up and die; Giora NOVELTY. Recent studies indicate that the degree of novelty of
2003). Metaphors are not all alike either; some are novel, hav- an expression is an important determinant of neural process-
ing nonsalient metaphoric interpretations that are usually more ing. For instance, lesion studies (Giora et al. 2000; Kaplan et
appealing yet harder to process than those that are conventional al. 1990) and studies of individuals with Alzheimers disease
and salient (Giora et al. 2004). Furthermore, some metaphoric (Amanzio et al. 2008), as well as functional magnetic resonance
stimuli, though relatively conventional, may still be more open- imaging (fMRI) studies involving healthy participants (Eviatar
ended than others and, when functioning as a context, give rise and Just 2006), demonstrated that processing non-salient
to a wider range of associations (Stringaris et al. 2006). (ironic, metaphoric) interpretations relied more heavily
In fact, recent indings indicate that notions such as degree on the RH; processing conventional (metaphoric) meanings
of salience, complexity, or open-endedness may be more suit- involved the LH. Similarly, a series of fMRI, divided visual ield
able for describing the complexity of some of the phenomena (DVF), and event-related potential (ERP) studies demonstrated
in question and span the metaphor-literal divide. Furthermore, increased activation of RH areas during processing of nonsa-
while these notions may, to an extent, overlap, none of them is lient interpretations of novel metaphors (Arzouan, Goldstein,
speciic to metaphor. and Faust 2007; Faust and Mashal 2007; Mashal and Faust 2008;
Mashal, Faust, and Hendler 2005; Mashal et al. 2007) and lit-
Is Metaphor Processed Differently in the Brain? eral/compositional interpretations of idioms (Mashal et al.
Consistent with the prevailing view of the right hemisphere 2008). And while RH advantage was demonstrated in process-
(RH) as being more adept at creativity than the left hemi- ing nonsalient interpretations of novel metaphors during irst
sphere (LH), early lesion studies have been interpreted as exposure, repeated exposure beneited the LH (Mashal and
evidence that metaphors rely more heavily than their literal Faust 2009).
counterparts on regions in the RH (Winner and Gardner 1977).
However, Ellen Winner and Howard Gardners study actually COMPLEXITY. hat RH recruitment increases with complex sen-
reveals that patients with RH lesions were not insensitive to met- tences has been demonstrated by a number of studies (Jung-
aphor (1977, 725) when ofering verbal explications to igurative Beeman 2005). his has also been seen as typifying conceptual
stimuli, although they tended to erroneously select literal over mapping complexity (Coulson and Van Petten 2002), thus intro-
metaphoric interpretations in a picture-matching task. Similarly, ducing another parameter that may determine processing and
the results of the earliest imaging study in the ield (Bottini et al. operate regardless of metaphoricity. Further work is awaited to
1994) were also seen as supporting a RH predominance for meta- establish this view.
phor comprehension. However, alternative explanations may be
more appropriate, given that the linguistic items used also dif- RANGE OF SEMANTIC ASSOCIATIONS. Range of semantic asso-
fered on categories other than sensu strictu metaphoricity. ciations, also termed degree of open-endedness, can be seen
Indeed, subsequent studies have challenged the purported as determined by the extent to which a stimulus evokes a wide
predominance of the RH by demonstrating that when conven- network of semantic associations (Black 1993). In a fMRI study,
tional metaphors compared to literals are processed, the LH is Stringaris et al. (2006) showed that deciding that a given probe
more active (Ahrens et al. 2007; Lee and Dapretto 2006; Oliveri, was unrelated to a previous neutral context triggered activation
Romero, and Papageno 2004), perhaps relecting retrieval from of frontal RH areas following open-ended (metaphoric) contexts
semantic stores. In fact, most recent research suggests that in (Some answers are straight) but not following more restricted
the absence of a rich biasing context, the hemispheres are insen- (literal) contexts (Some answers are emotional). In the case of
sitive to igurativeness. Rather, the RH is more sensitive than the the open-ended primes (see priming, semantic), both nega-
LH to novel, nonsalient interpretations and poetic associations, tive and positive decisions elicited the same neural responses.
to complexity, and to open-endedness (Blasko and Kazmerski Indeed, higher degree of open-endedness may lead to increased
2006; Giora 2007). his is corroborated by a recent fmri study RH activation, probably because of the evocation of remotely
showing that failure to recruit RH areas when processing novel related associations (Jung-Beeman 2005). As shown by Mashal
490
Metaphor, Neural Substrates of
et al. (in press), RH areas were uniquely involved when novel lit- Coulson, Seana, and Cyma Van Petten. 2002. Conceptual integration
eral interpretations of familiar idioms (involving their familiar and metaphor comprehension: An ERP study. Memory & Cognition
idiomatic meanings as well) were deliberated on. 30: 95868.
. 2007. A special role for the right hemisphere in metaphor com-
prehension? ERP evidence from hemiield presentation. Brain
CONTEXTUAL INFORMATION. Contextual factors involved in
Research 1146: 12845.
processing (such as biasing information, task, mood, or expe-
Eviatar, Zohar, and Marcel Just. 2006. Brain correlates of discourse
rience) further argue against a speciic and invariant brain processing: An fMRI investigation of irony and metaphor comprehen-
locus for metaphor (Kutas 2006). hey show that recruitment sion. Neuropsychologia 44: 234859.
of neural networks depends upon factors other than metapho- Faust, Miriam, and Nira Mashal. 2007. he role of the right cerebral
ricity per se. For instance, in Coulson and Van Petten (2007), hemisphere in processing novel metaphoric expressions taken from
RH advantage in processing novel metaphors disappears in poetry: A divided visual ield study. Neuropsychologia 45: 86070.
the presence of biasing information. In Kacinik and Chiarello Gibbs, W. Raymond, Jr. 1994. he Poetics of Mind. Cambridge: Cambridge
(2007), both hemispheres were activated by metaphors, but University Press.
only the LH-response was context sensitive, thereby restricting Giora, Rachel. 2003. On Our Mind: Salience, Context and Figurative
the range of possible alternatives. Conversely, the response in Language. New York: Oxford University Press.
Giora, Rachel, ed. 2007. Is Metaphor Unique? Neural Correlates of
the RH indicated retention of alternatives available for process-
Nonliteral Language. Brain and Language 100: 2.
ing. Findings in Rapp et al. (2007) indicate that the type of task
Giora, Rachel, Ofer Fein, Ann Kronrod, Idit Elnatan, Noa Shuval, and
is an additional determinant of processing. When participants Adi Zur. 2004. Weapons of mass distraction: Optimal innovation and
had to judge the emotional valence of connotations, meta- pleasure ratings. Metaphor and Symbol 19: 11541.
phors elicited LH regions, despite their novelty. In Stringaris Giora, Rachel, Eran Zaidel, Nachum Soroker, Gila Batori, and Asa
et al. (2006), familiar metaphors activated RH areas when a Kasher. 2000. Diferential efects of right- and left-hemisphere dam-
coherence judgment was required; however, when a meaning- age on understanding sarcasm and metaphor. Metaphor and Symbol
fulness judgment was required, same stimuli evoked LH areas 15: 6383.
(Stringaris et al. 2007). In Blasko and Kazmerski (2006), it was Glucksberg, Sam. 2001. Understanding Figurative Language: From
individual diferences in experience that mattered: Poets and Metaphors to Idioms. New York: Oxford University Press.
Grice, H. Paul. 1975. Logic and conversation. In Speech Acts: Syntax
nonpoets difered as to which brain areas were recruited when
and Semantics. Vol. 3. Ed. Peter Cole and Jerry Morgan, 4158. New
reading poetry.
In sum, recent research, involving a wide range of method-
Jung-Beeman, Mark. 2005. Bilateral brain processes for comprehending
ologies, does not provide support for the long-assumed special natural language. Trends in Cognitive Sciences 9: 51218.
status of metaphor in language. Instead, it shows that the pro- Kacinik, A. Natalie, and Christine Chiarello. 2007. Understanding meta-
cessing of metaphors in the brain depends on a great number of phors: Is the right hemisphere uniquely involved? Brain and Language
factors beyond igurativeness. 100: 188207.
Kaplan, Joan A., Hiram H. Brownell, Janet R. Jacobs, and Howard
Rachel Giora and Argyris K. Stringaris
Gardner. 1990. he efects of right hemisphere damage on the prag-
matic interpretation of conversational remarks. Brain and Language
38: 31533.
Ahrens, Kathleen, Ho-Ling Liu, Chia-Ying Lee, Shu-Ping Gong, Shin-Yi Kircher, T. J. Tilo, Dirk T. Leube, Michael Erb, Wolfgang Grodd, and
Fang, and Yuan-Yu Hsu. 2007. Functional MRI of conventional and Alexander M. Rapp. 2007. Neural correlates of metaphor processing
anomalous metaphors in Mandarin Chinese. Brain and Language in schizophrenia. NeuroImage 34: 2819.
100: 16371. Kutas, Marta. 2006. One lesson learned: Frame language processing
Amanzio, Martina, Giuliano Geminiani, Daniela Leotta, and Stefano literal and igurative as a human brain function. Metaphor and
Cappa. 2008. Metaphor comprehension in Alzheimers dis- Symbol 21: 285325.
ease: Novelty matters. Brain and Language 107.1: 110. Lakof, George, and Mark Johnson. 1980. Metaphors We Live By.
Arzouan, Yossi, Abraham Goldstein, and Miriam Faust. 2007. Brain Chicago: University of Chicago Press.
waves are stethoscopes: ERP correlates of novel metaphor compre- Lee, S. Susan, and Mirella Dapretto. 2006. Metaphorical vs. literal word
hension. Brain Research 1160: 6981. meanings: fMRI evidence against a selective role of the right hemi-
Beeman, Mark. 1998. Coarse semantic coding and discourse compre- sphere. NeuroImage 29: 53644.
hension. In Right Hemisphere Language Comprehension: Perspectives Mashal, Nira, and Miriam Faust. 2008. Right hemisphere sensitivity to
from Cognitive Neuroscience, ed. Mark Beeman and Christine Chiarello, novel metaphoric relations: Application of the signal detection the-
25584. Mahwah, NJ: Erlbaum. ory. Brain and Language 104.2: 10312.
Black, Max. 1993. More about metaphor. In Metaphor and hought (2d . 2009. Conventionalization of novel metaphors: A shift in hemi-
ed.), ed. Andrew Ortony, 1941. Cambridge: Cambridge University spheric asymmetry. In Laterality 14.6:57389.
Press. Mashal, Nira, Miriam Faust, and Talma Hendler. 2005. he role of
Blasko, G. Dawn, and Victoria A. Kazmerski. 2006. ERP correlates of the right hemisphere in processing nonsalient metaphorical mean-
individual diferences in the comprehension of nonliteral language. ings: Application of principal components analysis to fMRI data.
Metaphor and Symbol 21.4: 26784. Neuropsychologia 43.14: 2084100.
Bottini, Gabriella., Corcoran Rhiannon, Roberto Sterzi, Eraldo Paulesu, Mashal, Nira, Miriam Faust, Talma Hendler, and Mark Jung-Beeman.
P. Schenone, P. Scarpa, et al. 1994. he role of the right hemisphere 2007. An fMRI investigation of the neural correlates underlying the
in the interpretation of igurative aspects of language: A positron emis- processing of novel metaphoric expressions. Brain and Language
sion tomography activation study. Brain 117: 124153. 100: 11526.
491
Metaphor, Universals of
. 2008. Hemispheric diferences in processing the literal interpre- of evidence from a number of linguists who are native speak-
tation of idioms: Converging evidence from behavioral and fMRI stud- ers of the respective languages, Zoltn Kvecses (2000) points
ies. Cortex 44.7: 84860. out that English, Japanese, Chinese, Hungarian, Wolof, Zulu,
Oliveri, Massimiliano., Leonor Romero, and Costanza Papagno. 2004. Polish, and others possess the metaphor AN ANGRY PERSON
Left but not right temporal involvement in opaque idiom comprehen-
IS A PRESSURIZED CONTAINER, to various degrees. Ning Yus
sion: A repetitive transcranial magnetic stimulation study. Journal of
(1995, 1998) work indicates that that the metaphor HAPPINESS
Cognitive Neuroscience 16: 84855.
Ortony, Andrew, Diane L. Schallert, Ralph E. Reynolds, and Stephen J.
IS UP is also present not only in English but also in Chinese. he
Antos. 1978. Interpreting metaphors and idioms: Some efects of system of metaphors called the event structure metaphor (Lakof
context on comprehension. Journal of Verbal Learning and Verbal 1993) includes such submetaphors as CAUSES ARE FORCES,
Behavior 17: 46577. STATES ARE CONTAINERS, PURPOSES ARE DESTINATIONS,
Rapp, M. Alexander, Dirk T. Leube, Michael Erb, Wolfgang Grodd, and ACTION IS MOTION, DIFFICULTIES ARE IMPEDIMENTS
Tilo T. J. Kircher. 2007. Laterality in metaphor processing: Lack of evi- (TO MOTION), and so forth. Remarkably, this set of submeta-
dence from functional magnetic resonance imaging for the right hemi- phors occurs in such widely diferent languages and cultures as
sphere theory. Brain and Language 100: 1429. Chinese (Yu 1998) and Hungarian (Kvecses 2005), in addition
Searle, John. 1979. Expression and Meaning. Cambridge: Cambridge to English. Eve Sweetser (1990) noticed that the KNOWING IS
University Press.
SEEING and the more general the MIND IS THE BODY meta-
Schmidt, L. Gwen, Casey J. DeBuse, and Carol A. Seger. 2007. Right
phors can be found in many European languages and are prob-
hemisphere metaphor processing? Characterizing the lateralization
semantic processes. Brain and Language 100: 12741.
ably good candidates for (near-)universal metaphors. As a inal
Stringaris, K. Argyris, Nicholas C. Medford, Vincent C. Giampietro, example, George Lakof and Mark Johnson (1999) describe the
Michael J. Brammer, and Anthony S. David. 2007. Deriving mean- metaphors used for ones inner life in English. It turns out that
ing: Distinct neural mechanisms for metaphoric, literal, and non- metaphors such as SELF-CONTROL IS OBJECT POSSESSION,
meaningful sentences. Brain and Language 100: 15062. SUBJECT AND SELF ARE ADVERSARIES, and THE SELF IS A
Stringaris, K. Argyris, Nicholas C. Medford, Rachel Giora, Vincent C. CHILD are shared by English, Japanese, and Hungarian. Given
Giampietro, Michael J. Brammer, and Anthony S. David. 2006. How that ones inner life is a highly elusive phenomenon and, hence,
metaphors inluence semantic relatedness judgments: he role of the would seem to be heavily culture and language dependent, one
right frontal cortex. NeuroImage 33: 78493.
would expect a great deal of signiicant cultural variation in such
Winner, Ellen, and Howard Gardner. 1977. he comprehension of meta-
a metaphor. All in all, then, we have a number of cases that con-
phor in brain-damaged patients. Brain 100: 71729.
stitute near-universal or potentially universal conceptual meta-
phors, though not universal metaphors in the strong sense.
METAPHOR, UNIVERSALS OF
Universal Metaphors? How Can We Have (Near-)Universal Metaphors?
How is it possible that such conceptual metaphors exist in diverse
Native speakers of all languages use a large number of meta-
languages and cultures? After all, the languages belong to very
phors when they communicate about the world (Lakof and
diferent language families and represent very diferent cultures
Johnson 1980). Such metaphorically used words and expres-
of the world. Several answers to this question lend themselves
sions may vary considerably across diferent languages. For
for consideration. First, we can suggest that by coincidence, all
example, the idea that is expressed in English with the words
these languages developed the same conceptual metaphors for
spending your time is expressed in Hungarian as illing your time.
happiness, time, purpose, and so on. Second, we can consider
he images that diferent languages and cultures employ to
the possibility that languages borrowed the metaphors from
code meanings can be extremely diverse. Given this diversity, it
one another. hird, we can argue that there may be some uni-
is natural to ask: Are there any universal metaphors at all, if by
versal basis for the same metaphors to develop in the diverse
universal we mean those linguistic metaphors that occur in each
languages.
and every language? his question is diicult because it goes
Let us take as an example the HAPPINESS IS UP concep-
against our everyday experiences and intuitions as regards meta-
tual metaphor, irst discussed by Lakof and Johnson (1980) in
phorical language in diverse cultures; it would also be extremely
English. his conceptual metaphor can be seen in such linguis-
diicult to study, given that there are 4,0006,000 languages spo-
tic expressions as feeling up, being on cloud nine, being high, and
ken around the world today.
others. Yu (1995, 1998) noticed that the conceptual metaphor
If we go beyond looking at metaphorically used linguistic
can also be found in Chinese. And evidence shows that it also
expressions in diferent languages, however, and look at con-
exists in Hungarian. Following are some linguistic examples (Yu
ceptual metaphors instead of linguistic metaphors, we begin
used the grammatical abbreviations PRT = particle and ASP =
to notice that many conceptual metaphors appear in a wide
aspect marker):
range of languages. For example, Hoyt Alverson (1994) found
Chinese:
that the TIME IS SPACE conceptual metaphor can be found
in such diverse languages and cultures as English, Mandarin happy is up
Chinese, Hindi, and Sesotho. Many other researchers suggested Ta hen gao-xing.
that the same conceptual metaphor is present in a large num-
he very high-spirit
ber of additional languages. Several other conceptual metaphors
appear in a large number of diferent languages. On the basis He is very high-spirited/happy.
492
Ta xing congcong de. metaphors from primary metaphors. His idea was that com-
he spirit rise-rise PRT plex metaphors (e.g., THEORIES ARE BUILDINGS) are com-
posed of primary metaphors (e.g., LOGICAL ORGANIZATION IS
His spirits are rising and rising./Hes pleased and excited.
PHYSICAL STRUCTURE). he primary metaphors consist of cor-
Zhe-xia tiqi le wo-de xingzhi. relations of a subjective experience with a physical experience.
this-moment raise ASP my mood As a matter of fact, it turned out that many of the conceptual
his time it lifted my mood/interest. metaphors discussed in the cognitive linguistic literature
are primary metaphors in this sense. For instance, HAPPY IS UP
Hungarian: is best viewed as a primary metaphor, wherein being happy is a
happiness is up subjective experience and being physically up is a physical one
Ez a ilm feldobott. that is repeatedly associated with it. Other primary metaphors
include MORE IS UP, PURPOSES ARE DESTINATIONS, and
this the ilm up-threw-me
INTIMACY IS CLOSENESS. On this view, it is the primary meta-
his ilm gave me a high.-his ilm made me happy. phors that are potentially universal.
Majd elszll a boldogsgtl. Primary metaphors function at a fairly local and speciic level
almost away-lies-he/she the happiness-from of conceptualization, and, hence, in the brain. At the same time,
we can also assume the existence of much more global meta-
He/she is on cloud nine.
phors (see also generic- and specific-level metaphors).
English, Mandarin Chinese, and Hungarian (a Finno-Ugric For example, animals are commonly viewed as humans and
language) belong to diferent language families, which humans as animals; humans are commonly conceptualized as
developed independently for much of their history. It is also objects and objects as humans, and so on. A famous example of
unlikely that the three languages had any signiicant impact on the objects as humans metaphor was described by Keith Basso
one another in their recent history. his is not to say that such an (1967), who showed that in the language of the Western Apache,
impact never shapes particular languages as regards their meta- cars are metaphorically viewed in terms of the human body. In
phors (e.g., the processes of globalization and the widespread addition, the work of Bernd Heine and his colleagues (Heine,
use of the Internet may popularize certain conceptual meta- Claudi, and Hnnemeyer 1991; Heine 1995; Heine and Kuteva
phors, such as TIME IS A COMMODITY), but only to suggest 2002) reveals other large-scale metaphorical processes that peo-
that the particular HAPPINESS IS UP metaphor does not exist in ple seem to employ nearly universally; for example, spatial rela-
the three languages because, say, Hungarian borrowed it from tions are commonly understood as parts of the human body (e.g.,
Chinese and English from Hungarian. the head means up and the feet mean down). hese conceptual
So how did the same conceptual metaphor emerge, then, in metaphors seem to be global design features of the brain/mind
these diverse languages? he best answer seems to be that there is of human beings.
some universal bodily experience that led to its emergence. Lakof It seems to be clear at this point that commonality in human
and Johnson argued early that English has the metaphor because experience is a major force shaping the metaphors we have. It is
when we are happy, we tend to be physically up, move around, be this force that gives us many of the metaphors that we can take
active, jump up and down, smile (i.e., turn up the corners of the to be near-universal or potentially universal. But commonality
mouth), rather than down, inactive, and static, and so forth. hese in human experience is not the only force that plays a role in
are undoubtedly universal experiences associated with happiness the process of establishing and using metaphors. here are also
(or, more precisely, joy), and they are likely to produce potentially countervailing forces that work against universality in metaphor
universal (or near-universal) conceptual metaphors. he emer- production.
gence of a potentially universal conceptual metaphor does not,
of course, mean that the linguistic expressions themselves will be Causes of Metaphor Variation
the same in diferent languages that possess a particular concep- Heines work also shows that not even such global metaphors as
tual metaphor (Barcelona 2000; Maalej 2004). SPATIAL RELATIONS ARE PARTS OF THE BODY are universal in
Kvecses (1990, 2000) proposed, furthermore, that the uni- an absolute sense. here are languages in which spatial relations
versal bodily experiences can be captured in the conceptual are conceptualized not as the human but as the animal body.
metonymies associated with particular concepts. Speciically, Heine points out that such languages function in societies where
in the case of emotion concepts such as happiness, anger, love, animal husbandry is a main form of subsistence. his leads us
pride, and so forth, the metonymies correspond to various kinds to the question: What causes our metaphors to vary as they do?
of physiological, behavioral, and expressive reactions. hese It is convenient to set up two large groups of causes: diferential
reactions provide us with a proile of the bodily basis of emotion experience and diferential cognitive preferences. Diferential
concepts. hus, the metonymies give us a sense of the embodied experience involves diferences in the social-cultural context, in
nature of concepts, and the embodiment of concepts may be social and personal history, and in what we can term social and
overlapping, that is, (near-)universal, across diferent languages personal concern or interest (see Kvecses 2005).
and language families. Such universal embodiment may lead to One example of how the social-cultural context can shape
the emergence of shared conceptual metaphors. conceptual metaphors is provided by Dirk Geeraerts and Stephan
Joseph Grady (1997a, 1997b) developed the Lakof-Johnson Grondelaers (1995). hey note that in the Euro-American tradi-
view further by proposing that we need to distinguish complex tion, it is the classical-medieval notion of the four humors from
493
which the Euro-American conceptualization of anger (as well as metaphorical conceptualization of a target domain. A case in
that of emotion in general) derived. he humoral view maintains point is the conceptualization of anger in English and Chinese.
that the four luids (phlegm, black bile, yellow bile, and blood) As studies of the physiology of anger across several unrelated
and the temperatures associated with them regulate the vital cultures show, increase in skin temperature and blood pres-
processes of the human body. he humors were also believed to sure are universal physiological correlates of anger (Levenson
determine personality types (such as sanguine, melancholy, etc.) et al. 1992). his accounts for the ANGER IS HEAT metaphor in
and account for a number of medical problems. he humoral English and in many other languages. However, Kings and Yus
view exerted a major impact on the emergence of the European work mentioned earlier suggest that the conceptualization of
conception of anger as a hot luid in a pressurized container. By anger in terms of heat is much less prevalent in Chinese than it
contrast, Brian King (1989) and Yu (1995 and 1998) suggest that is in English. In Chinese, the major metaphors of anger seem to
the Chinese concept of nu (corresponding to anger) is bound up be based on pressure not heat. his indicates that speakers of
with the notion of qi, that is, the energy that lows through the Chinese have relied on a diferent aspect of their physiology in
body. Qi in turn is embedded in not only the psychological (i.e., the metaphorical conceptualization of anger than speakers of
emotional) but also the philosophical and medical discourse of English. he major point is that in many cases, the universality
Chinese culture and civilization. When qi rises in the body, there of the experiential basis does not necessarily lead to universally
is anger (nu). Without the concept of qi, it would be diicult to equivalent conceptualization at least not at the speciic level of
imagine the view of anger in Chinese culture. hus, emotion hot luids.
concepts, such as anger in English, dh in Hungarian (the two Are there any diferences in the way the cognitive processes
representing European culture), and nu in Chinese, are in part of metaphor versus metonymy are used in diferent languages
explained in the respective cultures by the culture-speciic con- and cultures? Jonathan Charteris-Black (2003) examined in great
cepts of the four humors and qi, respectively. It appears that the detail how and for what purpose three concepts mouth, tongue,
culture-speciic key concepts that operate in particular cultures and lip are iguratively utilized in English and Malay. He found
account for many of the speciic-level diferences among the vari- similarities in metaphorical conceptualization. For example,
ous anger-related concepts and the PRESSURIZED CONTAINER in both languages, the same underlying conceptual metaphor
metaphor. (e.g., MANNER IS TASTE) accounts for expressions like honey-
As an example of how diferences in human concern can tongued and lidah manis (tongue sweet), and in both languages
create new metaphors, consider some well-known concep- such expressions are used for the discourse function of evaluat-
tual metaphors for sadness: SADNESS IS DOWN, SADNESS IS ing (especially negatively) what a person says. However, he also
A BURDEN, and SADNESS IS DARK. he counterpart of sad- found that the igurative expressions involving the three concepts
ness is depression in a clinical context. Linda McMullen and tended to be metonymic in English and metaphoric in Malay. In
John Conway (2002) studied the metaphors that people with English, more than half of the expressions were metonyms, while
episodes of depression use and, with one exception, found the in Malay the vast majority of them showed evidence of metaphor
same conceptual metaphors for depression that nondepressed (often in combination with metonymy). For example, while met-
people use for sadness. hey identiied the unique metaphor as onymic expressions like tight-lipped abound in English, such
DEPRESSION IS A CAPTOR. Why dont merely sad people talk expressions are much less frequent in Malay. It seems that, at
about sadness as being a captor? Most people do not normally least in the domain of speech organs, the employment of these
talk about being trapped by, wanting to be free of, or wanting concepts by means of igurative processes is partially culture
to break out of sadness, although these are ways of talking and speciic.
thinking about depression in a clinical context. It makes sense to In sum, metaphorical linguistic expressions may vary widely
suggest that people with depression use this language and way of cross-culturally, but many conceptual metaphors appear to be
thinking about their situation because it faithfully captures what potentially universal or near-universal. his happens because
they experience and feel. heir deep concern is with their unique people across the world share certain bodily experiences.
experiences and feelings that set them apart from people who However, even such potentially universal metaphors may dis-
do not have them. It is this concern that gives them the CAPTOR play variation in their speciic details because people do not
metaphor for depression (see also emotion and language). use their cognitive capacities in the same way from culture to
People can employ a variety of diferent cognitive operations culture. Moreover, shared conceptual metaphors may vary
in their efort to make sense of experience. For example, what I cross-culturally in the frequency of their use. Finally, many con-
call experiential focus can have an impact on the speciic details ceptual metaphors are unique to particular (sub)cultures or sets
of the conceptual metaphors used, and what is conceptualized of cultures because of diferences in such factors as the social-
metaphorically in one culture can predominantly be conceptu- cultural context, history, or human concern that characterize
alized by means of metonymy in another (Kvecses 2005). he these cultures.
universal bodily basis on which universal metaphors could be
Zoltn Kvecses
built may not be utilized in the same way or to the same extent
in diferent languages. What experiential focus means is that
diferent peoples may be attuned to diferent aspects of their WORKS CITED AND SUGGESTIONS FOR FURTHER READING
bodily functioning in relation to a metaphorical target domain Alverson, Hoyt. 1994. Semantics and Experience: Universal Metaphors
(see source and target) or that they can ignore or downplay of Time in English, Mandarin, Hindi, and Sesotho. Baltimore: Johns
certain aspects of their bodily functioning with respect to the Hopkins University Press.
494
Metaphor, Universals of Meter
Barcelona, Antonio. 2000. On the plausibility of claiming a metonymic METER

motivation for conceptual metaphor. In Metaphor and Metonymy at
the Crossroads, ed. A. Barcelona, 3158. Berlin: Mouton de Gruyter. Verse is text that is divided into lines (verse lines). One of the
Basso, Keith H. 1967. Semantic aspects of linguistic acculturation. subtypes of verse is metrical verse. In metrical verse, the length
American Anthropologist, n.s., 69.5: 4717. of the lines is controlled by a set of rules (indirectly, all metrical
Charteris-Black, Jonathan. 2003. Speaking with forked tongue: A com- rules count syllables). he lines in metrical verse are usually
parative study of metaphor and metonymy in English and Malay subject to other restrictions as well, most commonly restrictions
phraseology. Metaphor and Symbol 18.4: 289310. on the rhythm of the line (based on stress, syllable weight,
Geeraerts, Dirk, and Stephan Grondelaers. 1995. Looking back at
or lexical tone), and/or on a requirement that a syllable in a
anger: Cultural traditions and metaphorical patterns. In Language
speciic line-internal location be word-initial or word-inal (a
and the Cognitive Construal of the World, ed. J. Taylor and R. MacLaury,
caesura rule). Some meters also include rules about rhyme or
15379. Berlin: Gruyter.
Grady, Joseph. 1997a. Foundations of meaning: Primary metaphors and alliteration.
primary scenes. Ph.D. diss., University of California at Berkeley. Although verse is probably a universal (see poetic form,
. 1997b. heories are buildings revisited. Cognitive Linguistics universals of), found in all oral or literary traditions, there are
8: 26790. poetic traditions without metrical verse, of which perhaps the
Haspelmath, Martin. 1997. From Space to Time: Temporal Adverbials in best known is the Hebrew poetry of the Old Testament, which
the Worlds Languages. Munich and Newcastle: Lincom Europa. is based on syntactic parallelism rather than on counted syl-
Heine, Bernd. 1995. Conceptual grammaticalization and prediction. In lables. Metrical verse is found in European literatures (Greek,
Language and the Cognitive Construal of the World, ed. J. Taylor and R. English, the various Celtic, Germanic, Romance, and Slavic lit-
MacLaury, 11935. Berlin: Mouton de Gruyter.
eratures, also Finnish), in Arabic and Islamic literatures (e.g.,
Heine, Bernd, Ulrike Claudi, and Friederike Hnnemeyer. 1991.
Persian, Urdu, Turkish, Hausa), and in literatures less clearly
Grammaticalization: A Conceptual Framework. Chicago: University of
inluenced by Arabic (such as Berber and Somali), in the litera-
Chicago Press.
Heine, Bernd, and Tania Kuteva. 2002. World Lexicon of tures of South Asia (e.g., Sanskrit, Pali, Hindi, Malayalam, Tamil),
Grammaticalization. Cambridge: Cambridge University Press. of Southeast Asia (e.g., hai, Burmese, Vietnamese), and of East
King, Brian. 1989. he conceptual structure of emotional experience in Asia (e.g., Chinese, Korean, Japanese). Metrical verse is reported
Chinese. Ph.D. diss., Ohio State University. to be largely or completely absent in the poetry of ancient Semitic
Kvecses, Zoltn. 1990. Emotion Concepts. Berlin and New York: Springer- literatures and of Australia, non-Islamic Africa, the Americas,
Verlag. and New Guinea, but this may just be because researchers have
. 2000. Metaphor and Emotion. New York and Cambridge: not been looking for it (ieldworkers far too rarely ask questions
Cambridge University Press. about the poetics or poetic practice of a culture).
. 2005. Metaphor in Culture: Universality and Variation. Cambridge
he variety of meters can be illustrated by some examples.
and New York: Cambridge University Press.
English iambic pentameter requires a 10- or 11-syllable line,
Lakof, George. 1993. he contemporary theory of metaphor. In
Metaphor and hought, ed. A. Ortony, 20251. Cambridge: Cambridge
with even-numbered syllables tending to have stress. he French
University Press. alexandrin requires a line of 12 or 13 syllables, with the sixth syl-
Lakof, George, and Mark Johnson. 1980. Metaphors We Live By. lable both stressed and word-inal. Swahili shairi requires a line
Chicago: University of Chicago Press. of 16 syllables, with the eighth syllable word-inal (and no con-
. 1999. Philosophy in the Flesh. New York: Basic Books. trol over rhythm). Greek iambic trimeter requires a line of 12 syl-
Levenson, R. W., P. Ekman, K. Heider, and W. V. Friesen. 1992. Emotion lables, with even-numbered syllables heavy (containing a long
and autonomic nervous system activity in the Minangkabau of West vowel or ending in two consonants), and the third, seventh and
Sumatra. Journal of Personality and Social Psychology 62: 97288. eleventh syllables light (containing a short vowel ending in at
Maalej, Zouhair. 2004. Figurative language in anger expressions in most one consonant). Arabic kamil requires a line of between 12
Tunisian Arabic: An extended view of embodiment. Metaphor and
and 15 syllables with a complex rhythmic control (in the short-
Symbol 19.1: 5175.
est line the third, seventh and eleventh syllables are light and the
McMullen, Linda, and John Conway. 2002. Conventional metaphors for
depression. In Verbal Communication of Emotion: Interdisciplinary
others heavy). Sanskrit sardulavikridita requires a line of 19 syl-
Perspectives, ed. S. Fussell, 16781. Mahwah, NJ: Lawrence lables with the aperiodic rhythm heavy heavy heavy light light
Erlbaum. heavy light heavy light light light heavy (word boundary) heavy
Mithen, Steven. 1996. he Prehistory of the Mind: A Search for the Origin heavy light heavy heavy light heavy. Japanese meters require
of Art, Science and Religion. London and New York: hames and lines of ive or seven light syllables (but permit a heavy syllable
Hudson. to substitute for two light syllables). A genre of Vietnamese pairs
. 1998. A creative explosion? heory of mind, language, and the a six-syllable line with an eight-syllable line, in which the sec-
disembodied mind of the Upper Paleolithic. In Creativity in Human ond, and sixth (and eighth) syllables belong to one tonal class
Evolution and Prehistory, ed. S. Mithen, 16591. London and New and the fourth to another. Germanic alliterative meter requires
York: Routledge.
between two and four stressed syllables, at least two of which
Sweetser, Eve. 1990. From Etymology to Pragmatics. Cambridge and New
must alliterate.
Yu, Ning. 1995. Metaphorical expressions of anger and happiness in
In literary studies, meter is usually discussed as an aid to
English and Chinese. Metaphor and Symbolic Activity 10: 5992. interpretation, and less attention has been paid to the the-
. 1998. he Contemporary heory of Metaphor in Chinese: A ory that underlies the meter than is desirable. he approach to
Perspective from Chinese. Amsterdam: John Benjamins. meter most common in such studies is the foot combination and
495
Meter
substitution approach. In this approach, a meter such as iambic elements. In Sanskrit and later Indian meters, morae are thus
pentameter is a template made by combining ive iambic feet counted, but some meters count morae while also controlling
each of which is composed of a sequence of an unmarked sylla- syllables: In the gana-counting meters, syllables form (typically)
ble followed by a syllable that is marked. he resulting template four-mora groups that are respected in composing the line.
is matched to a line whose syllables are unstressed or stressed, Some song meters (e.g., Tongan, Ugandan) also use mora count-
so that stressed syllables occupy marked positions in the tem- ing as an organizing principle, but this may be a secondary efect
plate and unstressed syllables the unmarked positions. For in song traditions where heavy syllables match two beats, light
lines that are not fully periodic (e.g., in an iambic pentameter syllables match one, and the number of beats is musically con-
line where the rhythm does not involve a uniform repetition of trolled. Here, a better understanding of the independent metri-
unstressed-stressed throughout), the template itself is changed cal status of text and tune is required.
by substituting a foot of a diferent kind (e.g., a spondee for an In addition to controlling the length of the line, metrical rules
iamb) to match the stress pattern of the variant part of the line. also often control a pattern based on putting the syllables into
his approach only describes the actual rhythm of the line, and two classes, one marked and the other unmarked. It is of par-
though it ofers a convenient vocabulary for the literary critic, it ticular interest that metrical rules diferentiate two kinds of syl-
tells us nothing about the organization of the meter of the line lable but apparently never three or more kinds, even though this
or why some variations are possible in this meter and some not. greater diferentiation is phonetically possible in many lan-
Most recent theoretical accounts express strong reservations or guages. For example, in English metrical verse, the only strictly
total rejection of this approach. controlled syllables are those that carry main stress in a poly-
Recent theoretical approaches to meter are based primarily syllable; other syllables, whether stressed or not, are not strictly
on linguistic theory, particularly on the theory of phonology, controlled, and this is why English metrical verse is rhythmically
following the foundational work of Morris Halle and Samuel Jay fairly variable (see the discussion in music, language and).
Keyser (1971). For metrical purposes, most such theories adopt his means that as regards the strict regulation of syllable types
mechanisms that are used in the theory of phonology, particu- in English meters, the syllable carrying the main stress in a poly-
larly the theory of word stress. Following Mark Libermans (1975) syllabic word is in one class, and all other syllables, whether
insight that stress is a matter of the relation between syllables, not stressed or unstressed, are in the other class. Yet English distin-
a feature of syllables, diferent approaches explored the use of guishes several degrees of stress in longer words, such as autobi-
trees and grids as representations in accounts both of word stress ographical or onomatopoeic, and there is no question that in the
and of metrical poetry (Kiparsky 1977; Hayes 1983). he phono- perception of the rhythm of the line, we perceive more than two
logical theory of optimality theory has also been adapted for degrees of stress. In the quantitative meters of Greek, Sanskrit, or
use in metrical verse (e.g., Golston and Riad 2005). Nigel Fabb Arabic, syllable placement depends on whether a syllable is light
and Halle (2008) develop their account of poetic meter from a or is heavy. In Vietnamese, there are phonologically six distinct
formalism proposed for word stress by William Idsardi (1992); it kinds of tone, but the six types of syllable are grouped into just
groups the syllables with the help of unpaired parentheses both two tonal classes for metrical purposes. It is interesting in this
in phonology (word stress) and in lines (metrical verse). While in connection to consider alliterative meters, such as the meter of
most approaches the metrical representation is a template built Beowulf; in the normative line with four stressed syllables, the
by special rules and then matched to the line, for Fabb and Halle third must alliterate with the irst and/or second but not with the
the metrical representation is generated from the line (much as fourth. Here, stressed syllables are partitioned into two types
in generative syntax the syntactic representation is generated alliterating and not alliterating.
from the terminal elements, such as words or morphemes). A patterned distribution based on two metrical classes of syl-
In metrical verse, as noted, the length of the line is controlled. lable, such as the heavy and light syllables in Greek verse, is often
In most cases, the basic unit of measurement is the syllable. thought of as the basis of the rhythm of the line. A major way in
However, in many metrical traditions, some syllables are part of which theories of meter diverge is in their account of the relation
the line but uncounted. In a common convention, a syllable end- between rhythm and meter. For example, in English iambic pen-
ing in a vowel precedes a syllable beginning in a vowel, but only tameter, there is a general tendency for odd-numbered syllables
one of the two syllables is counted for metrical purposes (though to be unstressed and even-numbered syllables to be stressed,
both are usually pronounced). he latter fact shows something but the actual pattern of stressed and unstressed syllables var-
important; it shows that the grouping and counting of syllables ies constantly from line to line, thus, lines in the same meter can
for metrical purposes is not directly dependent on the phonology vary in their rhythm. Some accounts of English meters attempt to
of the lines. It also poses evident problems for other approaches, explain the full range of rhythmic variation by building statistical
such as that of Kristin Hanson and Paul Kiparsky (1996), which tendencies into the metrical rules. In a diferent approach, Derek
attempt to account for variation in number of syllables by refer- Attridge (1982) incorporates rhythm fully into his account of
ring to the speciic phonology of the language. metrical verse, so that meter and rhythm are accounted for by a
In Japanese, some Indian meters, and some other metrical single theory. In his account, the metrical template also includes
traditions, morae are counted, a heavy syllable counting as two elements that match silences in the text (ofbeats), thus build-
morae and a light syllable as one mora. It is often argued that ing temporal notions into the metrical theory. his and similar
the heavy syllable actually consists phonologically of two morae, accounts must cope with the fact that lines with the same met-
but this is not necessary for an explanation of the meter, which rical pattern can be realized with diferent rhythmic patterns,
can refer just to the syllable as projecting one or two metrical and vice versa. If rhythm is not explained by the metrical rules,
496
Methodological Solipsism Methodology
several types of explanation are possible (and can be combined). mental states, this is impossible because Putnams thought refers
For example, Fabb (2002) argues that the perception of rhythmic to H2O whereas his Twins thought refers to XYZ. Fodors conclu-
regularity involves pragmatic processes of pattern matching that sion from this argument is that mental states are to be construed
are distinct from metrical rules (which govern those aspects of narrowly, without reference to the external state of the world.
the line that are strictly controlled and, like other kinds of implicit Only the so-called narrow content and structure of a belief deter-
linguistic rules, are not directly perceived). It is also possible that mines behavior, not whether the belief is about H2O or XYZ or
rhythmic patterns might be independently represented, per- even whether its true or not.
haps by grids similar to those found in metrical verse. he link In addition to mental states being internal, Fodor argues,
between the metrical form and the rhythmic form of the verse mental processes have to be computational, that is, work on the
then may fall under a theory of text-to-tune matching. formal, syntactical properties of mental states, rather than on
their semantical properties, which are forbidden in an inter-
Nigel Fabb and Morris Halle
nalist account. his is called the formality condition.
Fodors main argument for MS is a negative one: Adhering
to its counterpart renders psychology practically impossible
Attridge, Derek. 1982. he Rhythms of English Poetry. Harlow, because it assumes the availability of a full description of the
UK: Longman. relevant aspects of the world in physical terms, such being
Fabb, Nigel. 2002. Language and Literary Structure: he Linguistic necessary to individuate mental states. hat is, we would need
Analysis of Form in Verse and Narrative. Cambridge: Cambridge
to have the physical description of water available to tell us
University Press.
what a mental state containing water is about. However, such
Fabb, Nigel, and Morris Halle. 2008. Meter in Poetry: A New heory.
physical descriptions are often unavailable. MS is diferent
Golston, Chris, and Tomas Riad. 2005. he phonology of Greek lyric from functionalism (as proposed by Putnam) in that it adds the
meter. Journal of Linguistics 41: 77115. requirement that mental states are formal, symbolic entities on
Halle, Morris, and Samuel Jay Keyser. 1971. English Stress: Its Form, Its which computational processes can work. Functionalism deines
Growth and Its Role in Verse. New York: Harper and Row. mental states to be determined by their functional role, that is,
Hanson, Kristin, and Paul Kiparsky. 1996. A parametric theory of poetic their place in a causal network of other mental states, sensory
meter. Language 72: 287335. inputs, and behavior resulting from them. Functionalism sets
Hayes, Bruce. 1983. A grid-based theory of English meter. Linguistic apart mental states from their physical substratum, whereas MS
Inquiry 14: 35794. divorces mental states from their causal antecedents in the world
Idsardi, William. 1992. he computation of stress. Ph.D. diss.,
and proposes that mental states are to be treated as syntactical
Massachusetts Institute of Technology.
rather than semantical entities.
Kiparsky, Paul. 1977. he rhythmic structure of English verse. Linguistic
Inquiry 8: 189247. Ingmar Visser
Liberman, Mark. 1975. he intonational system of English. Ph.D. diss.,
Massachusetts Institute of Technology. WORKS CITED AND SUGGESTIONS FOR FURTHER READING
Fodor, Jerry A. 1980. Methodological solipsism considered as a research
METHODOLOGICAL SOLIPSISM strategy in cognitive psychology. Behavioral and Brain Sciences
Methodological solipsism (MS) is the thesis that mental (or 3: 63109.
Putnam, Hilary. 1975a. he meaning of meaning. In Mind, Language
psychological) states are to be individuated solely by referring
and Reality: Philosophical Papers, II: 21571. Cambridge: Cambridge
to their relationships with other mental states and the physical
University Press.
state of someones brain, but not by referring to the physical . 1975b. he nature of mental states. In Mind, Language and
world outside the individual to whom those states are ascribed. Reality: Philosophical Papers, II: 42940. Cambridge: Cambridge
his phrase was coined by Hilary Putnam (1975a) in an essay University Press.
about meaning externalism and internalism, but its Tuomela, Raima. 1989. Methodological solipsism and explanation in
main advocate (Putnam opposes the thesis) is Jerry Fodor, who psychology. Philosophy of Science 56.1: 2347.
defends it as part of the representational and computational the-
ory of mind (Fodor 1980).
METHODOLOGY
he main issue that MS is concerned with is the relation-
ship between mental states and the outside world or, rather, the he topic of methodology most generally involves exploring the
absence of such relationships so far as explanation in scientiic range of responses to the following questions that any researcher
psychology is concerned. In Putnams (1975a) Twin Earth in the language sciences must answer: What sort of empirical
thought experiment, the question is whether Putnam and his data are you collecting, how are you collecting it, and how do
Twin are in the same mental state when thinking about water, you hope it will bear on the research question(s) you are trying
given that on Earth water is H2O, whereas on Twin Earth water to answer? Addressing these questions for a particular special-
has the chemical formula XYZ (though it behaves otherwise iden- ization within the language sciences falls under the purview
tically to water on Earth). If mental states are to explain behavior, of its experts. My goal here is a more general examination of
Fodor argues, it needs to be the case that Putnam and his Twin issues that arise when language scientists assess alternative data
are in the same mental state when they are thinking I would like sources and means of data collection and seek to interpret their
to take a dive into the deep waters. In an externalist account of data. hese high-level choices require more inesse in language
497
Methodology
sciences than in others, perhaps, because (for the purposes of an otherwise comparable population is one of the major chal-
most researchers) humans are the only creatures that display the lenges of research. For example, if you want to study specific
target phenomenon, namely, language. his creates ubiquitous language impairment (SLI), you presumably want to com-
challenges owing to the great complexity of the human organ- pare children with SLI to unafected children, but which ones?
ism (as compared, say, to a fruit ly) and to ethical considerations If you use children of the same chronological age, you will surely
that prevent us from carrying out potentially informative proce- ind many diferences in their speech, but this will mainly con-
dures that are used with other species. irm that a speech pathologist was correct in diagnosing the irst
I structure the discussion around a taxonomy of the ways group with SLI. More interesting would be to ind that younger
in which data can be collected by language scientists. Practical unafected children whose language is similar to that of the
exigencies limit the discussion to data about (human) language, children with SLI in some respects (e.g., mean length of utter-
though many language scientists need to gather other sorts of ance) nonetheless is more advanced in others (e.g. correct use of
data as well (e.g., computational linguists collect simulation inlectional morphology).
data, anthropological linguists collect cultural data, dialectolo- here are numerous ways of classifying speakers into groups
gists collect geographical data, etc.). I further narrow the focus that have seemed fruitful: age (see aging and language),
to data collected for research purposes, leaving aside issues par- gender (see gender and language), handedness, education,
ticular to clinical (see speech-language pathology), foren- native versus non-native speaker, mono- versus bi-/multilingual
sic (see forensic linguistics), and other applications. I also (see bilingualism and multilingualism), socioeconomic
omit discussion of instrumental, statistical, or other formal treat- status (see sociolinguistics), and many more. Interpreting
ment of data: his is important, but ultimately futile if ones data any correlations one inds between one of these variables and
do not properly address the questions to be answered. some language phenomenon is rarely straightforward, how-
My taxonomic framework divides empirical methods along ever. For example, if we ind that increasing age correlates with
two dimensions. One dimension along which data collection increasing frequency of tip-of-the-tongue states, does that impli-
can be characterized is by the population of speakers/hearers cate a general decline in memory retrieval with age, or rather an
(hereafter simply speakers, by which I mean also users of sign ability to partially retrieve words that younger people could not
languages) from whom one is collecting data: adult native retrieve at all, thanks to greater exposure to these words over the
speakers of a particular language, children growing up in bilin- course of a lifetime (Gollan and Brown 2006)?
gual households, a creole community, the last surviving
speaker of a dying language (see extinction of languages), RELEVANCE OF ATYPICAL SPEAKERS TO THE STUDY OF TYPICAL
the unknown author(s) of an ancient text, the editorial board of a SPEAKERS. A second reason for studying a particular population
dictionary, and so on. A second dimension, more or less orthog- is to allow us to learn things about typical language that typical
onal to the irst in principle, is how the language data get from speakers do not. For example, it has been suggested that certain
the speaker(s) to the researcher(s). Researchers may observe the language disorders represent an otherwise intact language sys-
speaker while he or she is doing something involving language, tem from which one circumscribed grammatical mechanism has
or they may gather data produced as the result of a prior event been removed or rendered inoperative, as in Yosef Grodzinskys
involving language, via an artifact or another person. 1986 account of agrammatic aphasia, according to which
As illustrations, I use hypothetical indings that should not traces of movement are missing from otherwise normal syn-
be taken as statements of fact. Because their importance here tactic representations. No unafected speakers would provide us
lies in clarifying conceptual points, I have not restricted myself with the opportunity to pose the question What does syntax
to attested uncontroversial results, though I believe these hypo- look like if you take out just the traces? Similarly, SLI in later
thetical results not to be wildly implausible. life could, on certain views, allow us to ask how an incompletely
developed morphosyntax behaves when coupled with adult-
Different Populations sized open-class vocabulary and general cognitive capacities,
INTRINSIC INTEREST IN SUBGROUPS. Obviously, if ones research such as working memory (see working memory and lan-
questions are about a particular population (e.g., the language guage processing) permitting us to test, for example, the
of autistic children [see autism and language], speakers of behavior of complex sentences in such circumstances. he
tone languages), then this is an excellent reason for collecting congnitive immaturity of (typically developing) children pre-
data from that population, but it is not the only possible reason, cludes this kind of test.
as will be discussed presently. Research on any group other than here is a major caveat when dealing with atypical popula-
the default (healthy adult native speakers) is virtually always tions, however particularly when analyzing them using theo-
comparative, if only implicitly: In order to know whether one is ries based on typical populations or drawing conclusions about
really discovering properties of population X, rather than simply such populations. We do not know how circumscribed their
heretofore unknown properties of human language in general, deviation from the norm really is. For example, in the case of
X must be compared with population Y with regard to the same focal brain damage, it is not an innocent assumption to posit that
properties. For example, inding that some group of bilinguals the speakers subsequent use of language is simply the output of
has an average vocabulary size of n in their dominant language an otherwise normal brain (as it was before the lesion occurred)
would be most interesting in the context of knowing that an minus whatever function(s) used to be performed by the lost
otherwise-comparable group of monolinguals has an average neural structures. Rather, the speakers language use is the prod-
vocabulary size of, say, 1.75n. Determining what constitutes uct of a damaged brain that has recovered from injury in ways
498
Methodology
that we do not yet know how to ascertain. Some functions previ- (ERP), magnetoencephalography (MEG). Useful results have
ously performed in the damaged area may have been taken over been obtained on some measures without giving subjects any
by intact areas, which may in turn have lost some of their original task at all, simply by exposing them to language auditorily, but
functionality; areas that were inhibited by the damaged area may depending on the technique, mental tasks or even ones involving
now be free to come into play; and so forth. responses such as button pressing are possible. A serious meth-
Another kind of atypical population includes the expert users odological issue arises when we want to interpret the resulting
of language: authors, comedians, songwriters, journalists, poets, data, however. For example, consider eye-tracking data from a
playwrights, preachers, politicians, and so on. hey can be taken sentence reading task. he assumption has usually been that
as proof by example of what it is possible for humans to do with the longer a reader spends looking at a particular word or group
language, but beyond that we know very little about how they of words, the harder they found those words to process or under-
come by their expertise, and so it is hard to say how they might stand. While that is true in many cases, one situation in which
inform the study of language in nonexpert speakers. people may spend a very short time looking at some word is
A special reason for choosing particular speakers to study is when it signals the need to reanalyze an earlier part of the sen-
because of their genetic relationship to other speakers. his is tence and triggers an immediate regressive eye movement. his
most obvious for language disorders suspected to have a herita- clearly should not be taken to indicate ease of processing. Due
ble component, such as SLI. But genetic relationships, in partic- to challenges of this sort, there are now a half dozen or more
ular between twins, can be used in language sciences (as in many measures of ixation times commonly reported in eye-tracking
sciences) to approach issues concerning the possible contribu- studies, but their interpretation is not agreed upon and may
tions of the genotype to aspects of language in the phenotype. well depend on the particulars of what is being read. Here, and
he standard methodology is to compare monozygotic to dizy- especially for brain measures, as data become richer they do not
gotic twin pairs, whereby the former share (on average) twice as necessarily become more informative until foundational results
much genetic material. Any phenomenon of language where the establish how the basic response features are to be interpreted.
monozygotic pairs are more similar is taken to be shaped more It is sometimes thought that we do not actually need to
heavily by prewired brain structures (cf. Ganger 1998). understand these detailed properties of brain activity in order to
make productive use of these measures: So long as we can show
Types of Data Collection: Immediate Versus Delayed that stimulus Y patterns like stimulus X while stimulus Z pat-
By immediate collection of language-relevant data I mean that terns diferently, then we have evidence that whatever manipu-
researchers obtain data from a speaker while he or she is engaged lation we used in creating the stimuli classiies X with Y to the
in some language-related behavior (though this may not involve exclusion of Z. For example, someone might try to ask whether
any action or even any awareness of language on the speakers binding theory (see also anaphora) is really part of syntax or
part). By delayed data collection I mean that researchers part of semantics by creating a sentence that violates a clearly
obtain language-related data after the fact, including by studying syntactic principle (X), one that violates a clearly semantic prin-
artifactual records of previous language-related behaviors (e.g. ciple (Z), and a binding violation (Y), and then seeing whether
transcriptions, recordings, corpora [see corpus linguistics], Y patterns like X or like Z in ERPs (or neither, in which case
grammars, etc.). his distinction is crucial because it bears on no conclusion can be drawn). But it is impossible to construct
how much researchers can know about the original event. In sentences that are identical in all respects (phonology, mor-
what follows I exemplify numerous approaches in each category phology, sequence of word classes, etc.) except for these vio-
and suggest advantages and disadvantages. lations, and so if we know nothing about what the observed brain
patterns actually mean, all that this kind of experiment can tell
KINDS OF IMMEDIATE DATA COLLECTION. here are really only two us is that some property shared by X and Y is lighting up, and Z
sorts of data one can collect from speakers who are doing a lin- does not share that property. (Although ERP researchers speak
guistic task (meant broadly, as shorthand for doing something of components sensitive to syntactic violations versus semantic
that involves language). One is to collect data on what they are anomaly, the basis for this is a very small range of sentence types,
(deliberately) doing: for example, if they are talking, what they are and semantic really refers to real-world implausibility, not vio-
saying; if they are listening, when they are nodding, when they are lations of principles of formal semantics.)
smiling, and so on. he other is to collect some other physical mea- Turning now to conscious reaction tasks, the most common
sure that will (one hopes) provide evidence about language inside of course involve psychologys favorite technique, measuring
them. his can take the form of voluntary behavioral measures or reaction time to press a button, say a word, and so on. Within
involuntary physiological or brain measures. (he data may be pre- certain schools of linguistics, grammaticality judgments
served for later analysis, e.g., on videotape. What is crucial in count- are the most favored (increasingly encompassed by the broader
ing it as immediate data is that it captures the speakers immediate term well-formedness ratings, as they are also applied to individ-
response. Even a questionnaire can fall into this category if speakers ual words and are elicited on multipoint or open-ended scales).
report their immediate reactions, e.g., Yes/No or numeric ratings.) In mentioning these two types of data collection side by side, my
In the category of involuntary responses, we ind such tech- intent is to emphasize their similarities. hey both involve col-
niques as measuring galvanic skin response, pupil diameter, lecting behavioral measures in immediate response to some lin-
and eye movements, plus indicators of brain activity from neu- guistic stimulus. Grammaticality judgments can be recorded and
roimaging positron emission tomography (PET), functional timed by computer. Contrariwise, experiments normally carried
magnetic resonance imagining (fMRI), event-related potential out by computer can be done interview style, for example, with
499
Methodology
people who cannot read. Interviews lose ine-grained timing member of an isolated society that does not welcome outsiders
information (experimenters should still note gross diferences may report that its leader uses special vocabulary; if research-
in response times) but gain elsewhere, including the allowance ers are not members of this society, they must take the word of
for open-ended narrative responses and the possibility of asking someone else.
follow-up questions contingent thereon. It is important to note Evidently there are some situations in which the use of delayed
that the presence/absence of a laboratory setting, electronic language data is unavoidable, for example, when studying dead
equipment, statistical analysis, and so on has no bearing on the languages or when speakers are not accessible. Also, quantita-
conceptual/epistemological nature of the data collected. tive measures such as word frequency could not practically be
Finally in this category are data from speakers who are actu- calculated entirely from immediate interactions with individual
ally using language with no extra task imposed on them. It is speakers. More generally, the use of delayed data afords us a
surprising prima facie how little time most language scientists much larger sample of language material, hence, potential expo-
spend actually observing just these events. he reason is largely sure to rare phenomena that we might otherwise never become
practical. Most research necessarily concentrates on a quite spe- aware of. However, it is misguided to think that the availability
ciic aspect of some linguistic phenomenon; waiting for it to arise of billions of words of computer-searchable text has eliminated
by chance is too resource intensive. Nonetheless, it is important the need for explicit data-gathering tasks: For the vast majority of
to keep in mind that every step away from the real situations we the worlds languages, the quantity of existing written materials
are interested in will introduce both random errors and system- (if there is a writing system at all) is many orders of magnitude
atic distortions for example, in the case of transcripts, due to smaller than for the languages that dominate the information
imperfect recording quality and the transcribers subconscious age, and much of it is not on computer.
assumptions about what is being said, respectively.
Conclusion
KINDS OF DELAYED DATA COLLECTION. I classify as a delayed data Part of what makes the study of language both fascinating and
situation one in which the object of the researchers measure- frustrating is that language can never truly be studied in isola-
ments, observations, and so on is not a person at a time when he tion: It inexorably traces back to the bodies and brains of human
or she is engaging with language, but rather some indication of beings, which both are always doing myriad things. Rather than
what may have happened at such a time: an artifact produced by trying to ignore this as an inconvenience, researchers would do
that person, or a behavior observed by someone other than the well to keep it in mind whenever they have methodological deci-
researcher. here are two major subclasses of such data. sions to make. Sometimes, as with twin studies and certain lan-
One subtype comprises any instances of written language, guage disorders, it can even be turned into an advantage.
whether created by an original act of writing or representing an
attempt to transcribe or otherwise keep record of language that Carson T. Schtze
was originally spoken. (Although this distinction is important,
all written material, including phonetic transcription, loses WORKS CITED AND SUGGESTIONS FOR FURTHER READING
much information found in spoken language.) his includes Botha, Rudolph P. 1981. he Conduct of Linguistic Inquiry: A Systematic
documents from now-dead languages, dictionaries and gram- Introduction to the Methodology of Generative Grammar. he
mars, dialect atlases, poetry, song lyrics, scripts, and so on Hague: Mouton.
(in some of which the writers intent may be to sound unlike Cowart, Wayne. 1997. Experimental Syntax: Applying Objective Methods
his own or anyone elses natural speech or prose writing), as to Sentence Judgments. housand Oaks, CA: Sage.
well as corpora amassed speciically for academic purposes. Ganger, Jennifer B. 1998. Genes and environment in language acquisi-
tion: A study of early vocabulary and syntactic development in twins.
Any text found on the World Wide Web falls into this category
Ph.D. diss., Massachusetts Institute of Technology.
as well. One can, of course, treat textual material as an object of
Gollan, Tamar H., and Alan S. Brown. 2006. From tip-of-the-tongue
study unto itself, ignoring how it was created, but if one wants to
(TOT) data to theoretical implications in two steps: When more TOTs
use it as evidence bearing on human language in general, then means better retrieval. Journal of Experimental Psychology: General
considering the many diferences between writing and talking 135: 46283.
becomes paramount. Most signiicant is the ability to edit writ- Grodzinsky, Yosef. 1986. Language deicits and the theory of syntax.
ten material after initially producing it (in most situations). he Brain and Language 27: 13559.
Web, increasingly used as a corpus because of its size, comes Labov, William. 1972. Some principles of linguistic methodology.
with many special problems: It can be hard to ascertain who Language in Society 1: 97120.
actually wrote any given passage; it is usually impossible to Matthewson, Lisa. 2004. On the methodology of semantic ieldwork.
establish the native language(s), gender, age, and so on of the International Journal of American Linguistics 70: 369415.
Newmeyer, Frederick J. 1983. Grammatical heory, Its Limits and Its
author; the intended meaning and discourse function is often
Possibilities. Chicago: University of Chicago Press.
unclear; and so forth.
Resnik, Philip, Aaron Elkiss, Ellen Lau, and Heather Taylor. 2006. he
he second subtype of delayed data is hearsay, that is, reports Web in heoretical Linguistics Research: Two Case Studies Using the
given by someone about language phenomena witnessed, told Linguists Search Engine. In Proceedings of the 31st Annual Meeting of
about, or engaged in personally. For example, elderly speak- the Berkeley Linguistics Society, 26576.
ers might report that their parents used to use some expression Schtze, Carson T. 1996. he Empirical Base of Linguistics: Grammaticality
but that they themselves never used it. his is information that Judgments and Linguistic Methodology. Chicago: University of Chicago
researchers have no way of independently verifying. Likewise, a Press.
500
Metonymy
. 2005. hinking about what we are asking speakers to do. In Like lexical metonymies, grammatical metonymies operate
Linguistic Evidence: Empirical, heoretical, and Computational both on the synchronic and diachronic levels. he coercive pro-
Perspectives, ed. Stephan Kepser and Marga Reis, 45784. cess in metonymy is particularly striking in cases where gram-
Berlin: Mouton de Gruyter. matical meaning conlicts with lexical meaning. For example,
stative predicates, such as the verb be, may be used in construc-
METONYMY tions that normally require action predicates, such as impera-
tives. hus, the slogan of the American news network CNN Be the
Metonymy (Greek change of name) is one of the irst to know is interpreted as the efect of an intentional act to
major igures of speech recognized in classical rhetoric. he be carried out by the hearer: Do something [viz. watch CNN] so
Roman treatise Rhetorica ad Herennium deines metonymy as that, as a result, you are the irst to know. he conceptual shift
a trope that takes its expression from near and close things by at work here is based on the result for action metonymy. On
which we can comprehend a word that is not denominated by the diachronic level, metonymy plays a crucial role in gram-
its proper word. his ancient characterization already points to maticalization processes. For example, the lexical item go (in
two criterial notions of metonymy, contiguity and substitution, conjunction with the present progressive) in the phrase be going
which still occur in most present-day deinitions of metonymy to has grammaticalized into a future marker. Human motion is
as the substitution of one word for another with which it is typically directed toward a goal and, hence, is strongly associ-
associated. ated with the intention of reaching the goal. Since the goal can
Recent studies in cognitive linguistics have shown that only be reached in the future, the intention to reach the goal may
metonymy is not just a matter of words and their substitution stand for the future itself.
but is part of human thinking and reasoning. he conceptual Looked at from a pragmatic point of view, metonymy can
nature of metonymy has been demonstrated by George Lakof be regarded as a matter of inferencing. We can distinguish the
(1987). For example, the term mother makes many people think following three types of metonymic inference: inferences about
of a housewife mother. he relationship between mothers and a referential item (referential metonym), inferences about a
housewives is metonymic and operates only on the conceptual predicate (predicational metonymy), and inferences about the
level: he category mother is metonymically associated with the speech-act meaning (illocutionary metonymy) (Panther
subcategory housewife mother as one of its members. and hornburg 1998). Referential metonymy is a means of indi-
Various cognitive linguists have described the conceptual rect reference. For example, the use of subway in he subway is on
basis of metonymy using the notion conceptual frame. Frames strike invites the inference that the subway personnel is meant.
are packages of knowledge about coherent segments of experi- Predicational metonymy is exempliied by utterances such as
ence. he elements of a frame are conceptually contiguous: Any he saxophone player had to leave early, which in many contexts
element evokes the frame as a whole and, concomitantly, other induces the metonymic inference that the saxophone player left
elements within the frame network. For example, the concept early. In this case, an obligation to leave is interpreted as an actu-
author establishes a frame that includes literary works, a pub- ally occurring action. Illocutionary metonymy is illustrated by
lisher, biographical information, etc. Since these elements are utterances such as Can you lend me ten dollars? he speaker liter-
conceptually contiguous, they may be exploited by metonymy. ally poses a question about the hearers ability to lend the speaker
hus, we may metonymically refer to a book by naming its author, $10, but this question gives rise to the metonymic inference that
as in We are reading Shakespeare. Typically, a metonymic inter- the hearer is being asked to lend $10 to the speaker; it is under-
pretation is coerced when there is a conceptual conlict between stood as a request. Conventional indirect requests like these are
expressions belonging to the same frame. In the previous exam- not just random substitute forms for the direct request Lend me
ple, the verb read requires an object that denotes a linguistically ten dollars. he literal meaning of the metonymic expression has
coded content, such as a book or a letter. he conceptual conlict an important communicative function in this indirect request.
is resolved by understanding Shakespeare as a reference point It addresses a potential obstacle: he hearer might be unable
that provides mental access to Shakespeares literary work to carry out the requested action because he or she needs the
(Langacker 1993; Radden and Kvecses 1999). money, too (Gibbs 1994). In fact, the example illustrates an
Studies in metonymy have traditionally focused on words. important general point: he literal meaning of a metonymy is
Standard examples on the synchronic level include he ket- always relevant to the interpretation of metonymic expressions.
tle is boiling (container for content) and Jonathan is in the It thus provides strong evidence against the view that metonymy
phone book (person for name). Metonymic processes on the is merely the substitution of one word for another.
diachronic level have been long noted by historical lin-
guists and amply demonstrated since the nineteenth century. Gnter Radden and Klaus-Uwe Panther
Metonymic shifts have been observed cross-linguistically in a
number of conceptual frames (Koch 1999). For example, in the WORKS CITED AND SUGGESTIONS FOR FURTHER READING
marriage frame, a preparatory status of being engaged may
Gibbs, Raymond W., Jr. 1994. he Poetics of Mind: Figurative hought,
stand for the state of being married. hus, the Latin word sponsus/
Language, and Understanding. Cambridge: Cambridge University
sponsa with the meaning ianc/iance shifted its meaning to Press.
bride/bridegroom and ended up with the meaning husband/ Koch, Peter. 1999. Frame and contiguity: On the cognitive bases of
wife, as in Spanish esposo/esposa, French poux/pouse, and metonymy and certain types of word formation. In Panther and
English spouse. Radden 1999, 13967.
501
Minimalism
Lakof, George. 1987. Women, Fire, and Dangerous hings: What Categories inserted throughout the course of the syntactic derivation, via
Reveal about the Mind. Chicago: University of Chicago Press. generalized transformations, rather than all in one initial block.
Lakof, George, and Mark Johnson. 1980. Metaphors We Live By. Chicago he derivation proceeds bottom up with the most deeply
and London: University of Chicago Press. embedded structural unit created, then combined, via merge,
Langacker, Ronald. 1993. Reference-point constructions. Cognitive
with the head of which it is the complement to create a larger
Linguistics 4: 138.
unit, and so on. Consider the derivation of he woman will see
Panther, Klaus-Uwe, and Gnter Radden, eds. 1999. Metonymy in
Language and hought. Amsterdam and Philadelphia: Benjamins.
the man: he noun (N) man is combined with the determiner (D)
Panther, Klaus-Uwe, and Linda L. hornburg. 1998. A cognitive approach the to form the determiner phrase (DP) the man. his DP then
to inferencing in conversation. Journal of Pragmatics 30: 75569. combines with the verb see to produce an intermediate projec-
Radden, Gnter, and Zoltn Kvecses. 1999. Towards a theory of meton- tion (in the sense of x-bar theory), V-bar. he DP the woman
ymy. In Panther and Radden 1999, 1759. is created in the same fashion as the man, and is combined with
the V-bar to produce the VP. Next, this VP merges with the Inl
will producing I-bar. he DP the woman inally moves (leaving
MINIMALISM
a TRACE t) to the speciier position of I, yielding the full clausal
Minimalism, extending earlier work in transformational projection IP, schematically illustrated in (2) (by labeled bracket-
grammar and generative grammar, conjectures that the ing, a notational variant of tree representation):
computational system central to human language is a per-
(2) [IP he woman [I will [VP t [V see [DP the man]]]]]
fect solution to the task of relating sound and meaning. Recent
research has investigated the complexities evident in earlier In this model, there is no one representation following all lexi-
models and attempted to eliminate them, or to show how they cal insertion and preceding all singulary transformations. hat
are only apparent, following from deeper and simpler proper- is, there is no D-structure.
ties. Major examples of this work include the reduction of the
number of linguistic levels of representation in the model and
Some Minimalist Goals
the deduction of certain constraints on syntactic derivations
So far, S-structure persists: If there is a point where the deriva-
from general considerations of economy and computational
tion divides, branching toward LF on one path and toward PF on
simplicity.
the other, that point is S-structure. he more signiicant question
Like earlier versions of generative grammar, the minimalist
is whether there are any crucial conditions deined on it as in
program (MP) (Chomsky 1995b, 2000, 2004, 2005) maintains that
the GB framework, for example, with respect to binding theory
linguistic competence is a computational system creating and
(Chomsky 1981). One goal of the minimalist research program is
manipulating structural representations. MP further proposes
to establish that these further properties are actually properties
that the derivations and representations conform to economy
of LF, as suggested in the mid-1980s (Chomsky 1986), contrary to
criteria, demanding that they be minimal in a sense determined
previous arguments (Chomsky 1981).
by the language faculty (perhaps ultimately by general proper-
Another goal is to reduce all constraints on representation to
ties of organic systems): no extra steps in derivations, no extra
bare output conditions, determined by the properties of the men-
symbols in representations, and no representations beyond
tal systems that LF and PF must interface with. For instance, the
those that are conceptually necessary.
motor system determines that a phonetic representation must
Reduction of Levels be linearly ordered.
Internal to the computational system, the desideratum is
Minimalism developed out of the government and binding that constraints on transformational derivations be reduced to
(GB) or principles and parameters model (Chomsky 1981, general principles of economy. Derivations beginning from the
1982; Chomsky and Lasnik 1993). In that model, there are four same choice of lexical items are compared in terms of number
signiicant levels of representation, related by derivation: of steps, length of movements, and so on, with the less economi-
cal ones being rejected. An example is the minimalist deduction
(1) D(eep)-Structure
of the superiority condition, which demands that when multiple
items are available for wh-movement in a language, like English,
S(urface)-Structure
allowing only one to move, it is the highest one (one closest to
PF LF the root of the phrase structure tree) that will move:
(Phonetic Form) (Logical Form)
(3) Who t will read what
Given that a human language is a way of relating sound (or, (4) *What will who read t [* indicates ungrammaticality]
more generally, gesture, as in sign languages) and meaning,
the interface levels PF and LF were assumed to be ineliminable. Economy, in the form of shortest move, selects (3) over (4) since
Minimalism begins with the hypothesis that there are no other the subject is higher than the object, hence, closer to the sen-
levels. tence initial target of wh-movement than the object is.
he simplifying developments in the theory leading toward
Structure Building the minimalist approach generally led to greater breadth and
Minimalism, in a partial return to the apparatus of pre-1965 depth of understanding of both how human languages are orga-
transformational theory (Chomsky 1955), has lexical items nized (descriptive adequacy) and how they develop in childrens
502
Minimalism
minds (explanatory adequacy) (see descriptive, obser- as a result of the movement, but that is simply a beneicial side
vational, and explanatory adequacy). his success efect of the satisfaction of the requirement of the attractor. he
led Noam Chomsky to put forward the audaciously minimal- earlier minimalist approach to the driving force of movement
ist conjecture that we are now in a position to go even beyond was called Greed by Chomsky. his later one developed out of
explanatory adequacy: he human language faculty might be what Howard Lasnik (1995) called Enlightened Self Interest.
a computationally perfect solution to the problem of relating
sound and meaning, the minimal computational system given The Syntactic Similarity of Languages
the boundary conditions provided by other modules of the mind. One recurrent theme in GB and minimalist theorizing, motivated
his conjecture leads to a general minimalist critique of syntactic by the quest for explanatory adequacy, is that human languages
theorizing, including Chomskys own earlier minimalist theo- are syntactically very similar. he standard GB and early mini-
rizing. Consider irst the leading idea that multiple derivations malist instantiation of this claim was the proposal that supericial
from the same initial set of lexical choices are compared. his diferences result from potential derivational timing diferences
introduces considerable complexity into the computation, espe- among languages, with the same transformation applying in
cially as the number of alternative derivations multiplies. It thus overt or covert syntax. Under both circumstances, LF relects the
becomes desirable to develop a model whereby all relevant deri- results of the transformation. For example, the wh-movement
vational decisions can be made in strictly Markovian fashion: At operative in English interrogative sentences is overt movement
each step, the very next successful step can be determined, and to speciier of C(omplementizer). In many other languages,
determined easily. his arguably more tractable local economy including Chinese and Japanese, interrogative expressions seem
model was suggested by Chomsky (1995a) and developed in to remain in situ, unmoved, as seen in the contrast between (9)
detail by Chris Collins (1997). and its English translation in (10):
(9) ni xihuan shei [Chinese]

The Last Resort Nature of Syntactic Movement you like who
From its inception in the early 1990s, minimalism has insisted
(10) Who do you like
on the last resort nature of movement: Movement must hap-
pen for a formal reason. he case ilter (see filters), which C.-T. Huang (1981/1982) argued that even in such languages
was a central component of the GB system, was thought to pro- there is movement, by showing that well-established locality
vide one such driving force. A standard example involves subject constraints on wh-movement, such as those of John Robert Ross
raising: (1967), also constrain the distribution and interpretation of cer-
tain seemingly unmoved wh-expressions in Chinese. his argu-
(5) John is certain [t to fail the exam]
ment was widely inluential and laid the groundwork for much
(6) It is certain [that John will fail the exam] GB and minimalist research.
Along related lines, Chomsky argued that V-raising, overt in
In (5), as in (6), John is the understood subject of fail the exam.
virtually all of the Romance languages, among others, operates
his fact is captured by deriving (5) from an underlying structure
covertly in English, as in the following examples from English
much like that of (6), except with an ininitival embedded sen-
and their translations into French:
tence instead of a inite one:
(11) a. John often kisses Mary
(7) __ is certain [John to fail the exam]
b. *John kisses often Mary
John in (7) is not in a position appropriate to any case. By raising
(12) a. *Jean souvent embrasse Marie
to the higher subject position (speciier of the higher Inl), it can
b. Jean embrasse souvent Marie
avoid a violation of the case ilter, since the raised position is one
where nominative case is licensed. But if the case requirement of he assumption is that the position of the verb vis--vis the adverb
John provides the driving force for movement, the requirement indicates whether the verb has raised overtly. For V-raising, the
will not be satisied immediately upon the introduction of that feature driving the movement is claimed to be one that resides
nominal expression into the structure, under the assumed bot- in Inl. he feature might be strong, forcing overt movement (as
tom-up derivation. Rather, satisfaction must wait until the next in French), or weak. Similarly, the feature demanding overt wh-
cycle, when a higher layer of structure is built, or, in fact, until an movement in English is a strong feature of C. he principle pro-
unlimited number of cycles later, as raising conigurations can crastinate disallows overt movement except when it is necessary
iterate: (i.e., for the satisfaction of a strong feature as in Chomsky 1993;
Lasnik 1999a).
(8) John seems [ to be certain [ to fail the exam]]
Procrastinate invited a question. Why is delaying an operation
A minimalist perspective favors an alternative whereby until LF more economical than performing it earlier? Further,
the driving force for movement can be satisied immediately. many of the hypothesized instances of covert movement do
Suppose that Inl has a feature that must be checked against not have the semantic efects (with respect to quantiier scope,
the NP. hen as soon as that head has been introduced into the anaphora, etc.) that corresponding overt movements have, as
structure, it attracts the NP or DP that will check its feature. discussed by Lasnik (1999b, Chapters 6 and 8). To address these
Movement is then seen from the point of view of the target, rather questions, Chomsky (2000; 2001) argues for a process of agree-
than the moving item itself. he case of the NP does get checked ment (potentially at a substantial distance) that relates the two
503
Minimalism
items that need to be checked against each other. Many of the representational theory. As the derivation proceeds, always
phenomena that had been analyzed as involving covert move- merging together pairs of items, sisterhood and domination are
ment are reanalyzed as involving no movement at all, just the the only immediately available primitives. And X (asymetrically)
operation Agree (though Huangs argument indicates that there c-commands Y if and only if Y is dominated by the sister of X.
are at least some instances of covert movement). Overt phrasal hese notions are illustrated in (13), where B and C are sisters as
movement (such as subject raising) is then seen in a diferent are D and E, A dominates B, C, D, and E, and C dominates D and
light: It is not driven by the need for case or agreement features E. B asymmetrically c-commands D and E.
to be checked (since that could take place via Agree). Instead, it
(13) A
takes place to satisfy the requirement of certain heads (includ-
ing Inl) that they have a speciier (in the X-bar theoretic sense).
B C
Such a requirement was already formulated by Chomsky (1981),
and dubbed the extended projection principle (EPP) in Chomsky
D E
(1982). To the extent that long distance A-movement (basically,
movement to a higher subject position) as in (8) proceeds suc- Multiple Spell-Out efectively deals with a range of reconstruc-
cessive-cyclically through each intermediate subject position, tion phenomena. For example, an anaphor normally requires an
the EPP is motivated, since, as observed earlier, these intermedi- antecedent that c-commands it:
ate positions are not case-checking positions.
An important question at this point is why language has the (14) John criticized himself
seeming imperfection of movement processes at all. We can (15) *Himself criticized John
distinguish two major types of movement, phrasal movement
and head movement. Chomsky conjectures that phrasal move- But when the anaphor is fronted from a position c-commanded
ment is largely to convey topic-comment information (and pos- by an antecedent to a position not in that structural relation, the
sibly to make scope relations more transparent), and that the anaphoric connection is nonetheless possible:
EPP is the way the computational system formally implements (16) Himself, John criticized
this. V-movement, on the other hand, is conjectured to have
PF motivation (guaranteeing that the Inl aix will ultimately his follows straightforwardly if anaphora can be interpreted
be attached to a proper host, V), and may even be a PF process. prior to movement.
Another possibility is that movement is simply a generalization Chomsky has also explored another kind of approach to
of the merge operation combining smaller structures into larger reconstruction, based on a condition that he calls Inclusiveness
ones. Given that merge is ineliminable, perhaps move is not an Chomsky (1995a). his condition demands that a syntactic deri-
imperfection after all. vation merely combine elements of the numeration. No new
entities can be created. Traces, as traditionally conceived, vio-
Syntactic Interfaces late this condition. Chomsky therefore concludes that a trace of
he connection between syntactic derivation and semantic and movement is actually a copy of the item that moved, rather than
phonological interfaces has long been a central research area. In a new sort of entity. his is yet another return to earlier genera-
minimalism, interpretation could be distributed over many struc- tive approaches (wherein movement was seen as a compound of
tures in the course of transformational cycles. Already decades copying and deletion). he copy left behind is normally deleted
ago, Joan W. Bresnan (1971) argued that the rule responsible for in the phonological component (though Boskovic 2001 pres-
assigning English sentences their intonation contour applies ents arguments that under certain circumstances, lower copies
following each cycle of transformations, rather than at the end are pronounced in order to rescue what would otherwise be
of the syntactic derivation. Ray Jackendof (1972) put forward PF violations) but could persist for semantic purposes, such as
similar proposals for semantic phenomena involving scope and the licensing of anaphoric connection. Danny Fox (2000) pres-
anaphora. Chomsky (2000, 2001) argues for a general instanti- ents an analysis of scope and anaphora reconstruction efects in
ation of this distributed approach, sometimes called Multiple terms of the copy theory.
Spell-Out, based on Epstein (1999) and Uriagereka (1999). An inluential research line, initiated by Richard S. Kayne
At the end of each cycle (or phase in Chomskys more recent (1994), extends the impact of c-command to PF as well. Kayne
work), the syntactic structure thus far created can be encapsu- hypothesizes that the linear order that is manifest in PF (as it must
lated and sent of to the interface components for phonological be, given properties of the phonetic system) comes about via his
and semantic interpretation. hus, even the levels of PF and LF linear correspondence axiom (LCA), which states that asym-
fade away. Samuel D. Epstein argues that such a move represents metric c-command is mapped onto PF linear order. his has the
a conceptual simpliication (in the same way that elimination of far-reaching consequence that structures must always be right-
D-structure and S-structure did), and both Juan Uriagereka and branching. Subject-verb-object (SVO) languages like English are
Chomsky provide empirical justiication. he role of syntactic broadly consistent with this requirement, but subject-object-verb
derivation, always important in Chomskian theorizing, becomes (SOV) languages like Japanese are not. Kaynes antisymmetry
even more central on this view. Epstein reasons that the central- approach reanalyzes SOV languages as underlyingly SVO (as all
ity of (asymmetric) c-command (as opposed to one of a whole languages must be by this hypothesis), with the SOV order derived
range of other conceivable geometric relations) in syntax is by (leftward) movement. One crucial unanswered question is the
predicted on this strongly derivational view, but not in a more source of the driving force for the required movements.
504
Minimalism Mirror Systems, Imitation, and Language
Conclusion Kayne, Richard S. 1994. he Antisymmetry of Syntax. Cambridge, MA: MIT

Chomsky constantly emphasizes that minimalism is as yet still Press.
Lasnik, Howard. 1995. Case and expletives revisited: On Greed and
just an approach, a set of questions and a conjecture about
other human failings. Linguistic Inquiry 26: 61533. Repr. in Lasnik
how human language works (perfectly), and a general program
1999b, 7496.
for exploring the questions and developing the conjecture. he
. 1999a. On feature strength: hree minimalist approaches to overt
descriptive and explanatory success attained thus far gives some movement. Linguistic Inquiry 30: 197217. Repr. Howard Lasnik,
reason for optimism that the approach can be developed into an Minimalist Investigations in Linguistic heory (London; Routledge,
articulated theory of human linguistic ability and of why it has 2003), 83102.
the exact properties it does. . 1999b. Minimalist Analysis. Oxford: Blackwell.
Lasnik, Howard, and Juan Uriagereka, with Cedric Boeckx. 2005. A Course
Howard Lasnik
in Minimalist Syntax: Foundations and Prospects. Oxford: Blackwell.
Ross, John Robert. 1967. Constraints on Variables in Syntax. Ph.D. diss.,
WORKS CITED AND SUGGESTIONS FOR FURTHER READING Massachusetts Institute of Technology. Published as Ininite Syntax!
(Norwood, N.J.: Ablex, 1986).
Boeckx, Cedric. 2006. Linguistic Minimalism: Origins, Methods, Concepts,
Uriagereka, Juan. 1998. Rhyme and Reason: An Introduction to Minimalist
and Aims. Oxford: Oxford University Press.
Syntax. Cambridge, MA: MIT Press.
Boskovic, Zeljko. 2001. On the Nature of the Syntax-Phonology
. 1999. Multiple spell-out. In Working Minimalism, ed. Samuel
Interface: Cliticization and Related Phenomena. Amsterdam: Elsevier
David Epstein and Norbert Hornstein, 25182. Cambridge, MA: MIT
Science.
Press.
Bresnan, Joan W. 1971. Sentence stress and syntactic transformations.
Language 47: 25781.
Chomsky, Noam. 1955. he logical structure of linguistic theory. MIRROR SYSTEMS, IMITATION, AND LANGUAGE
Manuscript, Harvard University and Massachusetts Institute of
Technology. Revised 1956 version published in part by Plenum, New Any normal child reared in human society will acquire lan-
York, 1975 and by University of Chicago Press, Chicago, 1985. guage. Some argue that basic structures of grammar are innate,
. 1981. Lectures on Government and Binding. Dordrecht, the so that the child need simply hear a few sentences to set the
Netherlands: Foris. parameter for each key principle of the grammar of his or her
. 1982. Some Concepts and Consequences of the heory of Government irst language (Baker 2001; Chomsky and Lasnik 1993). Others
and Binding. Cambridge, MA: MIT Press. have argued that the modern child receives rich language stimuli
. 1986. Knowledge of Language. New York: Praeger. within social interactions in learning these key principles. In any
. 1993. A minimalist program for linguistic theory. In The View
case, the child must acquire the particular sounds (phonol-
from Building 20: Essays in Linguistics in Honor of Sylvain Bromberger,
ogy) of the language, an ever-increasing stock of words, and
ed. Kenneth Hale and Samuel J. Keyser, 152. Cambridge, MA: MIT
constructions for arranging words to compound their meanings.
Press. Repr. in Chomsky 1995b, 167217.
. 1995a. Categories and transformations. In he Minimalist he infant acquiring maternal phonology does not imitate the
Program, 219394. Cambridge, MA: MIT Press. caregiver (Y. Yoshikawa and colleagues [2003] model how the
. 1995b. he Minimalist Program. Cambridge, MA: MIT Press. process may use associative learning), but learning how to put
. 2000. Minimalist Inquiries: the framework. In Step by Step: Essays sounds together to form a word that achieves the childs commu-
on Minimalist Syntax in Honor of Howard Lasnik, ed. Roger Martin, nicative goal seems to involve imitation. Imitation also lies at the
David Michaels, and Juan Uriagereka, 89155. Cambridge, MA: MIT heart of the acquisition of syntax and semantics (see syntax,
Press. acquisition of; semantics, acquisition of). Even within
. 2001. Derivation by phase. In Ken Hale: A Life in Language, ed. the principles and parameters approach, the child must imitate
Michael Kenstowicz, 152. Cambridge, MA: MIT Press.
words and combinations, as well as set parameters, to come to
. 2004. Beyond explanatory adequacy. In Structures and Beyond
speak the language (see principles and parameters the-
the Cartography of Syntactic Structure. Vol. 3. Ed. Adriana Belletti,
10431. Oxford: Oxford University Press.
ory and language acquisition).
. 2005. hree factors in language design. Linguistic Inquiry Monkeys have little or no capacity for imitation, and apes
36: 122. (chimpanzees, gorillas, bonobos, orangutans) have a capac-
Chomsky, Noam, and Howard Lasnik. 1993. he theory of principles and ity for simple imitation, whereas humans are the only primates
parameters. In Syntax: An International Handbook of Contemporary capable of complex imitation. We describe these forms of imita-
Research. Vol. 1. Ed. Joachim Jacobs, Arnim von Stechow, Wolfgang tion, then argue that increasing imitative skills, and the relation
Sternefeld, and heo Vennemann, 50669. Berlin: Walter de Gruyter. of mirror neurons to these imitative skills, were at the heart of the
Reprinted in Chomsky 1995b, 13127. evolution of the language-ready brain.
Collins, Chris. 1997. Local Economy. Cambridge, MA: MIT Press.
Epstein, Samuel D. 1999. Un-principled syntax: he derivation of syn- Simple and Complex Imitation
tactic relations. In Working Minimalism, ed. Samuel D. Epstein and M. Myowa-Yamakoshi and T. Matsuzawa (1999) observed that
Norbert Hornstein, 31745. Cambridge, MA: MIT Press.
chimpanzees took 12 or so trials to learn to imitate a behavior in
Fox, Danny. 2000. Economy and Semantic Interpretation. Cambridge,
a laboratory setting, focusing on bringing an object into relation-
MA: MIT Press.
Huang, C.-T. James. 1981/1982. Move wh in a language without wh-
ship with another object or the body, rather than the actual move-
movement. Linguistic Review 1: 369416. ments involved. R. W. Byrne and J. M. E. Byrne (1993) found that
Jackendof, Ray. 1972. Semantic Interpretation in Generative Grammar. gorillas learn complex feeding strategies but may take months to
Cambridge, MA: MIT Press. do so. Consider eating nettle leaves. Skilled gorillas grasp the stem
505
Mirror Systems, Imitation, and Language
Parietal Lobe
Monkey [Not to scale] Human
Wernickes
A rea Occipital
Frontal
Lobe Lobe
Brocas
A rea
Homology Temporal Lobe
Figure 1. A comparative side view of the monkey brain (left) and human brain (right), not to scale. The left view
emphasizes premotor area F5; the right view emphasizes Brocas area and Wernickes area, considered crucial for
language processing. F5 and Brocas area are considered homologous.
irmly, strip of leaves, remove petioles bimanually, fold leaves area, traditionally thought of as a speech area but which has been
over the thumb, pop the bundle into the mouth, and eat. Teaching shown by brain imaging studies (see neuroimaging) to be active
is virtually never observed in apes (Caro and Hauser 1992), and when humans both execute and observe grasps. It is posited that
the young seem to look at the food, not at the methods of acquisi- the mirror system for grasping was also present in the common
tion (Corp and Byrne 2002). Moreover, chimpanzee mothers sel- ancestor of humans and monkeys (perhaps 20 million years ago)
dom, if ever, correct and instruct their young (Tomasello 1999). and that of humans and chimpanzees (perhaps 5 million years
he challenge for acquiring such skills is compounded because ago). Moreover, the mirror neuron property accords well with the
the sequence of atomic actions varies greatly from trial to trial. parity requirement for language that what counts for the speaker
Byrne (2003) implicates imitation by behavior parsing, a pro- must count approximately the same for the hearer. In addition,
tracted form of statistical learning whereby certain subgoals (e.g., normal face-to-face speech involves manual and facial as well as
nettles folded over the thumb) become evident from repeated vocal gestures and, moreover, signed languages are fully devel-
observation as being common to most performances. Apparently, oped human languages (see sign language). hese indings
the young ape, over many months, may acquire the skill by com- ground the Mirror System Hypothesis (Arbib and Rizzolatti 1997;
ing to recognize the relevant subgoals and derive action strategies Rizzolatti and Arbib 1998):
for achieving subgoals by trial and error.
he parity requirement for language in humans is met because
he ability to learn the overall structure of a speciic feeding
Brocas area evolved atop the mirror system for grasping, which
behavior over many, many observations, however, is very diferent
provides the capacity to generate and recognize a set of actions.
from the human ability to understand any sentence of an open-
ended set as it is heard and to generate another novel sentence Recent work (see Arbib 2005 for a review and commentaries
as an appropriate reply. In many cases, humans need just a few on current controversies) has elaborated the hypothesis, dein-
trials to make sense of a relatively complex behavior and can then ing an evolutionary progression of seven stages, S1 through S7:
repeat it under changing circumstances, if the constituent actions
(S1) Cortical control of hand movements.
are familiar and the subgoals these actions must achieve are read-
ily discernible. (he next section places this facility for complex (S2) A mirror system for grasping, shared with the common
imitation in an evolutionary and neurological perspective.) It is ancestor of human and monkey.
interesting to note that even newborn infants can perform certain A mirror system does not provide imitation in itself. A monkey
acts of imitation, but this capacity for neonatal imitation such with an action in its repertoire may have mirror neurons active
as poking out the tongue on seeing an adult poke out a tongue both when executing and observing that action. he monkey
(Meltzof and Moore 1977) is quantitatively diferent from that does not repeat the observed action nor, crucially, does it use
for complex imitation (see communication, prelinguistic). observation of a novel action to add that action to its repertoire.
Nonetheless, the mirror system may serve the monkey well both
The Mirror System Hypothesis in providing feedback during close observation of handobject
he system of the macaque brain for visuomotor control of grasp- relations during dextrous actions and in allowing its recognition
ing has its premotor outpost in an area called F5 (Figure 1 left), of others actions to inluence social behavior. In any case, the
which contains a set of neurons, mirror neurons, such that each data on primate imitation support the hypothesis that a monkey-
one is active not only when the monkey executes a speciic grasp like mirror system becomes embedded in more powerful sys-
but also when the monkey observes a human or other monkey tems in the next two stages of evolution.
execute a more-or-less similar grasp (Rizzolatti et al. 1996). hus,
(S3) A simple imitation system for grasping, shared with
macaque F5 contains a mirror system for grasping that employs
common ancestor of human and apes.
a similar neural code for executed and observed manual actions.
he homologous region of the human brain is in or near Brocas (S4) A complex imitation system for grasping.
506
Mirror Systems, Imitation, and Language
Each of these changes can be of evolutionary advantage in categories. his is supported by the observation that nouns are often
supporting the transfer of novel skills among the members marked for case, number, gender (see gender marking), size,
of a community, though involving praxis rather than explicit shape, deiniteness, and possession, while verbs are often marked
communication. for tense, aspect, mood, modality, transitivity, and agree-
M. A. Arbib, K. Liebal, and S. Pika (2008) summarize data ment. On the latter view, once protolanguage was established, dif-
suggesting that manual gestures have greater openness than ferent peoples developed (and later shared) diferent strategies for
vocalizations in nonhuman primates. Monkey vocalizations are talking about things and actions and then developed these strat-
innately speciied (though occasions for using a call may change egies in diverse ways to talk about more and more of their world.
with experience), whereas a group of apes may communicate his view is based on the fact that there are further aspects of lan-
with novel gestures. M. Tomasello and colleagues (1997) argue guage diversity hard to reconcile with natural selection of brain
that novel gestures may develop through ontogenetic ritualiza- mechanisms. Some languages, like Vietnamese, lack all inlection,
tion, wherein repeated interaction between two individuals precluding the use of inlectional criteria for identifying grammati-
establishes a conventionalized form of an action as a signal for cal categories; other languages employ inlection in unusual ways.
the action for example, a beckoning movement may become For example, the language of the Makah of the northwestern coast
recognized as short for the physical action of pulling the other of the United States applies aspect and mood markers not only to
toward oneself. hese gestures may then be propagated by social words for actions that are translated into English as verbs but also
learning. his supports the hypothesis that it was gesture rather to words for things and properties.
than primate vocalizations that created the opening for Complex imitation has two parts: i) the ability to perceive that
greatly expanded gestural communication once complex imi- a novel action may be approximated by a composite of known
tation had evolved for practical manual skills. R. M. Seyfarth, actions associated with appropriate subgoals, and ii) the abil-
D. L. Cheney, and T. J. Bergman (2005) advance the opposing ity to employ this perception to perform an approximation of
view, but the Mirror System Hypothesis postulates that evolution the observed action, which may then be reined through prac-
proceeded via the next two stages: tice. Both parts come into play when the child is learning a lan-
guage; the former predominates in adult use of language as the
(S5) Protosign, a manual-based communication system
emphasis shifts from mastering novel words and constructions
breaking through the ixed repertoire of primate vocalizations
to inding the appropriate way to continue a dialogue.
to yield an open repertoire.
(S6) Protolanguage as Protosign and Protospeech: an expand- Michael A. Arbib
ing spiral of conventionalized manual, facial, and vocal com-
municative gestures. WORKS CITED AND SUGGESTIONS FOR FURTHER READING
he transition from complex imitation and the small reper- Arbib, M. A. 2005. From monkey-like action recognition to human lan-
toires of ape gestures (perhaps 10 or so novel gestures shared guage: An evolutionary framework for neurolinguistics (with com-
by a group) to protosign involves pantomime, irst of grasp- mentaries and authors response). Behavioral and Brain Sciences
ing and manual praxic actions, then of nonmanual actions 28: 10567.
(e.g., lapping the arms to mime the wings of a lying bird). Arbib, M. A., K. Liebal, and S. Pika. 2008. Primate vocalization, ges-
Pantomime transcends the slow accretion of manual gestures ture, and the evolution of human language. Current Anthropology
by ontogenetic ritualization, providing an open semantics 59.6: 105376.
for a large set of novel meanings (Stokoe 2001). However, such Arbib, M. A., and G. Rizzolatti. 1997. Neural expectations: A possible
evolutionary path from manual skills to language. Communication
pantomime is ineicient both in the time taken to produce it
and Cognition 29: 393424.
and in the likelihood of misunderstanding. Conventionalized
Baker, M. 2001. he Atoms of Language: he Minds Hidden Rules of
signs extend and exploit more eiciently the semantic rich- Grammar. New York: Basic Books.
ness opened up by pantomime. Processes like ontogenetic Byrne, R. W. 2003. Imitation as behavior parsing. Philosophical
ritualization can convert elaborate pantomimes into a con- Transactions of the Royal Society of London (B) 358: 52936.
ventionalized shorthand, just as they do for praxic actions. Byrne, R. W., and J . M. E. Byrne. 1993. Complex leaf-gathering skills of
his capability for protosign rather than elaborations intrin- mountain gorillas (Gorilla g. beringei): Variability and standardiza-
sic to the core vocalization systems may then have provided tion. American Journal of Primatology 31: 24161.
the essential scafolding for protospeech and evolution of the Caro, T. M., and M. D. Hauser. 1992. Is there teaching in nonhuman ani-
human language-ready brain. mals? Quarterly Review of Biology 67: 15174.
Chomsky, N., and H. Lasnik. 1993. he theory of principles and parame-
(S7) Language: the development of syntax and composi- ters. In Syntax: An International Handbook of Contemporary Research,
tional semantics. I: 50669. Berlin: de Gruyter.
Corp, N., and R. W. Byrne. 2002. Ontogeny of manual skill in wild chim-
he inal stage the transition from protolanguage to lan-
panzees: Evidence from feeding on the fruit of saba lorida. Behavior
guage may have rested primarily on biological evolution (Pinker
139: 13768.
and Bloom 1990), but may instead result from cultural evolution Kemmerer, D. 2005. Against innate grammatical categories. Behavioral
(historical change) alone (Arbib 2005; Kemmerer 2005). On the and Brain Sciences 28. Available online at: http://www.bbsonline.org/
former view, the brain might have innate biological mechanisms Preprints/Arbib-0501 2002/Supplemental/.
for processing nouns and verbs, as well as principles and param- Meltzof, A. N., and M. K. Moore. 1977. Imitation of facial and manual
eters for combining them with words and morphemes of other gestures by human neonates. Science 198: 758.
507
Modality
Myowa-Yamakoshi, M., and T. Matsuzawa. 1999. Factors inluencing he modern approach to the semantics of modal logic, devel-
imitation of manipulatory actions in chimpanzees (Pan troglodytes). oped by Saul Kripke and others, is a form of truth condi-
Journal of Comparative Psychology 113: 12836. tional semantics based on possible worlds. For example,
Pinker, S., and P. Bloom. 1990. Natural language and natural selection.
a sentence of the form S is true, at a given world w, if S is
Behavioral and Brain Sciences 13: 70784.
true at every accessible world v. Diferent meanings of and
Rizzolatti, G., and M. A. Arbib. 1998. Language within our grasp. Trends
are represented by establishing diferent sets of worlds as acces-
in Neuroscience 21.5: 18894.
Rizzolatti, G., L. Fadiga, V. Gallese, and L. Fogassi. 1996. Premotor cor- sible. For example, suppose we use to represent it is morally
tex and the recognition of motor actions. Cognitive Brain Research required that; then, at any world w, the accessible worlds are
3: 13141. those that are ideal, from the point of view of morality in w:
Seyfarth, R. M., D. L. Cheney, and T. J. Bergman. 2005. Primate social
(1) (the rich give money to the poor) is true at w if the rich give
cognition and the origins of language. Trends in Cognitive Sciences
money to the poor in every accessible world v.
9.6: 2646.
Stokoe, W. C. 2001. Language in Hand: Why Sign Came Before Speech. A world v is accessible if and only if it is a perfect world, from
Washington, DC: Gallaudet University Press. the point of view of the moral principles holding in w.
Tomasello, M. 1999. he human adaptation for culture. Annual Review
he semantics of is given by replacing every with some in (1).
of Anthropology 28: 50929.
Tomasello, M., J. Call, J. Warren, T. Frost, M. Carpenter, and K. Nagell.
LINGUISTIC THEORIES BASED ON POSSIBLE WORLDS. Most lin-
1997. he ontogeny of chimpanzee gestural signals. In Evolution
of Communication, ed. S. Wilcox, B. King, and L. Steels, 22459. guistic theories of modal semantics are based on possible worlds.
Amsterdam and Philadelphia: John Benjamins. For example, Angelika Kratzer (1981) reines the approach by
Yoshikawa, Y., M. Asada, K. Hosoda, and J. Koga. 2003. A constructivist deining the set of accessible worlds in terms of two conversa-
approach to infants vowel acquisition through mother-infant interactional backgrounds. According to Kratzer, the conversational
tion. Connection Science 15: 24558. backgrounds for (2) are i) relevant facts and ii) moral principles.
Simplifying somewhat:
MODALITY (2) he rich must give money to the poor is true at w if the rich
give money to the poor in every world v which is i) consis-
Deinition
tent with the relevant facts in w and ii) as good as possible
In its broadest sense, this term encompasses all means by which
from the point of view of relevant moral principles in w.
we can talk about hypothetical situations. he conception of
modality includes the following, plus more:
NON-TRUTH CONDITIONAL THEORIES. Many philosophers and
(1) Expressions of necessity and possibility, in any sense linguists have argued that epistemic modals (see the follow-
of these terms (English examples: necessary, possible, must, ing) lack truth conditions. Instead, they are said to indicate the
may). speakers level of commitment to what he or she is saying (e.g.,
(2) Expressions of knowledge, belief, desire, and so on (know, Palmer 2001). Dynamic modal logic (Groenendijk, Stockhof, and
believe, want, must). Veltman 1996) combines ideas from possible worlds semantics
with a non-truth conditional analysis of epistemic modality. he
(3) Expressions used to indicate how strongly the speaker is
fundamental semantic concept of dynamic logic is update poten-
committed to what he or she is saying (perhaps, might).
tial, the capacity of a sentence to afect an information state (for
(4) Expressions used to say that some action is obligatory or example, someones knowledge state or the information shared
permissible (have [to], must, may, allowed, permit). in a conversation). Although the update potential of some sen-
(5) Conditional sentences (If then ). tences can be deined in terms of truth conditions, that of an epi-
stemic sentence cannot be.
here is a range of narrower senses of the term, each used to
describe a grammatical category or set of related categories. For
FUNCTIONAL THEORIES. functional linguistics has
example, English can express necessity, possibility, obligation,
made important contributions to our understanding of the
permissibility, and ability (and other concepts) by means of a
history (e.g., Traugott and Dasher 2002) and typology (e.g.,
grammatically special set of auxiliary verbs (must, may, should,
Bybee, Perkins, and Pagliuca 1994) of modality. cognitive
can, etc.). So, when studying English, it is reasonable to deine
linguistics ofers a theory of modality based on metaphor
modality as the range of meanings expressed by these verbs. But
(e.g., Talmy 1988).
other languages do not have this grammatical category, and so
its also reasonable to deine modality diferently when studying
Varieties of Modality
these languages.
SENTENTIAL MODALITY. Most linguists take as the central cases of
modality examples in which some expression combines with a
Semantic Theories
nonmodal sentence, making it modal. For example, English must
MODAL LOGIC. Much research on modality in natural language
can be analyzed as: must + (the rich give money to the poor). here
has been inspired by modal logic (see Blackburn, de Rejke, and
are many distinct subtypes of sentential modality, including:
Venema 2001 for a brief history). Modal logic typically has two
modal operators, (necessarily, must, or obligatory) and (1) Deontic modality: having to do with rules, including
(possibly, may, permissible), which attach to sentences. morality and law (example: Criminals must be punished).
508
Modality Modern World-System, Language and the
(2) Epistemic modality: having to do with knowledge (It MODERN WORLD-SYSTEM, LANGUAGE AND THE
must be raining).
Ever since the modern world-system came into existence in the
(3) Subjective modality: having to do with the speakers long sixteenth century, language has been a primary political
point of view (overlapping with 1 and 2). concern and a locus of major political struggle. In particular,
the issue of the language or languages that one will require to be
(4) Dynamic modality: having to do with ability or the laws of
learned and used has been a subject of decisions by states in
the natural world (Ducks can swim).
their constitutions, their legislation, and/or their executive poli-
Much work in syntax has studied the representation of sen- cies (see language policy).
tential modality. Two important issues are the extent to which In the modern world-system, all included territory falls within
modality is represented in the same ways across languages and the jurisdiction of individual states. All states are linguistically
whether diferent subtypes are realized in diferent grammatical heterogeneous in the languages used within households, though
positions (e.g., Cinque 1999). some much more so than others. Seeking to be a strong state,
most states have proclaimed an oicial language, meaning that
SUBSENTENTIAL MODALITY. Broad deinitions of modality will laws are written, governmental processes conducted, and edu-
include verbs and adjectives such as know and likely. hey will cation ofered in the oicial language. Sometimes, but rarely,
also include mood, the category of expressions that relect the it means that no other language may be used in public locales,
presence of modal meaning in the sentence but that do not intro- including in signage.
duce modal meaning themselves (for example, indicative and Some states have had more than one oicial language, and
subjunctive verb forms; e.g., Farkas 1985). some distinguish between the oicial language and one or more
national languages (which have some more restricted legal
DISCOURSE MODALITY. Some varieties of modality operate at the
rights). As an overall rule, almost every state has been pressed to
discourse level. evidentials are forms indicating the speakers
adopt a single oicial language. he usual argument, aside from
source or quality of information (e.g., Willett 1988). he concept
the convenience, is that a single language favors national inte-
of illocutionary force is connected to modality as well
gration, part of a process of turning a state into a nation-state.
(e.g., imperative sentences direct the addressee to perform a
Integration is particularly an issue when there are large immi-
hypothetical action). Discourse modality overlaps with senten-
grant groups who speak a diferent language.
tial and subsentential modality. For example, in some languages
In many states, speakers of so-called minority languages, in
the subjunctive mood can operate as an imperative.
the name of cultural rights, resist eforts to impose a single oi-
Paul Portner cial language. In particular, they demand the right to use other
languages in governmental business and schools. Whether
WORKS CITED AND SUGGESTIONS FOR FURTHER READING states yield to such demands is largely a question of the inter-
Blackburn, Patrick, Maarten de Rejke, and Yde Venema. 2001. Modal nal balance of power and demographic strength of the dominant
Logic. Cambridge: Cambridge University Press. linguistic group, as well as the degree of support a minority lan-
Bybee, Joan, Revere Perkins, and William Pagliuca. 1994. he Evolution of guage may have from powerful neighboring states in which this
Grammar: Tense, Aspect, and Modality in the Languages of the World. states minority language is the neighbors majority language. In
Chicago: University of Chicago Press. multilingual states, there is often social resistance by the users of
Cinque, Guglielmo. 1999. Adverbs and Functional Heads: A Cross- the language second in strength to learning well and using the
Linguistic Perspective. Oxford: Oxford University Press.
primary language.
Farkas, Donka. 1985. Intensional Descriptions and the Romance
he problem is compounded beyond the boundaries of a sin-
Subjunctive Mood. New York: Garland.
Garson, James. 2007. Modal logic. In The Stanford Encyclopedia
gle state. Strong regional powers favor the learning of their lan-
of Philosophy (summer ed.), ed. Edward N. Zalta. Available guage by states that they consider to fall within their orbit. hey
online at: http://plato.stanford.edu/archives/sum2007/entries/ use direct political pressure, the beneits of economic ties, or
logic-modal/. cultural liaison. Adoption of particular alphabetic or ideographic
Groenendijk, Jeroen, Martin Stockhof, and Frank Veltman. 1996. systems also favors the dominant regional power. If a small state
Coreference and modality. In he Handbook of Contemporary breaks politically with a regional power and allies itself with
Semantic heory, ed. S. Lappin, 179213. Oxford: Blackwell. another world power, it often seeks to demonstrate and cement
Kratzer, Angelika. 1981. he notional category of modality. In Words, the new ties by adopting a new secondary language or (if rele-
Worlds, and Contexts, ed. H.-J. Eikmeyer and H. Rieser, 3874.
vant) changing the alphabetic system.
Berlin: de Gruyter.
It is at the world or continental level that the issue becomes
Palmer, F. 2001. Mood and Modality. Cambridge: Cambridge University
most contentious. here are practical beneits in using as few lan-
Press.
Portner, P. 2009. Modality. Oxford: Oxford University Press. guages as possible inancial costs, ease of communication, and
Talmy, Leonard. 1988. Force dynamics in language and cognition. savings in the time and efort required for either translation or
Cognitive Science 12: 49100. interpretation. However, the political implications of eliminating
Traugott, Elizabeth, and Richard Dasher. 2002. Regularity in Semantic a particular language as a legitimate option in interstate commu-
Change. New York: Cambridge University Press. nication are very large. he United Nations now has six oicial
Willett, homas. 1988. A cross-linguistic survey of the grammaticaliza- languages. Two are working languages English and French.
tion of evidentiality. Studies in Language 12.1: 5197. he inclusion of French has been the result of continuing and
509
Modularity
very strong political pressure from France. he European Union the central system responsible for problem solving and abstract
has decided that any oicial language of a member state may be thought. Modularity is in most striking contrast with theories
used. Since this number is very large, and the costs of translating, such as connectionism that treat cognition as the emergent
for example, Maltese into Finnish are enormous, the result has product of an unstructured neural network.
been a creeping usage of English as the de facto but not de jure heories of modularity come in a variety of lavors with some-
oicial language. times incompatible properties, making evaluation of the general
In the history of the modern world-system, as Latin fell out hypothesis diicult. I begin with the best-known example Jerry
of diplomatic usage, French took its place. Since 1945, given the Fodors (1983) Modularity of Mind and then contrast it with
hegemony of the United States in the world-system, English has alternative views.
displaced French. he story in international scientiic discourse For Fodor, modules, or input systems, convert sensory
is diferent. In the nineteenth century, German was the favored inputs into representations on which the central system of
lingua franca. After 1918 and especially after 1945, because of the mind can operate. Incoming stimuli of a visual, auditory, tac-
defeats on the battleield, it lost this status. Before 1939, at an tual, or other kind are converted into a form that, in conjunction
international scholarly congress, participants felt free to deliver with knowledge drawn from memory, is adequate for problem
their papers in English, French, German, and usually Italian as solving or the ixation of belief. Such beliefs are typically nei-
well. here was normally no translation, and it was assumed that ther complex nor profound: Hearing a whining noise and seeing
scholars could understand at least three of the four languages. a wagging tail may activate enough encyclopedic knowledge to
After 1945, international scholarly organizations dropped make you ix the belief that the dog wants to go out.
German and Italian entirely. In the 50 years since then, the use of Fodor argued that input systems (corresponding to the
French has declined but is still permitted, and Spanish has joined senses) all share a number of properties. Each has a speciic
French as a permitted but seldom-used language. he inclusion domain of operation (vision, audition, and so on); they act fast
of Spanish is the direct result of the fact that there are 19 states in and mandatorily (you have no choice but to see a dog as a dog);
which it is an oicial language. they are subserved by dedicated neural architecture and, hence,
In commerce, there have always been lingua francas. Anyone are subject to idiosyncratic pathological breakdown (you can be
going to a local market in a major center of a country in the global blind without being deaf, and vice versa); they are innately deter-
South will see merchants capable of conducting their business in mined (hence, universal and uniform across the species); and,
multiple relevant languages. If one looks at discussions among most importantly, they are informationally encapsulated (that
personnel of large corporations, there has been an increasing is, the operation of the input systems proceeds independently
tendency to use English. Nonetheless, it is probably still the of information stored in memory). You may know that railway
case that the ability to use a widely spoken (oicial) language lines dont really converge in the distance but your visual system
other than English is an advantage to persons doing business in still makes them look as if they do. Fodor then suggested that any
countries outside the English linguistic zone. In commerce, the system that shared the properties of the sensory input systems
decision on linguistic use is less a matter of coercion than of opti- was by deinition a module, with the result that language was
mizing the ability to engage in proitable business. included as a module just like vision.
Finally, we should notice the consequences for the geocul- his claim highlights a radical distinction between Fodors
ture of the world-system of the existence of dominant languages. version of modularity and Noam Chomskys earlier one (1975)
he widespread use of English in the twenty-irst century is very that treats modules as knowledge structures, rather than as pro-
advantageous for native English speakers. It is not merely conve- cessing systems. he language faculty is a system of knowledge
nient but tends to turn English linguistic eyes into world linguis- that can be accessed by both input and output systems: We pro-
tic eyes. It also, however, has its negative side for native English duce as well as understand language. A further diference is that
speakers. hey are often the only ones cut of from the internal Fodor is pessimistic about the possibility of saying anything inter-
communications of other linguistic zones, as well as from the esting about the structure of the inscrutable central system,
possibilities of seeing the world through other linguistic eyes. whereas Chomsky is more optimistic, suggesting that the central
It is quite possible that the increasing role of the Internet system too is modular, with moral judgment, music, and other
in communications of all kinds, along with the declining faculties all having speciic (if not localized) areas of the mind
power of the United States in the world-system, will lead to the dedicated to them. On a point of terminology, it is also impor-
reemergence of a multipolar linguistic situation, with ive to tant to note that Chomsky (and linguists more generally) use the
seven world languages that diplomats, scholars, and business term module for the various subparts of the language faculty (the
executives will feel the need to master and use. lexicon and the computational system with components such as
binding, control, movement, etc.).
Immanuel Wallerstein
It is clear that even if they share some of the Fodorian prop-
erties such as innate speciication and domain speciicity, moral
judgment and the sense of smell are radically diferent. his
MODULARITY
has led to the suggestion that we need a distinction between
Modularity is the claim that human cognition is compartmental- (Fodorian) modules and quasi-modules, or modules of the cen-
ized into a number of discrete components or modules poten- tral system (Smith and Tsimpli 1995), where these are deined
tially including vision, audition, moral judgement, theory of in terms of the properties (such as informational encapsula-
mind, and language. hese specialized modules contrast with tion) that they possess and the kind of vocabulary, perceptual or
510
Modularity Montague Grammar
conceptual, over which they are deined. An extreme version of . 2000. he Mind Doesnt Work hat Way: he Scope and Limits of
this position is the claim (cf. Sperber 2002) that the mind is mas- Computational Psychology. Cambridge MA: MIT Press.
sively modular, with everything from individual concepts like Karmilof-Smith, A. 1992. Beyond Modularity. Cambridge, MA: MIT
dog, to Fodorian modules like vision, to our general pragmatic Press.
Plomin, R., and P. Dale 2000. Genetics and early language develop-
ability to interpret utterances being modules. It is unclear what
ment: A UK study of twins. In Speech and Language Impairments
the identity criteria for a module are in such theories, and Fodor
in Children: Causes, Characteristics, Intervention and Outcome, ed.
himself is vehemently opposed to the claim (cf. Fodor 2000). D. Bishop and L. Leonard, 3551. Philadelphia: Psychology Press.
A rival view (e.g., Karmilof-Smith 1992) accepts that the Smith, N. 2003. Dissociation and modularity: Relections on language
minds structure is modular but denies that it is innately deter- and mind. In Mind, Brain and Language, ed. M. Banich and M. Mack,
mined, suggesting instead that the (adult) modular structure 87111. Mahwah, NJ: Lawrence Erlbaum. his article treats in greater
arises as a result of a process of modularization on the basis depth many of the issues discussed here.
of interaction with the environment during development. Smith, N., and I.-M. Tsimpli. 1995. he Mind of a Savant: Language-
Connectionists (e.g., Elman et al. 1996) are more radical and Learning and Modularity. Oxford, Blackwell.
deny the validity of modularity and its use of rules and represen- Sperber, D. 2002. In defense of massive modularity. In Language, Brain
tations entirely, relying instead on the ability of neural networks and Cognitive Development: Essays in Honor of Jacques Mehler, ed.
E. Dupoux, 4757. Cambridge, MA: MIT Press.
to simulate the properties of rule-based systems.
he major evidence for modularity in all its guises is (dou-
ble) dissociation. Although it is typically the case that abilities MONTAGUE GRAMMAR
and disabilities cut across domains (if youre good at one sub-
ject youre likely to be good at others, hence the possibility of Montague grammar is a theory of semantics and the syntax-
assigning people an intelligence quotient), the existence of dis- semantics interface developed by the logician Richard Montague
sociations demonstrates the intrinsic separability and autonomy (193071) and subsequently modiied and extended by linguists,
of the various components of the mind. For instance, intelligence philosophers, and logicians. Classical Montague grammar
and language may doubly dissociate. It is possible to combine had its roots in logic and the philosophy of language; it quickly
high intelligence and good language (you), low intelligence and became inluential in linguistics, and linguists played a large
good language (linguistic savants like Christopher [Smith and role in its evolution into contemporary formal semantics.
Tsimpli 1995]), high intelligence and poor (or nonexistent) lan- he most constant features of the theory over time have been
guage, as in some kinds of aphasia, and low intelligence and poor the focus on truth conditional aspects of meaning (see truth
language (as in typical Down syndrome subjects). conditional semantics), a model-theoretic conception of
Evidence for some version of innate modularity versus modu- semantics, and the methodological centrality of the principle of
larization due to interaction with the environment comes from compositionality.
the developmental trajectory of normal children. R. Plomin and
P. Dale (2000) demonstrate that when tested over time, children History
typically start with diferent abilities in the verbal and nonver- Montague was a student of Alfred Tarski (190283), a pioneer in
bal domains and then gradually converge so that their abilities the model-theoretic semantics of logic. Montague developed an
are similar across domains. his is exactly the opposite of what intensional logic with a rich type theory and a model-theoretic
one would expect on a modularization story. Similarly, connec- possible worlds semantics, incorporating certain aspects
tionist claims that modularity is unnecessary are undermined of (formal) pragmatics, including the treatment of indexical
by the implications of connectionisms uniform reliance on sta- words and morphemes like I, you and the present tense. In the
tistics. he mind exploits statistical regularities in the input dif- late 1960s, Montague turned to the project of universal grammar,
ferently in diferent domains, and Neil Smith and I.-M. Tsimpli which for him meant a theory of syntax and semantics encom-
(1995) demonstrate that connectionist models are undesirably passing both formal and natural languages.
powerful in that they can infer statistical regularities that normal Montagues idea that a natural language could be formally
humans cannot. described using logicians techniques was radical. Most logi-
Modularity, of some kind, is still the most successful theory cians considered natural languages too unruly for precise
of cognition there is. It has rivals and it has problems, but it is formalization, while most linguists either had no awareness
indispensable. of model-theoretic techniques in logic or doubted the appli-
Neil Smith cability of logicians methods to natural languages (Chomsky
1955).
WORKS CITED AND SUGGESTIONS FOR FURTHER READING At the time of Montagues work, generative grammar was
established, linguists were developing approaches to semantics,
Chomsky, N. 1975. Relections on Language. New York: Pantheon. A sem-
and the relation of semantics to syntax had become central. he
inal source for modularity.
linguistic wars between generative semantics and inter-
Elman, J., E. Bates, M. Johnson, A. Karmilof-Smith, D. Parisi, and
pretive semantics were in full swing (Harris 1993). In introdu-
K. Plunkett. 1996. Rethinking Innateness: A Connectionist Perspective
on Development. Cambridge, MA: MIT Press. A sustained alternative cing Montagues work to linguists, Barbara Partee (1973, 1975)
to modularity. and Richmond homason (1974) argued that Montagues work
Fodor, J. 1983. he Modularity of Mind. Cambridge, MA: MIT Press. he ofered some of the best aspects of both warring approaches,
classic and best-known statement of the modularity thesis. with added advantages of its own.
511
Montague Grammar Mood
The Theory and Substance of Montague Grammar a formal system; Montagues thesis was that English can be
It was the short but densely packed PTQ (he proper treatment described as an interpreted formal system.
of quantiication in ordinary English, Montague 1973) that had
Barbara H. Partee
the most impact on linguists and on the development of formal
semantics. Montague grammar has often meant PTQ and its
extensions by linguists and philosophers in the 1970s and 1980s.
But it is the broader algebraic framework of UG (Universal Bach, Emmon. 1989. Informal Lectures on Formal Semantics. New
York: State University of New York Press.
Grammar, Montague 1970) that constitutes Montagues theory
Chomsky, Noam. 1955. Logical syntax and semantics: heir linguistic
of grammar. Crucial features of that theory include the truth con-
relevance. Language 31: 3645.
ditional foundations of semantics, the algebraic interpretation of
Dowty, David, Robert E. Wall, and Stanley Peters, Jr. 1981. Introduction to
the principle of compositionality, and the power of a higher-or- Montague Semantics. Dordrecht, the Netherlands: Reidel. he classic
der typed intensional logic. textbook on Montague grammar.
Before Montague, semanticists focused on the explication of Gamut, L. T. F. 1991. Logic, Language, and Meaning. Vol. 2. Intensional
ambiguity, anomaly, and semantic relatedness; data were often Logic and Logical Grammar. Chicago: University of Chicago Press. A
subjective and controversial. he introduction of truth condi- good, rigorous introduction to Montague grammar and its logic.
tions and entailment relations as core data profoundly afected Harris, Randy Allen. 1993. he Linguistics Wars. Oxford: Oxford University
the adequacy criteria for semantics and led to a great expansion Press.
of semantic research. While some cognitively oriented linguists Lewis, David. 1970. General semantics. Synthese 22: 1867.
Montague, Richard. 1970. Universal Grammar. heoria 36: 37398.
reject the relevance of truth conditions and entailment relations
Repr. in Montague 1974, 22246.
to natural language semantics, many today seek a resolution of
. 1973. he proper treatment of quantiication in ordinary English.
meaning externalism and internalism by studying mind-
In Approaches to Natural Language, ed. K. J. J. Hintikka et al., 22142.
internal intuitions of mind-external relations, such as refer- Dordrecht, the Netherlands: Reidel. Repr. in Montague 1974, 24770.
ence and truth conditions. . 1974. Formal Philosophy: Selected Papers of Richard Montague.
In UG, Montague formalized the Fregean principle of com- Ed. R. homason. New Haven, CT: Yale University Press.
positionality as the requirement of a homomorphism between a Partee, Barbara. 1973. Some transformational extensions of Montague
syntactic algebra and a semantic algebra. he nature of the ele- grammar. Journal of Philosophical Logic 2: 50934.
ments of both the syntactic and the semantic algebras is open to . 1975. Montague grammar and transformational grammar.
variation; what is constrained by compositionality is the relation Linguistic Inquiry 6: 203300.
of the semantics to the syntax, making compositionality as rel- Partee, Barbara H., with Herman L. W. Hendriks. 1997. Montague gram-
mar. In Handbook of Logic and Language, ed. J. van Benthem and A.
evant to representational and conceptual theories of meaning as
ter Meulen, 591. Amsterdam and Cambridge, MA: Elsevier and MIT
it is to model-theoretic semantics.
Press. A fuller history and explication of Montague grammar and its
he richness of Montagues logic made possible a composi-
impact.
tional semantic interpretation of independently motivated syn- homason, Richmond. 1974. Introduction. In Formal
tactic structure (see autonomy of syntax), which was key in Philosophy: Selected Papers of Richard Montague, ed. R. homason,
overcoming the problems that underlay the linguistic wars. his 169. New Haven, CT: Yale University Press.
was well illustrated in PTQ, where a typed higher-order logic with
lambda-abstraction made it possible to interpret noun phrases
(NPs) like every man, the man, a man uniformly as semantic
MOOD
constituents (generalized quantiiers), an idea simultaneously Mood forms part of the nonspatial setting of an event, alongside
advocated by David Lewis (1970). PTQ also contained innovative modality, reality status, tense, aspect, and evidentiality.
treatments of quantification and binding, intensional tran- Mood refers to a type of speech-act, with three basic choices.
sitive verbs, phrasal conjunction, adverbial modiication, and Many languages have a special verb form marking commands,
more. Montagues type theory introduced to linguists Freges which is known as imperative mood. In Latin, the second per-
strategy of taking function-argument application as the basic son imperative dic means (you) say! and is diferent from the
semantic glue for combining meanings, giving renewed sig- statement dicis, you say. Declarative mood (sometimes called
niicance to categorial grammar. indicative) is used in statements. Many more categories tend to
Montagues logic was an intensional logic, developing be expressed in declarative clauses than in either interrogative or
Gottlob Freges distinction between sense and reference imperative. Interrogative mood occurs in questions as in West
and Rudolf Carnaps distinction between intension and Greenlandic where every question is marked with a special suix
extension, using possible world semantics to treat the phe- on verbs (Fortescue 1984, 49, 28798).
nomenon of referential opacity, pervasive in propositional In traditional uses, the notion of mood applied to sets of
attitude sentences and many other constructions (see inlectional verb forms. he Western classical tradition, based on
intentionality ). Greek and Latin, identiied three moods: indicative, subjunctive,
Details of Montagues analyses have been superseded, and imperative, which only partially correspond to the afore-
but in overall impact, PTQ was as profound for semantics as mentioned three speech-acts. Further meanings associated with
Noam Chomskys Syntactic Structures was for syntax. Emmon mood involve optative and dubitative (see Lyons 1977, 725848;
Bach (1989, 8) summed up their cumulative innovations Sadock and Zwicky 1985). Some scholars consider conditional
thus: Chomskys thesis was that English can be described as modality which marks a clause in a conditional sentence and
512
Mood Morpheme
subjunctive modality typically, a form expressing desire or and in Caddo, from North America, realis marks statements and
uncertainty on a par with moods. his is problematic since commands, while irrealis expresses future, possibility, and con-
the distinction between moods as speech-acts and clause types dition. his shows that the realisirrealis distinction is language
(which include division between main and subordinate clauses, speciic and that it is distinct from mood (see Mithun 1999,
where conditional forms would be used) is blurred. he intro- 17880). Mood, modality, and reality status are distinct from evi-
duction of interrogative mood into the system is largely due to dentiality (q.v.) whose primary meaning is information source.
the existence of languages that have an overtly marked verbal Mood is often an obligatory inlectional category of the verb,
paradigm used for the interrogative speech-act, as in a number marked with aixes (suixes or preixes, rarely inixes); it is never
of languages of Amazonia. Further formal distinctions between expressed derivationally. In languages of an isolating proile,
moods as clause types involve prosody and constituent order. mood can be expressed through particles. Modalities are not
Both imperative and interrogative are characterized by a typi- obligatory and, thus, do not constitute part of an inlectional par-
cal intonation contour. Imperatives often have fewer catego- adigm. Modal verbs express modalities rather than moods (this
ries than corresponding declaratives. he English imperative is is the case in English and many other familiar Indo-European
perhaps the simplest form in the language: It consists of the base languages).
form of the verb without any tense inlection, whose subject Forms of mood marking can develop additional meanings
typically, the addressee can be and often is omitted. In con- overlapping with modalities. Imperative forms can be used to
trast, many languages of North and South America and Siberia express optative and conditional, while indicative forms may
distinguish delayed versus immediate imperatives and proximal develop overtones of certainty (associated with epistemic modal-
versus distal imperatives. he universal property of imperatives ity). Indicative forms for instance, future can be used as com-
is having the second person as subject, of a transitive or intran- mand strategies, with diferences in illocutionary force.
sitive verb (Dixon 1994, 13142). A prototypical imperative is
Alexandra Y. Aikhenvald
agentive, and this is why in numerous languages imperative can-
not be formed on passive and stative verbs. Other moods do not
have such restrictions. Imperatives directed at the irst person
(e.g., Lets go!), also known as hortatives, are often expressed dif- Aikhenvald, Alexandra Y. 2004. Evidentiality. Oxford: Oxford University
ferently from second person imperatives. Imperatives directed at Press.
the third person (e.g., Long live the king!), also known as jussives, Chung, Sandra, and Alan Timberlake. 1985. Tense, aspect and mood.
In Language Typology and Syntactic Description. Vol. 3: Grammatical
may share similarities with irst person imperatives, or have
Categories and the Lexicon. Ed. Timothy Shopen, 20258.
properties diferent from all other imperative forms. Further,
minor moods include exclamative (as in hats so tacky!) and Dixon, R. M. W. 1994. Ergativity. Cambridge: Cambridge University
expressive types, such as imprecatives (or curses, often cast as Press.
commands but without a command meaning). Elliott, Jennifer R. 2000. Realis and irrealis: Forms and concepts of the
Mood interacts with modality, understood as a means used grammaticalisation of reality. Linguistic Typology 4: 5590.
by the speaker to express his or her opinion or attitude towards Fortescue, Michael. 1984. West-Greenlandic. Beckenham, UK: Croom
the proposition that the sentence expresses or the situation Helm.
that the proposition describes (Lyons 1977, 452). Expressions Jespersen, Otto. 1924. he Philosophy of Grammar. London: George Allen
of probability, possibility, and belief are epistemic modalities, and Unwin.
and expressions of obligation are deontic modalities. In English, Lyons, John. 1977. Semantics. Vol. 2. Cambridge: Cambridge University
Press.
these meanings are conveyed by modal verbs, e.g., he might
Mithun, Marianne. 1999. he Languages of Native North America.
come or he must have come (epistemic), he must come (deontic)
(see Palmer 1986, 51125; Jespersen 1924, 3201). Further modal Palmer, F. R. 1986. Mood and Modality. Cambridge: Cambridge University
distinctions include desiderative (unachievable desire), optative Press.
(achievable desire), conditional, hypothetical, potential, pur- Sadock, Jerrold M., and Arnold M. Zwicky. 1985. Speech act distinctions in
posive, and apprehensive (lest). Languages with a rich verbal syntax. In Language Typology and Syntactic Description. Vol. 1: Clause
morphology may have special marking for each distinction. Structure. Ed. Timothy Shopen, 15596. Cambridge: Cambridge
An alternative (rare) cover term for both mood and modality is University Press.
mode (Chung and Timberlake 1985).
Some languages have an aix with a general meaning of
MORPHEME
irrealis covering possibility, future, negative statements, and
commands. hese languages have the category of reality status, his term has been used in two ways: In Leonard Bloomields
the grammaticalized expression of an events location either in sense, a morpheme is a minimal meaningful form; in Zellig
the real world or in some hypothetical world (see Elliott 2000, for Harriss and Charles F. Hocketts later usage, a morpheme is an
its cross-linguistic validity). In Maung, an Australian language, abstract unit of analysis realized by a morph (= minimal mean-
statements in the present, past, and future are marked with rea- ingful form) or by a set of synonymous morphs in complemen-
lis suixes. Potential meanings I can do X are expressed with tary distribution. In Bloomields sense, the plural suixes -s
irrealis, as are commands. In Manam, an Oceanic language, irre- and -en are distinct morphemes; in the latter sense, they are
alis covers future, probable, and counterfactual statements, pos- distinct morphs realizing the same morpheme. he term is not
itive commands, and habitual actions. But in Yuman languages always used consistently in morpheme-based approaches to
513
Morphological Change
morphology. In paradigm-based approaches, no linguistic once operated between words can give rise to alternations with
principle is assumed to make essential reference to morphemes morphological value; thus, the consonant mutations of Irish
as a uniied class of elements. Gaelic, such as b/v/m in ba:d, va:d, ma:d her, his, their
boat, respectively, result ultimately from the diferential efects
Gregory Stump
of former word-inal consonants of possessors *as, *a, *an on
word-initial b. he efects of sound changes, such as the erosion
of inal syllables whereby English singular and plural forms
Bloomield, Leonard. 1933. Language. New York: Henry Holt and Co. oxe, oxene became ox, oxen, can lead to the reanalysis of internal
Harris, Zellig S. 1942. Morpheme alternants in linguistic analysis. word structure so that -en is interpreted as a (new) plural suf-
Language 18: 16980. ix. Loss of inal t in the pronunciation of French argent silver
Hockett, Charles F. 1947. Problems of morphemic analysis. Language
caused argent-ier silversmith to be reinterpreted as argen-tier,
23: 32143.
and allowed the new pattern to be extended to create derivatives
such as bijou-tier jeweler from bijou.
Syntax may supply the source of new morphology, as phrases
MORPHOLOGICAL CHANGE
are reinterpreted as single words (by a process called univerbation).
Morphological change involves alterations made by speakers over Former clitics may be reanalyzed as aixes in a process often
time to the analysis of complex words, or to the relations between described as a kind of grammaticalization with accompany-
a lexical base and its compounds and derivatives, or to the set ing functional changes. hus in Tocharian, new case suixes (with
of inlected words that share a common lexical base. he main meanings such as toward, through, with, from) were cre-
mechanisms of morphological change are reanalysis (the reinter- ated by fusing former postpositions with nouns in the oblique case.
pretation of forms) and the extension of patterns to create new Complex new inlectional markers can be created, such as French
forms. he impetus for reanalysis often comes from changes to aim-eras you will love, where the suix includes part of the Latin
the semantics, phonology, or syntax of the afected forms. ininitive suix, auxiliary verb have, second-singular (2sg) sub-
Semantic shift may afect the function of a grammatical ele- ject marker as the Romance future was grammaticalized from a
ment. hus, in various Australian languages, a subordinating construction have to VERB. he univerbation of phrases can even
purposive suix (in order to VERB), typically -ku, came to be lead to word-internal inlection, with grammatical markers becom-
used in independent clauses as a marker of intentional mood ing trapped between erstwhile lexical elements, for example, in the
(may VERB), then further shifted to express future tense (will slightly archaic English whomever and whose-ever, where ever was
VERB). Functional shift in a grammatical aix also took place once a separate word, or in Old Irish atotch sees you (vs. atch
in some Karnic (Australian) languages, in which a locative (at) sees), where ot continues an earlier pronoun that was positioned
case suix -nga came to mark the dative (for) function. Meaning between the two words that together meant see.
changes may lead to the reinterpretation of compound words as Much morphological change involves only rearrangements
simple lexemes (e.g. hlford loaf + ward > lord, shep-herd within the morphology itself, within and across paradigms, and
sheep+ herder > shepherd), or as a lexical stem plus a deriva- involving either stems or aixes. In leveling, one variant of a
tional aix (king-dom) with downgrading of a bound lexical form stem is extended to all cells in an inlectional paradigm; thus, in
-dom meaning condition to a derivational suix. Ancient Greek, the prehistoric kw of *leikw- leave developed by
Sound changes often create new allomorphs: For example, regular sound change into t or p before diferent vowels, but the
the earlier English plural suix -z split into three variants -z, -z, leip- variant was later generalized to the whole paradigm. Stem
-s, and the stem long developed a variant leng- in length with the variants may be redistributed according to a pattern prevalent
e conditioned by the following i in the former derivational suix in other paradigms by a process called analogical change (see
-ith. As the relationship between words becomes obscured by analogy; synchronic and diachronic). hus, in the early
the accumulation of sound changes, some phonological difer- modern German verb give the variant gib-, which arose by
ences become morphologized, that is, reinterpreted as partial or sound change in all the singular forms, was later conined to the
even sole signals of a morphological property. hus, the vowel e second and third persons singular because many other verbs, for
in slep-t (vs. ee in sleep) helps to mark past tense, and ee in feet example, sleep, had a pattern where only these two forms had
(vs. oo in foot) alone marks plural. Even sound changes that a diferent stem vowel. (See Table.)
Pre-Greek (Doric) Greek EMGerman ModGerman German

leave leave give give sleep
1Sg leip leip Gib geb-e schlafe
2Sg leiteis leipeis gib-st gib-st schlfst
3Sg leitei leipei gib-t gib-t schlft
1Pl leipomen leipomen geb-en geb-en schlafen
2Pl leitete leipete geb-t geb-t schlaft
3Pl leiponti leiponti geb-en geb-en schlafen
514
Morphological Typology Morphology
Where there are diferent inlectional classes, one class is ev -ler- im - iz -den
usually dominant and its inlectional pattern tends to inluence house -PL -POSS. 1-PL -ABL
the others. hus, in early Italic languages, noun stems in -, i-,
he third type of language expresses diferences in morphosyn-
u- remodeled their former ablative singular forms on the pattern
tactic and lexicosemantic properties through contrasting modi-
of d in the dominant o-stem class, creating new endings in -d,
ications, or inlections of a words stem. hese are inlectional
-d, -d, respectively. Words are often transferred from an irregu-
or fusional languages. he classical languages, Greek, Latin, and
lar inlectional class to the dominant one: hus, English drag, a
Sanskrit, belong to this type. In Latin you (sg[singular]) loved
former strong verb with past drug, changed to the weak class
is expressed by various modiications of the root am- love to
with regular past inlection dragged.
yield amvist: stem formative -v to express perfect, and -ist to
Iconicity has been emphasized by Natural Morphologists
express perfect (again) + 2d person + singular. Typically, proper-
as one of the principles motivating morphological change.
ties are fused in one exponent: Here aspect, person and num-
Iconically organized paradigms code more complex grammati-
ber agreement are expressed together. Equally, a property can
cal meanings by means of phonologically larger markers and
be expressed by more than one exponent: Here perfect is being
simpler meanings with smaller markers, and the most basic
expressed twice.
meanings (singular number, nominative case, present tense,
here has been general unease among modern linguists with
third person agreement, etc.) by no marker at all. In some
the classical typology. One reason is that languages rarely fall
Slavic languages, after sound changes created zero case-number
cleanly into one of these types. For example, Mandarin Chinese
suixes in both the nominative singular and genitive plural of
productively uses what looks like a derivational suix to build
o-stem nouns, the paradigm was repaired only in the genitive
agentive nouns, the word q mechanism: sn-r q cooler,
plural (by substituting an overt suix -ov from another inlec-
jin-c q monitor, yng-shng q speaker; compare the
tional class), whereas the iconic zero marking was retained in
English -er/-or agentive suix (Hippisley, Cheng, and Ahmad
the nominative singular. Iconicity in the syntagmatic dimension
2005). More importantly, there is some doubt that the typology
is increased by changes that reorder a form like (Australian)
ofers any theoretical insight, a point argued as far back as Sapir
Arrernte me-k(e)-atye mother-to-my, where atye was origi-
(1921). Part of the reason is that morphological type is really a
nally an enclitic pronoun, to the sequence m(e)-atye-ke moth-
function of other grammatical structures worthy of typologi-
er-my-to, which better mirrors the semantic scopal relations
cal investigation, and is, therefore, epiphenomenal (Anderson
between the elements.
1990).
Harold Koch A more promising approach is to focus on much more nar-
rowly deined word structures and to investigate how they
WORKS CITED AND SUGGESTIONS FOR FURTHER READING cross-cut languages that may or may not be genetically or typo-
logically related. he result is then a typology of narrowly deined
Anderson, Stephen R. 1992. Morphological change. In A-morphous
Morphology, 33672. Cambridge: Cambridge University Press. structures of words that answer the question What is a possible
Joseph, Brian D. 1998. Diachronic morphology. In he Handbook word? his is the approach taken by Greville G. Corbett and col-
of Morphology, ed. Andrew Spencer and Arnold M. Zwicky, 35173. leagues, who look at unusual morphology such as suppletion,
Oxford: Blackwell. deponency, and defectivenesss, recording such structures in a
Koch, Harold. 1996. Reconstruction in morphology. In he Comparative large number of individual languages and inducing diachronic
Method Reviewed: Regularity and Irregularity in Language Change, ed. and synchronic models of their appearance and use in syntax
Mark Durie and Malcolm Ross, 21863. New York: Oxford University (e.g. Corbett 2007; Baerman and Corbett 2007).
Press.
Andrew Hippisley
MORPHOLOGICAL TYPOLOGY WORKS CITED AND SUGGESTIONS FOR FURTHER READING

typology has its origins in nineteenth-century morphologi- Anderson, Steven. 1990. Sapirs approach to typology. In Contemporary
cal typology, a method of grouping languages not according Morphology, ed. W. Dressler et al., 27795. Berlin: Mouton de Gruyter.
to genetic relatedness but to structural similarity, where the Baerman, Matthew, and Greville G. Corbett. 2007. Linguistic typol-
structure was specifically word structure (see morphology ). ogy: Morphology. Linguistic Typology 11: 11517.
Traditionally, there are three possibilities for phonologically Corbett, Greville G. 2007. Canonical typology, suppletion and possible
expressing morphosyntactc (inflectional) and lexicosemantic words. Language 83: 842.
(derivational) properties at the level of the word. In an iso- Hippisley, Andrew, David Cheng, and Khurshid Ahmad. 2005. he
lating or analytical language, complex words are built from head modiier principle and multilingual term extraction. Natural
Language Engineering 11.2: 12957.
existing words, free forms. Mandarin Chinese could be viewed
Sapir, Edward. 1921. Language. New York: Harcourt, Brace and World.
as an isolating language. Productive coining of new terms is
through compounding. The word for Internet is h-lin
wng with h inter + lin related + wng net. In agglu-
MORPHOLOGY
tinating languages, the pieces of a complex word map onto
specific meaning elements biuniquely, both at the lexical and Morphology and Words
grammatical level. Turkish evlerimizden from our houses is While the lexicon of a language lists basic forms and their content
glossed as (meanings and grammatical properties), a languages complex
515
Morphology
words neednt be invariably listed, since their form and content comprises two subsystems. he derivational subsystem derives
are often partially or wholly deducible from those of their parts one lexeme from another; for instance, the derivational subsys-
by means of regular principles. his system of principles is the tem of English derives the verbal lexeme crystallize from the
languages morphology. nominal lexeme crystal. he compounding subsystem deines
In morphological theory, it is useful to distinguish three senses complex lexemes through the combination of other lexemes;
of word. In one sense, the form come is the same word in (1) and thus, the compounding subsystem of English creates the com-
(2); in another sense, come in (1) is a diferent word from come pound lexeme mountain lion from the lexemes mountain and
in (2). his apparent paradox arises because word can be used lion. A number of criteria have been adduced to distinguish
to refer to either a phonological or a grammatical unit: In (1) and inlection from derivation and to distinguish compounds from
(2), come represents the same phonological unit (phonetically syntactic combinations; see Booij (2000) and Matthews (1991),
[km]) but two distinct grammatical units: the unmarked inini- respectively.
tive form of the verb come in (1) and the past participial form of
this verb in (2). Incremental and Realizational Approaches to Morphology
Structuralist approaches to morphology gave primacy to
(1) Sandy should come home.
morphemes (minimal pairings of form with meaning) as the basic
(2) Sandy has already come home. units of morphological analysis. hey were incremental in orienta-
hus, it is useful to distinguish phonological words such as [km] tion, in that they presumed that the content of a word is the sum
from grammatical words such as the past participle of come. of the content of its component morphemes. hese structuralist
Moreover, there is a third theoretically relevant interpretation assumptions have been very persistent in modern linguistic theory.
of word according to which go and gone are diferent forms of heir widespread acceptance has led many linguists to assume that
the same word. Here, word refers neither to a phonological word all morphological phenomena can be accounted for by indepen-
nor to a grammatical word but to the abstract lexical element of dently needed principles of syntax and phonology; thus, since the
which go and gone are both realizations; abstract elements of this advent of generative linguistics, morphological issues have often
sort are referred to as lexemes. been addressed as a part of syntax (Selkirk 1982; Lieber 1992) or as
hese three senses of word are related in the following a part of phonology (Kiparsky 1982).
way: the pairing of a lexeme with an appropriate set of morpho- Incrementalist theories of morphology are problematic, how-
syntactic properties deines a grammatical word, as in (3), and ever (Stump 2001, 3 f). First, there are words whose content can-
the phonological realization of a grammatical word is a phono- not be factored into that of their component morphemes; that
logical word, as in (4). is, the content of a words individual morphemes may underde-
termine that of the word itself. he aorist verb form krd-o-x I
(3) Grammatical words (4) Phonological words stole in Bulgarian is unambiguous despite the fact that none of
its three component morphemes expresses irst-singular (1sg)
a.talk, {3sg present a.(3a)s realization: [tks]
subject agreement; compare 2sg/3sg krde, 1pl krd-o-x-me,
indicative}
2pl krd-o-x-te, and 3pl krdo-x-a. To account for such forms,
b.dog, {plural} b.(3b)s realization: [dgz] proponents of incrementalist theories must postulate zero mor-
c.good, {comparative} c.(3c)s realization: [b] phemes, which lack overt phonological realization but purport-
edly supply the missing content.
Correspondingly, a lexemes paradigm is the full set of grammat- Second, syncretism (the use of the same morphology to
ical words associated with it; a morphosyntactic property sets express distinct content) is problematic for incrementalist theo-
exponence is its phonological realization; and where G is a gram- ries. In Sanskrit, the accusative singular suix -m is also used as
matical word in the paradigm of lexeme L, Ls root is the phono- a nominative singular suix in the paradigms of neuter a-stem
logical form (if such can be identiied) with which the exponence nouns: Compare the masculine noun horse (nom. sg. ava-,
of Gs property set combines in the phonological word realizing acc. sg. ava-m) with the neuter noun gift (nom./acc. sg.
G. hus, in the realization of the grammatical word (3b) in the dna-m). Incrementalist theories must attribute syncretism to
paradigm of the nominal lexeme dog, the exponence [z] of (3b)s homonymy (e.g., to the existence of two distinct -m suixes in
property set {plural} combines with dogs root [dg]. he dis- Sanskrit), but in doing so miss important generalizations (e.g.,
tinction between a words exponence and its root is, of course, the fact that the nominative and accusative are always syncre-
sometimes diicult to draw, as in the realization of be, {1sg tized in the paradigms of Sanskrit neuter nouns, regardless of
[irst-singular] present indicative} as the portmanteau [m]. what the exponence of these cases might be).
Finally, incrementalist assumptions give no explanation for
Branches of Morphology the incidence of extended exponence (the appearance, within a
A languages morphology comprises two systems. he inlectional single word, of more than one morpheme expressing the same
system deines the phonological realization of the grammati- content). In Nyanja (Niger-Congo; Malawi), adjectives exhibit
cal words in a lexemes paradigm; for instance, the inlectional noun-class agreement with the noun they modify, and members
system of English speciies that the third-singular (3sg) present of one group of adjectives exhibit two agreement preixes, as in
indicative form of the lexeme talk is talks. By contrast, the sys- the case of ci-pewa ca-ci-kulu large hat, where kulu large
tem of word formation (better: lexeme formation) deines com- agrees with the class 7 noun -pewa hat by means of two distinct
plex lexemes in terms of simpler lexemes. he latter system itself preixes. On incrementalist assumptions, the preix ci- in ca-ci-
516
Morphology
kulu should alone suice to mark this form for class 7 agree- general account of such essentially paradigmatic phenomena as
ment; the appearance of the additional preix ca- not only seems syncretism (the relation among paradigm cells that are identi-
unnecessary but actually violates the anti-redundancy principle cal in their realization), deponency (the realization of one cell by
(Kiparsky 1982; 136 f) purported to prevent the suixation of plu- means of morphology appropriate to a diferent cell), heterocli-
ral -s to English men. sis (the realization of distinct cells within a paradigm according
he alternative to an incrementalist theory is a realizational to contrasting conjugational/declensional patterns), and defec-
theory, according to which a words content determines its mor- tiveness (the existence of unrealized cells within a paradigm). For
phological form (Matthews 1972; Zwicky 1985; Anderson 1992). discussion of these phenomena in paradigm-based frameworks,
In a realizational theory, the paradigm of the verbal lexeme talk see Baerman (2004), Baerman, Brown, and Corbett (2005), and
includes the grammatical word in (3a), and the phonological Stump (2001, 2006).
word that realizes this grammatical word arises from talks root he alternative, paradigm-based approach is instantiated
through the application of any rules associated with the mor- by such realizational theories as A-morphous morphology
phosyntactic property set in (3a); there is one such rule, which (Anderson 1992), network morphology (Corbett and Fraser 1993;
realizes the property set {3sg present indicative} through the suf- Brown and Hippisley in press), and paradigm function morphol-
ixation of s. ogy (Stump 2001). hese theories take paradigms rather than
In a realizational theory, the fact that a words form may morphemes as the primary object of morphological inquiry and
underdetermine its content is unproblematic, since content is formulate morphology as an autonomous grammatical compo-
not deduced from form in any event. hus, the fact that the irst- nent. Despite diferences of detail, they are alike in assuming that
singular aorist form krd-o-x I stole in Bulgarian has no expo- a lexeme has a paradigm of grammatical words a set of pairings
nent of irst-singular subject agreement is simply the efect of a such as those in (3) whose phonological realization is deter-
kind of poverty in the languages verb morpology: It happens not mined by a system of deductive rules, for example, the rule of
to have any means of expressing the property 1sg in the reali- exponence in (5a) and the rule of referral in (5b).
zation of the grammatical word steal, {1sg aorist}. Syncretism
(5) a. Where lexeme L has root R, L, {inite past } is realized
is likewise unproblematic: One need only assume that the reali-
as Red.
zation of one word in a lexemes paradigm may pattern after the
b. L, {past participle} has the same realization as L, {inite
realization of a diferent word in that paradigm. Rules of referral
past }.
(Zwicky 1985; Stump 1993) express this kind of relation between
cells in a paradigm; thus, in Sanskrit, a rule of referral speciies By (5a), the lexeme walk has the past tense form walked; by
that the realization of a neuter nouns nominative singular cell is (5b), this lexeme also has walked as its past participle. A cen-
the same as that of its accusative singular cell. Finally, extended tral assumption in paradigm-based theories is that rules act as
exponence is unproblematic in a realizational theory; in the case defaults and are therefore subject to override; in the inlection
of Nyanja ca-ci-kulu large [class 7], one need only assume that of verbal lexemes such as sing, the rules in (5) are overridden
more than one rule of preixation participates in the realization by those in (6). An important concern in such theories is that of
of the grammatical word large, {class 7}. establishing general principles regulating the default/override
relations among rules of morphology. In network morphology,
Current Theories of Morphology these relations are regulated by their position in default-inher-
Two approaches to morphology dominate the theoretical land- itance hierarchies; in paradigm function morphology, they are
scape: the morpheme-based approach and the paradigm-based regulated by the Pinian determinism hypothesis (Stump 2001,
approach. Distributed morphology (DM) is the main embodi- 23), according to which Rule A overrides Rule B if and only if A is
ment of the morpheme-based approach (Halle and Marantz narrower in application than B.
1993). DM maintains the structuralist focus on morphemes as
(6) Where L belongs to the sing class and has root R,
the central unit of morphological analysis, but difers from ear-
a. L, {inite past } is realized through the substitution of
lier morpheme-based approaches in its assumption that mor-
[] for [] in R.
phemes are inserted into abstract grammatical structures in a
b. L, {past participle} is realized through the substitution of
realizational fashion. (Here and throughout, I use morpheme in
[] for [] in R.
the Bloomieldian sense of minimal form-meaning pairing.)
he verb in hey talked instantiates the abstract grammatical Because they deine complex words by means of deductive
structure V-past-pl through the insertion of the verbal mor- rules such as those in (5)/(6), paradigm-based theories aford
pheme talk and the past tense morpheme -ed; the property of a parsimonious account of interactions between concatenative
third-plural agreement goes unrealized because there is no non- and nonconcatenative morphology: he fact that (5a) fails to
zero morpheme available to realize it. (DM therefore accom- apply in the deinition of sings past tense form can be directly
modates cases of underdetermination such as that of krd-o-x I attributed to the override relation between (6a) and (5a).
stole.) From earlier incrementalist approaches, DM inherits the In DM, by contrast, the absence of -ed in sang must instead be
assumption that morphological structures such as V-past-pl attributed to an overriding, phonologically empty suix whose
are deined by rules of syntax. his assumption presents prob- presence triggers a rule of [] [] ablaut. his account of the
lems that have never been convincingly resolved in the some- sing/sang (*singed) alternation implies a parallel account of sing/
what hermetic DM literature: In rejecting rules that (like rules sung (*have singed), mouse/mice (*mouses), thief/thieve (*thieize,
of referral) are deined over paradigms, DM is left without any cf. burglarize), and other alternations: In each case, a default aix
517
Morphology
must be seen as being overridden by a null aix, which by stip- words are invariant in their semantic interpretation; yet there
ulation triggers a rule of internal modiication. What emerges is are instances in which the semantics associated with a particular
a widely recurrent coincidence that is never explained: Again morphosyntactic property is sensitive to its paradigmatic context
and again, a zero aix is stipulated as the unoverridden over- (Stump 2007).
ride among a set of competing morphemes; over and over, this A inal area of current interest is that of implicative theories
unoverridden override by stipulation triggers a rule of internal of morphology. Like realizational theories, implicative theories
modiication. here is, of course, no overt class of phonologically depend on the postulation of paradigms, but unlike them, they
identical aixes in any language that ever shows the kind of syn- assume that the forms realizing a paradigms cells are deined
tagmatic and paradigmatic distribution that DM must stipulate by implicative relations among these cells (Blevins 2005, 2006).
for the artifactual class of phonologically null aixes upon which hus, in an implicative theory, certain words in a paradigm have
this approach depends. a privileged status because they serve as the basis for deducing
the paradigms other words. If a small number of such privi-
Current Issues in Morphology leged forms uniquely determine the entire paradigm (as the
he diferences between morpheme-based theories and par- forms laud, laudre, laudv, and laudtum suice to deter-
adigm-based theories have been most clearly articulated with mine the paradigm of praise in Latin), they may be charac-
reference to inlectional phenomena. But the question naturally terized as principal parts; but even words that arent principal
arises whether the principles of lexeme formation are morpheme parts may carry speciic implications for the formation of cer-
based or instead favor a paradigm-based approach. Signiicantly, tain other members of their paradigm. Reference to these rela-
derivational morphology exhibits the same sort of default/over- tions among a paradigms cells seems central for an account
ride relations as inlectional morphology: In much the same way of the processes of morphological deduction upon which the
as the lexeme sing possesses an inlectional paradigm in which acquisition and use of language depend; moreover, implicative
the appearance of the past tense form sang blocks that of *singed, relations among the cells in lexemes paradigms are a signii-
the lexeme strong seems to possess a derivational paradigm cant domain of typological contrast among languages (Finkel
in which the appearance of the nominal derivative strength and Stump 2009).
blocks that of *strongness. For discussion of the evidence for
Gregory Stump
derivational paradigms, see Bauer (1997), Booij (1997).
he nature of the interface between morphology and syntax WORKS CITED AND SUGGESTIONS FOR FURTHER READING
also requires further scrutiny. One question subsumed by this
broad issue is whether a languages periphrases are deined by its Ackerman, Farrell, and Gregory Stump. 2004. Paradigms and periphras-
tic expression: A study in realization-based lexicalism. In Projecting
morphology or by its syntax. A periphrase is a multiword realiza-
Morphology, ed. Louisa Sadler and Andrew Spencer, 11157. Stanford,
tion for a grammatical word; thus, while the grammatical word
CA: CSLI Publications.
smart, {comparative} has the synthetic realization smarter, Anderson, Stephen R. 1992. A-morphous Morphology. Cambridge:
the grammatical word intelligent, {comparative} has the Cambridge University Press.
periphrastic realization more intelligent (*intelligenter). For dis- Aronof, Mark. 1994. Morphology by Itself: Stems and Inlectional Classes.
cussion of the evidence in favor of a morphological approach to Cambridge, MA: MIT Press.
periphrasis, see Kersti Brjars, Vincent, and Chapman (1997) and Baerman, Matthew. 2004. Directionality and (un)natural classes in syn-
Ackerman and Stump (2004); the grammatical consequences of cretism. Language 80: 80727.
this conclusion remain to be worked out in detail. Baerman, Matthew, Dunstan Brown, and Greville G. Corbett.
Another controversial aspect of the morphology/syntax inter- 2005. he Syntax-Morphology Interface: A Study of Syncretism.
face is the phenomenon of clisis. Because their morphology Cambridge: Cambridge University Press.
Bauer, Laurie. 1997. Derivational paradigms. In Yearbook of
resembles that of aixes while their syntax is wordlike, clitics
Morphology 1996, ed. Geert Booij and J. van Marle, 24356. Dordrecht,
raise very speciic questions about the division of labor between
the Netherlands: Kluwer.
the components of morphology and syntax. Although recent Blevins, James P. 2005. Word-based declensions in Estonian. In
years have seen a vast amount of research into the properties of Yearbook of Morphology 2005, ed. Geert Booij and J. van Marle, 125.
clitics, there is, as yet, little consensus as regards their precise Dordrecht, the Netherlands: Springer.
theoretical status. Particularly urgent are the need to understand . 2006. Word-based morphology. Journal of Linguistics
the diferences between clitics and aixes (Zwicky and Pullum 42: 53173.
1983) and the need to reconcile these diferences with the com- Booij, Geert. 1997. Autonomous morphology and paradigmatic rela-
plex interactions between clisis and aixation (Spencer and Lus tions. In Yearbook of Morphology 1996, ed. Geert Booij and J. van
2005). Marle, 3553. Dordrecht, the Netherlands: Kluwer.
he principles of the morphology/semantics interface also . 2000. Inlection and derivation. In Morphology: An International
Handbook on Inlection and WordFormation, ed. Geert Booij,
urgently require clariication. Phenomena such as syncretism,
C. Lehmann, and J. Mugdan, 3609. Berlin: Walter de Gruyter.
deponency, extended exponence, and morphological underde-
Brjars, Kersti, Nigel Vincent, and Carol Chapman. 1997. Paradigms,
termination are apparently incompatible with the assumption periphrases and pronominal inlection: A feature-based account. In
(characteristic of morpheme-based theories) that a words mor- Yearbook of Morphology 1996, ed. Geert Booij and J. van Marle, 15580.
phology is isomorphic to its semantic structure. Research in par- Dordrecht, the Netherlands: Kluwer.
adigm-based theories has tended to assume (often tacitly) that Brown, D. and A. Hippisley. Network Morphology. Cambridge: Cambridge
the morphosyntactic properties associated with grammatical University Press. In press.
518
Morphology, Acquisition of
Corbett, Greville G., and Norman M. Fraser. 1993. Network morphol- early childhood. he acquisition of derivational morphology, on
ogy: A DATR account of Russian nominal inlection. Journal of the other hand, is concerned with the formation of new words and
Linguistics 29: 11342. is thus related to school-age language learning and reading.
Embick, David, and Rolf Noyer. 2007. Distributed morphology and the Jean Berkos classic 1958 study found that children ive to
syntax-morphology interface. In he Oxford Handbook of Linguistic
seven years of age are able to apply both inlectional and deri-
Interfaces, ed. G. Ramchand and C. Reiss, 289324. Oxford: Oxford
vational suixes to novel stems (e.g., the plural wugs from wug,
University Press.
Finkel, Raphael, and Gregory Stump. 2009. Principal parts and degrees
or the adjective quirky from the noun quirks). hese results were
of paradigmatic transparency. In Analogy in Grammar, ed. J. Blevins interpreted at the time as evidence against the predictions from
and J. Blevins, 1353. Oxford: Oxford University Press. the prevailing theories of Behaviorism (Skinner 1957) and sup-
Halle, Morris, and Alec Marantz. 1993. Distributed morphology and ported the cognitive revolution in psychology and linguistics.
the pieces of inlection. In he View from Building 20, ed. K. Hale and his study has been replicated with children as young as two
S. Keyser, 11176. Cambridge, MA: MIT Press. in English and other languages (e.g., Kopcke 1998; Akhtar and
Kiparsky, Paul. 1982. From cyclic phonology to lexical phonology. In Tomasello 1997). Together with work examining childrens natu-
he Structure of Phonological Representations (Part I), ed. H. van der ral productions of morphology, this experimental work has been
Hulst and N. Smith, 13175. Dordrecht, the Netherlands: Foris. taken as evidence that young children use morphological rules
Lieber, Rochelle. 1992. Deconstructing Morphology. Chicago: University
to inlect and to form parts of speech.
of Chicago Press.
Matthews, P. H. 1972. Inlectional Morphology. Cambridge: Cambridge
University Press.
Inlectional Morphology
. 1991. Morphology. 2d ed. Cambridge: Cambridge University FIRST LANGUAGE. One major set of works investigating the
Press. acquisition of morphemes examined the order of acquisition of
Selkirk, Elisabeth O. 1982. he Syntax of Words. Cambridge, MA: MIT inlectional morphemes (Cazden 1968; Brown 1973; de Villiers
Press. and de Villiers 1973), focusing on English. his work found a
Spencer, Andrew, and Ana Lus. 2005. A paradigm function account of consistent (though not identical) order of acquisition among
mesoclisis in European Portuguese (EP). In Yearbook of Morphology children. Cross-linguistic work demonstrated that there was no
2004, ed. Geert Booij and J. van Marle, 177228. Dordrecht, the universal order of morpheme acquisition between languages,
Netherlands: Springer.
that the order and speed of acquisition depends on the target
Stump, Gregory T. 1993. On rules of referral. Language 69: 44979.
language and the morphemes themselves (Slobin 1985; Clark
. 2001. Inlectional Morphology. Cambridge: Cambridge University
1998). Work on other languages has also found consistent, but
Press.
. 2006. Heteroclisis and paradigm linkage. Language 82: not identical, orders within a language. Several factors appear to
279322. inluence the order of acquisition, including perceptual salience,
. 2007. A non-canonical pattern of deponency and its implica- complexity of the morpheme either semantically (how many
tions. In Deponency and Morphological Mismatches, ed. Matthew concepts it encodes) or formally (how variable the aix is, how
Baerman, Greville G. Corbett, Dunstan Brown, and Andrew Hippisley, many parts it contains), and frequency in the input. Eve V. Clark
7196. Oxford: Oxford University Press. (1993) suggested that several principles transparency (how
Zwicky, Arnold M. 1985. How to describe inlection. In Proceedings easily the meaning is derived from the parts), simplicity (how
of the Eleventh Annual Meeting of the Berkeley Linguistics Society, ed. variable the forms are), and productivity govern the order of
M. Niepokuj, M. VanClay, V. Nikiforidou, and D. Feder, 37286.
acquisition of both inlectional and derivational morphemes.
Berkeley, CA: Berkeley Linguistics Society.
Morphemes that are consistent in form (have few allomorphs)
Zwicky, Arnold M., and Geofrey K. Pullum. 1983. Cliticization vs. inlec-
and semantically encode a single feature, such as plural or pro-
tion: English nt. Language 59: 50213.
gressive -ing, tend to be acquired earlier than morphemes that
show more allophonic variation, such as with regular past tense
MORPHOLOGY, ACQUISITION OF
-ed, and/or encode multiple features, such as third person
he acquisition of morphology has played a central role in singular -s (Brown 1973; Clark 1993). his preference to encode
exploring both the acquisition of syntax (see syntax, acquisi- single features/forms with a single morpheme holds across lan-
tion of) and lexical acquisition. he study of acquisition guages and even for children acquiring more than one language.
of morphology can be distinguished by research in irst language For example, Melanija Mikes (1967) discussed the acquisition of
acquisition (FLA) and second language acquisition (SLA) locatives for bilingual Hungarian-Serbo-Croatian children. he
and by research into inlectional or derivational morphology. Hungarian locative suix, which is relatively transparent, was
Little research has attempted to link acquisition of inlectional acquired earlier than the semantically equivalent structure in
and derivational morphology. here has been some inluence Serbo-Croatian, which required locative prepositions + agree-
of work in FLA on that in SLA, in particular in debates about ment, which varied by gender.
the order of acquisition of morphemes for second language Inlected forms appear early, from the earliest word use, espe-
learners. cially in highly or consistently inlected languages (Slobin 1985).
he acquisition of inlection is often examined as the acqui- Languages such as English, with fewer inlected forms, have bare
sition of morphosyntax, that is, structures that are governed by forms appearing irst and inlected forms appearing later, often
both morphological and syntactic rules such as subject-verb concurrently with irst word combinations. In English, childrens
agreement. he majority of literature focuses on children under production of inlectional morphology begins with an initial
the age of seven, as inlectional morphology is acquired during period of some variability, gradually becoming more consistent
519
over time. Across languages, some morphemes (such as plurals) SECOND LANGUAGE. Early research in second language acquisi-
are consistently produced accurately by the age of three or four, tion of inlectional morphology investigated whether there was
whereas others, such as conditional marking of verbs, are often a consistent order of acquisition of morphemes. Initial studies
not mastered until ages seven or eight. While little research has reported consistent orders of acquisition across both adult and
been done on inlectional preixes, it has been suggested that child second language learners of English (e.g., Dulay and Burt
inlectional preixes are more diicult and are acquired later 1974). Later research, however, criticized these early studies for
than suixes (Slobin 1982; Clark 1998). their methodology, and found variability among learners of dif-
In all languages, children often go through a period during ferent backgrounds that seemed to belie a single order of acqui-
which they overregularize irregular forms in their grammar, sition for second language learners (e.g., Hakuta 1974; Rosansky
for example, breaked or mans. Irregular forms are often acquired 1976). he present consensus seems to be that morpheme lan-
early (broke/men) and then when regular forms are acquired, guage studies provide strong evidence that ILs [interlanguages,
overregularized forms coexist with irregular forms (breaked/ developing grammars] exhibit common accuracy/acquisition
broke) for a time until the irregular forms and exceptions are orders (Larsen-Freeman and Long 1991, 92). After this intense
mastered (Marcus et al. 1992). period of debate about the acquisition order of morphemes, later
he majority of research on inlectional morphology has research in SLA has focused on one of several areas: research on
focused on the verbal and nominal domains, with relatively speciic structures such as tense or aspect, the role of context in
little work on adjectives or adverbs. According to Clark (1998), the acquisition of morphology, and the implications of missing
the earliest verb forms across languages tend to be imperative, morphology for the developing grammars of second language
ininitive verb forms, and third person singular; singular forms learners. Research has begun to look again at order of acquisition
tend to be acquired before plural forms. he characteristics of in SLA, considering how models of acquisition and/or the func-
individual languages determine which agreement markers tor morphemes themselves can explain the order (Goldschneider
are learned earliest in any given language. In terms of tense and DeKeyser 2001). Here again, the discussion parallels that of
and aspect, present/nonpresent is the irst distinction children irst language acquisition, with perceptual salience, morphopho-
make, and distinctions between past, present and future appear nological regularity, semantic complexity, and frequency being
to be in place by age three. In languages that distinguish aspect, posited as factors contributing to the order of acquisition.
aspect also appears to be acquired around age three. However,
early aspect marking is also associated with the semantic char- Derivational Morphology
acteristics of the verb. For example, Li and Shirai (2000) argue FIRST LANGUAGE. Because derived forms are often used to ill
that early use of perfective aspect is more likely to occur with semantic gaps within a lexicon, the acquisition of derivational
telic (e.g., walked) or resultative verbs (e.g., smashed) than other morphology is studied within the domain of vocabulary learn-
types of verbs. Compound or periphrastic tenses such as present ing. Derivational morphology is acquired somewhat later, and
perfect are acquired later, and may not be in place until age ive with a less clear order of development, than inlectional mor-
or older. phology. Highly inlected languages such as Turkish or Finnish
Within the nominal domain, number marking occurs early show evidence of early derivational morphology. Studies indi-
in nouns and is one of the earliest nominal morphemes to be cate that the earliest derivations are zero-stem alternations (e.g.,
seen. gender marking also appears early, just after the irst the noun knife used as a verb) and compounding (e.g., dog-book)
nouns. Early gender/noun class marking appears to be based for languages where these processes are productive (Clark 1993).
primarily on the phonological shape of the noun and later Agentive suixes (-er in particular) and diminutives (dogg-y) are
becomes associated with individual lexical items. Noun class also acquired between ages two and three across many languages.
marking appears by age three, but adultlike acquisition of gen- However, the bulk of derivational morphology is acquired during
der or noun class marking, which requires attention to both middle childhood and adolescence (Tyler and Nagy 1989).
phonological form and semantics, does not appear until age As with inlectional morphology, the course of develop-
four or ive in many languages (Demuth 2003). In languages ment for derivational morphology varies from language to lan-
with classiier systems, such Chinese, Japanese or hai, general guage and depends on both the patterns within the language
classiier patterns appear early, and more ine-grained seman- and the properties of the derivations themselves. For example,
tic distinctions appear gradually. Case marking occurs early as Hebrew-speaking children derive verbs from nouns as young as
well, just after irst nouns are learned. In nominative-accusa- age three, while derived nominals are acquired later, after age
tive languages, the irst distinction to be acquired appears to eight and continuing into adolescence (Berman 2003; Ravid
be between nominative and accusative. Dative case is next, and Avidor 1998). Awareness of and ability to decode deriva-
followed by other oblique cases. here is also evidence from tional morphology continues to develop throughout the school
ergative languages that the major case distinctions in these lan- years. Comprehension of derivational morphology is correlated
guages are acquired early as well. Languages in which case var- with reading skills in a number of languages, including English,
ies by gender (e.g., Russian) take longer for contrasts to appear Hebrew and Chinese (Tyler and Nagy 1990; Levin, Ravid, and
than in languages that simply mark for case (e.g., Hungarian, Rapaport 2001; Kuo and Anderson 2006). As with inlectional
or Turkish). morphology, acquisition appears to be best explained by the
Inlectional morphology is generally mastered by the early transparency of the morpheme, complexity (semantic and for-
school years for all but the most infrequent or irregular construc- mal), and productivity (Clark 1998). Later-learned morphemes
tions across languages. such as -tion have more allomorphs, make more changes to the
520
stem they attach to, encode more concepts, and/or are less pro- words, while regular forms are concatenated via rules (Pinker
ductive than early learned morphemes such as -er. and Ullman 2002). he other approach, from those mainly work-
ing within connectionism, claims that learners are extracting
SECOND LANGUAGE. As with irst language (L1) acquisition, sec- statistical regularities and patterns, without positing a mental
ond language acquisition of derivational morphology has been rule (e.g., Plunkett and Marchman 1993). hese researchers sug-
studied together with general vocabulary growth and knowl- gest that there is a single, associationist mechanism for forming
edge of the lexical semantics of the language (Redouane 2004; past tense for all verbs, regular or irregular, and thus the rep-
Montrul 2001; Lardiere 1997). Little research has examined this resentations of regular and irregular forms do not difer. While
area of second language acquisition in detail. he research that rule-based accounts appear to have underestimated the human
exists suggests that beginning learners tend to ill lexical gaps ability to track statistical information about morphology, con-
with word formation strategies from their L1 or by extending nectionist models may not be able to generalize regular forms
existing second language vocabulary. More advanced learn- based on the typical frequencies of those forms in the input
ers are more likely to use derivational morphology found in the (Marcus 1995). hus, the extent to which these networks can
target language, although they are still likely to use non-target model human language acquisition is unclear.
forms. Evidence also suggests that derivational morphology pat-
terns that are substantially diferent from the L1 present consid- Summary
erable problems to second language learners and are learned he acquisition of morphology is central to both morphosyntac-
only gradually through time. tic development and lexical development for irst and second
language learners. While the acquisition of inlectional mor-
Current Debates phology is largely complete for children by the time they enter
Major areas of debate in the acquisition of morphology include school, it is often problematic for adult learners of a language.
the status of missing inlectional morphemes in learners gram- Derivational morphology, on the other hand, is an ongoing, life-
mars, whether productive use of morphemes relects rules and long process for both irst and second language learners. Little
an innate capacity for rule formation, and whether regular and research has attempted to link the two types of morphological
irregular forms are acquired and represented similarly. acquisition. More research on both derivational and inlectional
For irst language acquisition, one area of strong debate has morphology is needed on a wider variety of languages or varying
been the status of missing or omitted morphology by children. typologies, in particular on non-Indo-European languages. his
Within a general nativist framework, explanations range from is particularly the case for SLA, where the bulk of the research
maturation of grammatical structures, to prosodic and/or pho- has focused on European languages.
nological learning, to language-speciic lexical learning (e.g.,
Peters and Stromqvist 1996; Santelmann, Berk, and Lust 2000). Lynn Santelmann
More empiricist approaches point to such issues as frequency
of morphemes, vocabulary size, and transparency of the mor- WORKS CITED AND SUGGESTIONS FOR FURTHER READING
phemes themselves to explain the course of acquisition (e.g., Akhtar, N., and M. Tomasello. 1997. Young childrens productivity
Rowland et al. 2003). with word order and verb morphology. Developmental Psychology
Parallel debates have taken place within the literature on 33.6: 95265.
second language acquisition. Within the search for a consistent Berko, Jean. 1958. he childs learning of English morphology. Word
order of morpheme acquisition, debates have focused on the 14: 15077.
possible reasons for such an order: Is it due to an underlying uni- Berman, Ruth A. 2003. Childrens lexical innovations: Developmental
perspectives on Hebrew verb structure. In Language Processing and
versal grammar (Dulay and Burt 1974), characteristics of the
Acquisition in Languages of Semitic, Root-Based, Morphology, ed.
input, or general learning and characteristics of the morphemes
J. Shimron. Amsterdam: John Benjamins.
themselves (Goldschneider and DeKeyser 2001). In addition, the
Brown, Roger W. 1973. A First Language: he Early Stages.
issue concerning why adult learners, unlike child learners, con- Cambridge: Harvard University Press.
tinue to make mistakes with inlectional morphology even after Cazden, Courtney B. 1968. he acquisition of noun and verb inlections.
years of exposure to a language has been a source of signiicant Child Development 39: 43348.
debate. Explanations for this phenomenon range from a lack of Clark, Eve V. 1993. he Lexicon in Acquisition. Cambridge: Cambridge
access to universal grammar for adult learners to issues with pro- University Press.
sodic diferences between languages, diferences in proiciency, . 1998. Morphology in language acquisition. In he Handbook of
or other factors. Morphology, ed. A. Spencer and A. Zwicky. Oxford: Blackwell.
Within the domains of both inlectional and derivational Demuth, K. 2003. he acquisition of the Bantu languages. In he Bantu
Languages, ed. D. Nurse and G. Phillipson. Surrey : Curzon.
morphology, another major theoretical debate concerns the
de Villiers, Jill G., and Peter A. de Villiers. 1973. A crosssectional study of
nature of rules. Do children and adults learn rules, or do they
the acquisition of grammatical morphemes. Journal of Psycholinguistic
simply extract regularities from the speech stream? his debate Research 2: 26778.
has been particularly strong in the area of regular versus irregu- Dulay, Heidi, and Marina Burt. 1974. Natural sequences in child second
lar verbs. One side of the debate argues that children use rules for language acquisition. Language Learning 24: 3753.
regular verbs, and thus have distinct processes for forming and Goldschneider, Julie M., and Robert M. DeKeyser. 2001. Explaining
representing past tense with regular versus irregular forms (e.g. the natural order of L2 morpheme acquisition. Language Learning
Marcus 1996). his view argues that irregular forms are stored as 51: 150.
521
Morphology, Acquisition of Morphology, Evolution and
Hakuta, Kenji. 1974. A preliminary report on the development of gram- Slobin, Dan I, ed. 1985. he Cross-Linguistic Study of Language Acquisition.
matical morphemes in a Japanese girl learning English as a second lan- Hillsdale, NJ: Erlbaum.
guage. Working Papers on Bilingualism 3: 1838. Tyler, Andrea, and William Nagy. 1989. he acquisition of English
Kopcke, K. M. 1998. he acquisition of plural marking in English and derivational morphology. Journal of Memory and Language
German revisited: Schemata versus rules. Journal of Child Language 28.6: 64967.
25.2: 293319.
Kuo, Li-jen, and Richard C. Anderson. 2006. Morphological awareness
MORPHOLOGY, EVOLUTION AND
and learning to read: A Cross-language perspective. Educational
Psychologist 41.3: 16180. For some aspects of language, possible evolutionary explana-
Lardiere, Donna. 1997. On the transfer of morphological parameter tions are not hard to imagine, even if establishing the truth of
values in L2 acquisition. Proceedings of the Annual Boston University any of them may be diicult or impossible. For example, the
Conference on Language Development, 1997 21.2: 36677.
fact that utterances can be segmented into individual meaning-
Larsen-Freeman, Diane, and Michael H. Long. 1991. An Introduction to
ful elements (words or morphemes) has a clear functional
Second Language Acquisition Research. New York: Longman.
advantage in that combinations of these elements can be used
Levin, Iris, Dorit Ravid, and Sharon Rapaport. 2001. Morphology and
spelling among Hebrew-speaking children: From kindergarten to irst to express a huge range of complex meanings an advantage
grade. Journal of Child Language 28.3: 74172. exploitable in natural selection. Likewise, it is advantageous
Li, P. and Y. Shirai. 2000. he Acquisition of Lexical and Grammatical to have a syntax, that is, a set of traic rules for combining
Aspect . Berlin: Mouton de Gruyter. words or morphemes into larger units that can be interpreted
Marcus, Gary F. 1995. he acquisition of the English past tense in children reliably. Less immediately obvious but nevertheless vigorously
and multilayered connectionist networks. Cognition 56.3: 2719. defended by some scholars in recent years is the possibility
. 1996. Why do children say breaked? Current Directions in that certain aspects of language are as they are for physical or
Psychological Science 5.3: 815. mathematical, rather than biological, reasons. he hexagonal
Marcus, Gary F., Steven Pinker, Michael Ullman, Michelle Hollander, T.
shape of honeycomb cells is not due to a hexagonal-cell gene
John Rosen, and Fei Xu. 1992. Overregularization in language acqui-
inherited by bees but is a self-organizing outcome of cell con-
sition. Monographs of the Society for Research in Child Development
57.4: v164.
struction under particular spatial constraints. Conceivably,
Mikes, Melanija. 1967. Acquisition des categories grammaticales dans nonbiological self-organizing factors inluence language,
le langue de lenfant [Acquisition of grammatical categories in the lan- too.
guage of the child]. Enfance 3/4: 28998. For the existence of morphology, however, no such expla-
Montrul, Silvina. 2001. he acquisition of causative/inchoative verbs in nations seem immediately plausible. What functional advan-
L2 Turkish. Language Acquisition 9.1: 158. tage is there in the availability of not one but two patterns of
Peters, Ann M., and Sven Stromqvist. 1996. he role of prosody grammatical organization for complex expressions: syntactic,
in the acquisition of grammatical morphemes. In Signal to as in the French phrase tasse th (literally cup to tea) and
Syntax: Bootstrapping from Speech to Grammar in Early Acquisition, the English sentence hey were being bitten, and nonsyn-
ed. J. Morgan and K. Demuth. Mahwah, NJ: Lawrence Erlbaum.
tactic, as in the English compound word teacup and the Latin
Pinker, Steven, and M. T. Ullman. 2002. he past and future of the past
one-word sentence Mordebantur? More puzzling still, what
tense. Trends in Cognitive Sciences 6: 45663.
Plunkett, Kim, and Virginia A. Marchman. 1993. From rote learning to
functional advantage is there in the fact that the plural of pan is
system building: Acquiring verb morphology in children and connec- pans while the plural of man is men or that the past tense forms
tionist nets. Cognition 48.1: 2169. of bring, sting, and sing are brought, stung, and sang, respec-
Ravid, Dorit, and Avraham Avidor. 1998. Acquisition of derived nomi- tively? he irst question concerns the relationship between
nals in Hebrew: Developmental and linguistic principles. Journal of morphology and syntax, while the second concerns allomor-
Child Language 25.2: 22966. phic variation. Wouldnt languages, in general, function better
Redouane, Rabia. 2004. he acquisition of MSA word formation pro- if there were just one set of traic rules, not two, to guide the
cesses: A case study of English-speaking L2 learners and native speak- interpretation of complex expressions? And wouldnt English
ers. ITL, Review of Applied Linguistics, 145/146: 181217. function better if plurality and pastness were expressed in a
Rosansky, E. 1976. Methods and morphemes in second language acqui-
uniform fashion, as in the Newspeak of George Orwells 1984
sition. Language Learning 26: 40925.
or as in an artificial language such as Esperanto? Should
Rowland, Caroline F., Julian M. Pine, Elena V. Lieven, and Anna L.
heakston. 2003. Determinants of acquisition order in wh-ques-
we then look for a physical or quasi-mathematical explanation
tions: Re-evaluating the role of caregiver speech. Journal of Child instead? Yet these phenomena do not display the sort of hon-
Language 30.3: 60935. eycomb-like elegance that renders them obvious candidates
Santelmann, Lynn, Stephanie Berk, and Barbara Lust. 2000. Assessing for that explanation.
the strong continuity hypothesis in the development of English For possible solutions to these puzzles, it is natural to con-
inlection: Arguments for the grammatical mapping paradigm. sult hypotheses speciically concerning the evolutionary origin
Proceedings of the XIX West Coast Conference on Formal Linguistics of morphology as a component of grammar. Hypotheses of that
19: 43952. kind so far published are sketchy. Nevertheless, four trends can
Skinner, B. F. 1957. Verbal Behavior. New York: AppletonCentury
be distinguished:
Crofts.
Slobin, Dan I. 1982. Universal and particular in the acquisition of lan- (a) an appeal to uninterpretable features in Noam
guage. In Language Acquisition: he State of the Art, ed. E. Wanner Chomskys minimalist syntax (one type of appeal to self-
and L. Gleitman, 12872. Cambridge: Cambridge University Press. organization);
522
Morphology, Evolution and
(b) the projection into prehistory of grammaticalization used by some minimalist theorists to explain the existence of
processes such as are observed in historical linguistic morphology.
change; Grammaticalization theory concerns itself with the process
(c) an appeal to phonological consequences of the fact whereby in language change, what were once free forms
that the speech signal is continuous, not segmentable into dis- with concrete, lexical meanings can change in three ways: gram-
crete chunks with clear boundaries (and gesture likewise, if matically, so as to become bound rather than free (as the free
we suppose that language originated in that medium); form full has developed into a suix in helpful); semantically,
so as to contribute grammatical rather than lexical information
(d) a variant of (c) that also invokes the special circumstances
(as the verb will in English has shifted from desire to future
of our huntergatherer ancestors.
tense); and phonologically, so as to merge with a neighboring
Chomskyan Minimalism explores rationales for apparent phonological word (as in Ill come, derived from I will come)
imperfections in language. One such apparent imperfection (Heine and Kuteva 2002). All three changes can be observed in
is syntactic displacement: for example, the kind of noun phrase the history of Swedish, where what was once a free pronoun sik
fronting exhibited in Beans I like and Who did you see? and per- meaning himself/herself has developed into a suix -s with a
haps even in a simple clause such as John kissed Mary, if one habitual passive meaning. Bernard Comrie (1992) has suggested
assumes that syntactic subjects originate internally to the verb that not only individual aixes but also morphology overall origi-
phrase. Such displacement may serve communicative purposes nated this way. At an earlier, simpler stage of language, there was
(topicalization, for example). It still counts as a grammati- syntax but no morphology. Subsequently, phonological reduc-
cal imperfection, however, if there is nothing within grammar tion and meaning change in frequently occurring collocations
itself to drive it. his is where morphology may come in (it is brought into being a new kind of structure, with bound items
claimed). Let us suppose that some constituents have fea- alongside free ones, and phonologically reduced items alongside
tures that are uninterpretable and thus need to be erased phonologically full ones.
by moving those constituents to a location where these features A drawback with this approach is that it privileges syntax over
can be matched (Chomsky 2000, 2004). So far as grammar is morphology in an arbitrary way. Granted, all languages have
concerned, this matching helps to ensure that all of the syntactic syntax while some languages today make little or no use of mor-
requirements of the vocabulary items in the sentence are met, phology. hat does not, however, constitute evidence that syntax
while so far as purposes of language that lie outside grammar evolved earlier than morphology did. Implicitly, this approach
are concerned (such as communication), it may aid their posits a sort of prehistoric linguistic Golden Age when forms
fulillment by (for example) moving shared information to the and meanings were neatly paired one-to-one, and when lan-
start of utterance. he apparent imperfection thus disappears. guage did indeed have only one set of grammatical traic rules.
If, incidentally, some of the features that drive displacement However, such a Golden Age would have no parallel elsewhere
manifest themselves in overt morphology (for example, as case in evolutionary biology; there is no reason to think that func-
inlections), that is hardly surprising; it may facilitate the acqui- tionality as an outcome of natural selection was more pervasive
sition of grammar, for example. hus, the existence of morphol- at some time in the past than it is now.
ogy helps to resolve tensions between the way that the grammar Is there any reason to think, then, that the anomalies of mor-
is ideally structured and the extragrammatical uses to which phology have been around for as long as modern-style syntax
language is put. has, or even longer? Andrew Carstairs-McCarthy (2010) argues
his line of argument has at least three weaknesses, however. just this. Individual meaningful units (morphemes), whether
Firstly, it says nothing about the allomorphy exhibited in pans spoken or signed, are not diamond-hard discrete entities, unaf-
and men or in brought, lung, and sang. Secondly, it says noth- fected in their shape by neighboring units. his would have been
ing about why derivational morphology exists (for example, why just as true before fully modern syntax had evolved as subse-
we say writer and artist rather than, say, person write or person quently. hus, there would already then have been in existence
art). hirdly, it relies too much on the intellectual appeal that phonological processes that would give rise to allomorphy, and,
paradoxes can exert. Consider the orbits of the planets around just as now, historical changes would sometimes have deprived
the sun. hese orbits are not circular, which may be seen as an this allomorphy of its phonological motivation (just as the voic-
imperfection, but the imperfection disappears in a paradoxical ing of the [v] in wives, plural as wife, now lacks the phonological
yet satisfying way (one may think) in that, even though an orbit motivation it had in Old English and has thus acquired a gram-
is elliptical, the planets position on its orbit is correlated with matical function, as an exponent of plural). Let us assume
its speed, as Johannes Kepler demonstrated. But the enjoyment that this allomorphy was coupled with an expectation that for-
of paradox can go too far. I carry a puncture repair kit when I go mal diferences should always be accompanied by diferences
cycling, which is an apparent imperfection because it adds to in information content. One has already the seeds of a kind of
the weight of my equipment. Am I then entitled to claim that this grammar in which the same item can be viewed as having more
imperfection disappears whenever I get a puncture because my than one form, provided that these forms are diferentiated
repair kit enables me to get on the road again? And can I even somehow. he diferentiation could be semantic (e.g., rise versus
argue that getting a puncture is paradoxically a positive event raise), grammatical (e.g., sang versus sing), or in terms of phono-
because it justiies my carrying the repair kit? Flat tires thus con- logical environment (e.g., in Italian udire to hear, the stem is
tribute to perfect cycling! his style of argument is strange, to od- when stressed and ud- when unstressed). (Grammatical dif-
say the least. Yet it is uncomfortably close to a style of argument ferentiation in this hypothetical stage of development would ex
523
Morphology, Evolution and Morphology, Neurobiology of
hypothesi not involve syntax, but could conceivably involve sys- MORPHOLOGY, NEUROBIOLOGY OF
tematic expression of categories such as number, tense, or dei-
he nature of word formation and word storage has long been
niteness.) And, where formal diferences involved extra segments
prominent in the disparate ields of linguistics, psychology, and
at the beginning or the end of an item, the seeds were sown for
neurobiology, and recently the neurobiology of morphology has
what we now call aixes, arising by a process quite distinct from
emerged as its own distinct subield of study.
grammaticalization.
Two prevalent issues in formal linguistic and psycholinguistic
A variant of Carstairs-McCarthys approach has been devel-
approaches to morphology are the distinctions drawn between
oped by Dieter Wunderlich (2006a, 2006b), linking grammatical
inlectional versus derivational morphology and between regular
evolution with cultural and economic change. he sort of syntax
versus irregular morphology. Another line of inquiry has sought
that many modern languages have, with lavish opportunities for
to identify the formal structure of basic morphological represen-
long-distance syntactic movement, would not have had signii-
tations (i.e., words, stems, aixes) and to determine the extent
cant evolutionary advantages (Wunderlich suggests) until after
to which complex words are either composed by a grammati-
the emergence of large speech communities whose members
cal process or stored as unanalyzed wholes. he issue of com-
did not all know one another, that is, until the Neolithic period.
positionality interacts with inlectional/derivational status and
Until then, he suggests, that is, as long as all humans were hunt-
with regularity; for example, various theorists have proposed
ergatherers living in small groups, elaborate morphology would
that irregular inlection is not compositional (1a), that familiar
have preponderated over syntax.
inlected forms are not compositional (2a), or that derivational
It will be seen that, as regards morphological evolution,
morphology is not compositional (3a) in the way that they are
widely divergent suggestions have been made about the balance
stored or processed.
between cultural and noncultural factors. Carstairs-McCarthys
and Chomskys approaches, though diferent in many ways, Noncompositional Compositional
agree in emphasizing noncultural reasons for the existence of (1) a. ran = [ran] b. ran = [run]+[past]
morphology as a component of grammar. For Comrie, on the (2) a. walked = [walked] b. walked = [walk]+[past]
other hand, cultural change is at least as important as biological
(3) a. hopeless = [hopeless] b. hopeless = [hope]+[less]
or self-organizational factors, while Wunderlich revives the view
that fully modern syntax came late as a cultural by-product of
For these issues, there is an immense and contradictory body
population expansion and the transition to agriculture, with the
of linguistic evidence. As a consequence, some linguists have
added twist that an elaborate morphological component was
looked to new methods, such as neuropsychology and neuroim-
already in existence early. Time will tell which viewpoint pre-
aging, as alternative sources of evidence.
vails or which combination of viewpoints.
Psychological research in the lexicon and morphology has
Andrew Carstairs-McCarthy largely focused on the roles of frequency, familiarity, and simi-
larity in word storage and identiication. A critical inding in
WORKS CITED AND SUGGESTIONS FOR FURTHER READING this tradition concerns the efect of lexical frequency. Words
Carstairs-McCarthy, Andrew. 2010. he Evolution of Morphology. Oxford: that occur more frequently are more quickly and successfully
Oxford University Press. recalled in a wide range of experimental settings. his inding
Chomsky, Noam. 2000. New Horizons in the Study of Language and has enabled psychologists to pose deeper questions about the
Mind. Cambridge: Cambridge University Press. Chapter 1 is especially nature of lexical representations in terms of which aspects of
relevant. those representations are crucial to the frequency efect. he
. 2004. Language and mind: Current thoughts on ancient prob-
fact that processing morphologically complex words may be
lems. In Variations and Universals in Biolinguistics, ed. L. Jenkins,
afected by the frequency of the entire form has led some to
379405. Amsterdam: Elsevier.
argue that familiar aixed words, including regularly inlected
Comrie, Bernard. 1992. Before complexity. In he Evolution of Human
Languages, ed. J. Hawkins and M. Gell-Mann, 193211. Reading, forms like walked, are stored as wholes in the mental lexicon
MA: Addison-Wesley. (e.g., Baayen, Dijkstra and Schreuder 1997). In essence, this
Heine, Bernd, and Tania Kuteva. 2002. World Lexicon of whole-word approach to representation and processing treats
Grammaticalization. Cambridge: Cambridge University Press. regular forms like walked and irregular forms like ran as equiv-
Hinzen, Wolfram. 2006. Mind Design and Minimal Syntax. Oxford: Oxford alent: Both are past tense forms, and, by hypothesis, access to
University Press. his book confronts frankly, from a Chomskyan per- that inlectional information is mediated in both cases by the
spective, some of the diiculties that morphology poses for the min- whole-word recognition.
imalist program. Neurobiological methods irst made an impact on our under-
Wunderlich, Dieter. 2006a. What forced syntax to emerge? In Between
standing of morphology through studies of individuals with
40 and 60 Puzzles for Krifka, ed. Hans-Martin Grtner, Sigrid Beck,
acquired impairments to their morphological systems. hese
Regine Eckardt, Renate Musan, and Barbara Stiebels. Berlin: Zentrum
studies typically tried to establish the loci of morphological func-
fr Allgemeine Sprachwissenschaft. Available online at: http://www.
zas.gwz-berlin.de/fileadmin/material/40 60-puzzles-for-krifka/ tions in the brain (in either anatomical structures or functional
index.html. architectures). he progenitors of these studies are the works of
. 2006b. Why is there morphology? Abstract of paper presented at French neurologist Paul Pierre Broca and German neurologist
Workshop on heoretical Morphology, Leipzig, June. Available online Karl Wernicke during the late nineteenth century. Broca reported
at: http://www.uni-leipzig.de/~jungslav/rmag/Wunderlich.pdf. on a patient whose production abilities were severely impaired
524
Morphology, Neurobiology of
due to a lesion in the inferior frontal gyrus of the frontal lobe, produced as *la mia studia [det.Fs my.Fs oice.Fs]) and between
a lesion/deicit pattern now know as Brocas aphasia. Wernicke subjects and verbs (io vivo solo [I live.1s alone.Ms] was produced
reported that patients with lesions in the left posterior section as *io vive solo [I live.3s alone]). In repetition tasks, FS was 98 per-
of the superior temporal gyrus sufered from a severe compre- cent correct in repeating derived words with their derivational
hension deicit; their speech was luid and natural sounding but morphology intact, but only 40 percent correct in repeating the
their word selection seemed divorced from meaning. Numerous inlection.
researchers have since argued that lexical storage and retrieval One issue not expressly examined in the case of this patient
functions are located in wernickes area, and that morpho- was whether the diiculty that FS had in repeating inlected
logical grammar functions are housed in brocas area. forms was modulated by the regularity of their morphology.
A quite diferent approach to the study of morphology, but one he relevance of this point relates to the importance of identi-
that is also grounded in neurobiology, is connectionist mod- fying the particular level of morphological representation that is
eling. Work in connectionist modeling has called into question implicated in the deicit. In some cases of acquired morpholog-
some of the most fundamental tenets of morphological theory. ical deicit, performance on regular and irregular morphology
D. E. Rumelhart and J. L. McClelland (1986) presented argu- dissociates. For example, patient SJD presented with a deicit
ments from modeling against the distinction between regular and that disrupted the production (but not the comprehension) of
irregular morphology. M. S. Seidenberg and L. M. Gonnerman regularly inlected forms like walked (which was read as walk
(2000) challenged the very existence of morphological represen- and as walking on diferent occasions), as well as morphologi-
tations, arguing instead that what linguists call morphemes are cally derived words like publisher (which she read as publishing)
merely the points of convergence between sound and meaning (Badecker and Caramazza 1991). In comparison, that patients
codes, and are not a distinct type of entity of their own. hese performance on irregularly inlected words was equal to her rel-
studies have drawn ierce criticism, stimulated vigorous debate, atively intact production of uninlected forms. A complementary
and played a major role in shifting the standards of mainstream dissociation of regular versus irregular inlection has also been
morphology to a consideration of brains and simulated brains as reported. For example, patient AW exhibited poor performance
viable data sources. for irregularly inlected forms in comparison to nearly intact per-
In each of the disciplines concerned with the neurobiology formance with regularly inlected words (Miozzo 2003; see also
of morphology, many central issues remain unresolved and cases reported in Laiacona and Caramazza 2004; Shapiro and
hotly debated. Furthermore, there had been little occasion until Caramazza 2003).
recently for these methodologically distinct disciplines to com- Some single-case studies have reported patients who present
municate, despite their concern with fundamentally the same with impaired comprehension and production for both regular
topic. However, recent technological advances have provided and irregular inlection in both spoken and written modalities
each of these disciplinary perspectives with new methods of though not always in equal proportion. his deicit has been
study, and thus new insights into perennial questions. hese construed as resulting from an abstract, morphosyntactic level
advances, combined with shifting disciplinary boundaries, have of deicit (i.e., one where walked and ran are both represented as
enabled the neurobiology of morphology to become a largely morphologically complex) (Badecker 1997).
coherent line of inquiry into the neural underpinnings of word Most often, the method of neuropsychological studies is to
storage and processing. We may divide recent approaches into establish dissociations between distinct morphological subsys-
neuropsychological, hemodynamic, and neurophysiological tems, but sometimes the content of patients errors themselves
methods. provides insight into the nature of morphological grammar. For
example, patient SJDs aix selection errors were not always
Neuropsychological Methods grammatically licensed (e.g., she read poorest as poorless,
Patients with brain damage and resulting impairments have along with an elaborative comment that indicated comprehen-
long been a valuable source of evidence about the neurobiology sion of the superlative aix: the most poorless Indians have
of morphology. Modern neuropsychological studies of morphol- very little money). hese performance features suggest that the
ogy evaluate patients who sufer impairments that selectively mechanisms for producing productively aixed words exploit
afect (or selectively spare) morphological functions as the compositional procedures (Badecker and Caramazza 1991).
result of brain damage. Typically, the rationale of these studies
is to identify dissociations between patterns of impaired and Hemodynamic Methods
preserved capacities in order to establish whether certain mor- In contrast to neuropsychological methods, hemodynamic
phological functions are distinct from other components of the neuroimaging methods have made it possible to directly observe
lexical system. areas of the brain involved in normal (intact) morphological
One such dissociation is a morphological deicit that disrupts processing. hese methods include positron emission tomography
the processing of inlectional morphology in the context of a (PET) and functional magnetic resonance imaging (fMRI). hese
relatively spared ability to process derivational morphology. he methods compare levels of blood low and blood oxygenation in
Italian-speaking patient FS reported in Miceli and Caramazza diferent areas of the brain as subjects perform cognitive tasks.
(1988) presented with one of the clearest instances of this per- One of the few PET studies focused narrowly on morphology
formance pattern. In spontaneous speech, FS made frequent sought to identify a neural correlate of the use of overt inlec-
errors of agreement between nouns and attributive modiiers tional verbal morphology in German (Gnther et al. 2001).
(e.g., the target phrase il mio studio [det.Ms my.Ms oice.Ms] was his study contrasted German verbs with and without overt
525
Morphology, Neurobiology of
inlectional suixes, revealing a diference in activation in and the orthographical and semantic levels (e.g., boilbroil, screech-
around Brocas area. he researchers interpret these results as scream). he former pairs exhibit a facilitatory priming efect on
evidence that Brocas area subserves morphological and/or the M350 latency, while the latter pairs do not. However, as in
morphosyntactic functions. many MEG studies, efects that were visible in the M350 peak
Increasingly, fMRI is the preferred method for hemodynamic latency were obscured in the behavioral response latency, pre-
investigations of morphology. Numerous fMRI studies have sumably by a diferent and opposite efect that arose later in the
pursued a functional localization for morphology by contrast- time course of processing.
ing conditions with and without aixes, but they have often Many other neurophysical studies have found convergent evi-
returned inconclusive results (see, e.g., Davis, Meunier, and dence that morphological constituents are actively recognized
Marslen-Wilson 2004). in the early stages of lexical processing. Repetition priming has
A few fMRI studies of morphology and the lexicon, however, been found to attenuate the N400 response component to iso-
have taken on more reined questions, producing more robust lated words in lexical decision tasks (Rugg 1985). his efect on
results. One such study is the investigation by A. Beretta and col- ERP has also been observed with priming by morphological rela-
leagues (2003) of German regular and irregular inlection. hey tives: Regularly inlected primes elicit a weaker N400 response
ind signiicant overall diferences in neural activation between for uninlected verb targets in comparison to unrelated primes,
the processing of regularly inlected and irregularly inlected whereas irregularly inlected primes do not produce a compa-
nouns and verbs. hey interpret this as evidence that regular and rable reduction (Mnte et al. 1999). hese contrasting priming
irregular morphological functions are subserved by distinct neu- efects have been taken as evidence for morphological parsing
ral systems, though their indings do not address where irregulars of regularly inlected forms. Other studies have found evidence
are processed or where regulars are processed, if such distinct that the recognition of regularly inlected words is supported by
locations were to exist. morphological decomposition in brain responses to morpho-
logically illegal combinations like bringed (Morris and Holcomb
Neurophysiological Methods 2005; Lck, Hahne, and Clahsen 2006; see also McKinnon, Allen,
In order to examine how speciic types of morphologically com- and Osterhout 2003 for evidence of bound-stem parsing).
plex words are processed (e.g., regularly vs. irregularly inlected Evidence for decomposition is also observed in the efects
words) over the time course of lexical processing, researchers of regularity and lexical frequency on the P600 response to
have increasingly turned to neurophysical recording techniques inlected words that are ungrammatical for their context. In
whose temporal resolution is well suited to the rapid changes in a study that manipulated lexical frequency, morphological
brain response to linguistic materials. hese imaging methods regularity, and grammatical it, high-frequency irregularly
include electroencephalography (EEG) also known as event- inlected verbs in ungrammatical contexts (e.g., the boy couldnt
related brain potentials (ERPs) which measures electrical cur- *ran / *walked fast enough) showed an earlier onset of the P600
rents caused by neural activity, and magnetoencephalography response than did low-frequency irregularly inlected verbs,
(MEG), which measures the magnetic ields that result from this in comparison to their grammatical counterparts (e.g., the boy
neuroelectrical activity. couldnt run / walk fast enough); but the onset of this response
For the most part, neurophysical studies of morphology was unafected by lexical frequency for regularly inlected verbs
exploit well-studied event-related response components under (Allen, Badecker, and Osterhout 2003). he pattern suggests
a variety of stimulus conditions (including lexical priming, that for irregular verbs, surface frequency afects the speed with
manipulations of lexical properties such as surface or stem fre- which lexical recognition mechanisms can gain access to (and
quency, and contextual it). In EEG/ERP studies, there are two exploit) inlectional content, but that this is not so for regularly
response components that have been exploited to some advan- inlected forms.
tage: the N400 a negative delection peaking around 400 ms Recent studies have used MEG to explore how morphology
that increases in amplitude after a novel or unexpected lexi- shapes the recognition and interpretation of compounds and
cal stimulus and the P600 a positive current shift following morphologically derived words (Fiorentino and Poeppel 2007;
syntactic anomalies (Kutas and Hilyard 1980; Osterhout and Pylkknen et al. 2004). hese studies provide further support for
Holcomb 1992; Osterhout and Nicol 1999). In MEG, most mor- the view that the detection and exploitation of morphological
phology studies have focused on the M350 response compo- structure play a major part in the early and subsequent stages of
nent believed to relect some of the currents underlying the lexical recognition.
N400 ERP which peaks approximately 350 ms after the presen- Neural methods provide a distinctive and potentially compel-
tation of a word stimulus and is sensitive to stimulus factors such ling source of data for the study of morphology. Still, the set of
as lexical frequency (Embick et al. 2001). compelling neural studies of morphology remains quite small,
Several MEG studies have engaged the connectionist litera- relative to other research methods, and the coherence among
ture on morphology, addressing the issue of whether there exists these studies remains low. his is due in part to the limited avail-
a distinctly morphological level of representation, one that dif- ability of costly neuroimaging equipment and to the limited num-
fers from the representation of meaning and spoken/written ber of scholars who have expertise in both the technical issues of
form. In a MEG priming study, L. Stockall and A. Marantz (2006) morphology and the technical methods of cognitive neurosci-
found that genuine morphological relatives (e.g., givegave, ence. However, as the equipment proliferates and the methods
teachtaught) pattern diferently at the M350 response than gain a stronger foothold in the ield, we can expect that more rig-
morphologically nonrelated word pairs that are merely similar at orous methodological conventions will develop, and that studies
526
Morphology, Neurobiology of Morphology, Universals of
in this emerging area will have an even greater impact on our Pylkknen, L., S. Feintuch, E. Hopkins, and A. Marantz. 2004. Neural
understanding of language and morphology. correlates of the efects of morphological family frequency and family
size: An MEG study. Cognition 91: B3545.
Ehren Reilly and William Badecker Pylkknen, L., and A. Marantz. 2003. Tracking the time course of word
recognition with MEG. Trends in Cognitive Science 7: 1879.
WORKS CITED AND SUGGESTIONS FOR FURTHER READING Pylkknen, L., A. Stringfellow, and A. Marantz. 2002. Neuromagnetic
Allen, M., and W. Badecker. 2000. Morphology: he internal struc- evidence for the timing of lexical activation: An MEG component sen-
ture of words. In What Deicits Reveal about the Human Mind/ sitive to phonotactic probability but not to neighborhood density.
Brain: Handbook of Cognitive Neuropsychology, ed. B. Rapp, 21132. Brain and Language 81: 66678.
London: Psychology Press. Rugg, M. D. 1985. he efects of semantic priming and word repetition
Allen, M., W. Badecker, and L. Osterhout. 2003. Morphological analy- on event-related potentials. Psychophysiology 22: 6427.
sis in sentence processing: An ERP study. Language and Cognitive Rumelhart, D. E., and J. L. McClelland. 1986. On learning the past tenses
Processes 18: 40530. of English verbs. In Parallel Distributed Processing:Explorations in the
Baayen, R. H., T. Dijkstra, and R. Schreuder. 1997. Singulars and plurals Microstructure of Cognition, ed. D. E. Rumelhart, J. L. McClelland, and
in Dutch: Evidence for a parallel dual-route model. Journal of Memory the PDP Research Group, 21671. Cambridge, MA: MIT Press.
and Language 37: 94117. Seidenberg, M. S., and L. M. Gonnerman. 2000. Explaining derivational
Badecker, W. 1997. Levels of morphological deicit: Indications from morphology as the convergence of codes. Trends in Cognitive Sciences
inlectional regularity. Brain and Language 60: 36080. 4: 35361.
Badecker, W., and A. Caramazza. 1991. Morphological composition in Shapiro, K., and A. Caramazza. 2003. Grammatical processing of nouns
the lexical output system. Cognitive Neuropsychology 8: 33567. and verbs in the left frontal cortex? Neuropsychologia 41: 118998.
Beretta, A., C. Campbell, T. H. Carr, J. Huang, L. M. Schmitt, Stockall, L., and A. Marantz. 2006. A single route, full decomposition
K. Christianson, and Y. Cao. 2003. An ER-fMRI investigation of morpho- model of morphological complexity. Mental Lexicon 1: 85123.
logical inlection in German reveals that the brain makes a distinction
between regular and irregular forms. Brain and Language 85: 6792.
MORPHOLOGY, UNIVERSALS OF
Davis, M., F. Meunier, and W. Marslen-Wilson. 2004. Neural responses to
morphological, syntactic, and semantic properties of single words: An he topic of universals has not been as prominent in the area of
fMRI study. Brain and Language 89: 43949. morphology as it has been in some other areas of linguistics.
Embick, D., M. Hackl, J. Schaefer, M. Kelepir, and A. Marant z. 2001. A Many linguists share the impression that morphology is pre-
magnetoencephalographic component whose latency relects lexical dominantly a domain of the language particular, rather than the
frequency. Cognitive Brain Research 10: 3458.
general and universal. Whereas all languages compose words to
Fiorentino, R., and D. Poeppel. 2007. Compound words and structure in
make sentences in one way or another (syntax), it is not cer-
the lexicon. Language and Cognitive Processes 22.7: 9531000.
Gnther, T., F. Longoni, O. Sabri, L. Sturz, K. Setani, and W. Huber.
tain that all languages compose morphemes to make words.
2001. PET study of basic syntax and verb morphology. NeuroImage he Chinese languages, for example, have almost no morphol-
13: 538. ogy apart from the possibility of compounding two roots to make
Kutas, M., and S. A. Hillyard. 1980. Reading senseless sentences: Brain a word.
potentials relect semantic incongruity. Science 207: 2035. While it is probably too strong to say that some languages have
Laiacona, M., and A. Caramazza. 2004. he noun/verb dissociation in no morphological system at all, at least it seems clear that there
language production: Varieties of causes. Cognitive Neuropsychology are no universal morphological categories notions that are
21: 10323. expressed by morphological means in all languages. For exam-
Lck, M., A. Hahne, and H. Clahsen. 2006. Brain potentials to morpholog-
ple, English requires that past tense be expressed as a suix on
ically complex words during listening. Brain Research 1077: 14452.
verbs (stun vs. stunned) and that plural number be expressed as
McKinnon, R., M. Allen, and L. Osterhout. 2003. Morphological decom-
a suix on nouns (box vs. boxes), but there are many languages
position involving non-productive morphemes: ERP evidence.
NeuroReport 14: 8836. in which these notions are not expressed morphologically (e.g.,
Miceli, G., and A. Caramazza. 1988. Dissociation of inlectional and deri- Yoruba); they are either expressed by syntactic constructions or
vational morphology. Brain and Language 35: 2465. not expressed at all. Moreover, morphology is a notorious reposi-
Miozzo, M. 2003. On the processing of regular and irregular forms of verbs tory of many historical relics, irregularities, exceptions, and idio-
and nouns: Evidence from neuropsychology. Cognition 87: 10127. syncrasies. For example, the plural of box in English is boxes, but
Morris, J., and P. Holcomb. 2005. Event-related potentials to violations the plural of ox is oxen; similarly, the past tense of stun is stunned
of inlectional verb morphology in English. Cognitive Brain Research but the past tense of run is ran. Irregularities of this sort are tol-
25: 96381. erated in morphology in a way that they may not be (or not as
Mnte, T., S. Tessa, H. Clahsen, K. Schlitz, and M. Kutas. 1999.
much) in other linguistic domains. What universals we can hope
Decomposition of morphologically complex words in
to ind in morphology, then, are statistical and implicational
English: Evidence from event-related brain potentials. Cognitive
universals, rather than absolute universals (see absolute and
Brain Research 7: 24153.
Osterhout, L., and P. J. Holcomb. 1992. Event-related brain potentials statistical universals).
elicited by syntactic anomaly. Journal of Memory and Language Despite the inherent noisiness of morphology, some uni-
31: 785806. versals of these sorts are discernable. Perhaps the best known
Osterhout, L., and J. Nicol. 1999. On the distinctiveness, independence, and best understood are universals of markedness. hese uni-
and time course of brain responses to syntactic and semantic anomal- versals have the form of statements saying that no language will
ies. Language and Cognitive Processes 14: 283317. have an aix that expresses a marked category Y unless it also
Pinker, S., and M. Ullman. 2002. he past-tense debate: he past and has an aix that expresses a less marked category X within the
future of the past tense. Trends in Cognitive Science 6: 45663.
527
Morphology, Universals of
same semantic domain. For example, many languages (includ- aix of the language is indexed as being able to appear (ideally)
ing English) have a plural form for nouns but no dual form that in one and only one of these slots.
means exactly two, whereas there are few or no languages that Another putative universal of morphology is that inlectional
have morphological marking for the category dual without also morphology can only appear outside of (further from the root
marking the category plural. Similarly, verbs in a given language than) derivational morphology. Roughly speaking, derivational
do not express the distinction between inclusive irst person morphology creates new words by adding aixes to existing
plural (we including you) and exclusive irst person plural (we stems, whereas inlectional morphology creates the forms of a
not including you) without also expressing the more basic dis- word that a speciic syntactic context might require. Examples
tinction between irst person and second person (we versus of derivational aixes in English include -ize (crystalcrystallize),
you). Along the same lines, languages do not have special aixes -ship (friendfriendship), -less (carecareless), -able (liftliftable),
for remote past and remote future without also having aixes and -ing (clipclipping). Examples of inlectional aixes in
for simple past and simple future, and if a language makes any English include the plural aix that attaches to nouns (crystal-
aspect distinctions in its verbal morphology, it will make a dis- crystals, friendfriends) and the past tense aix that attaches to
tinction between imperfective aspect and perfective aspect. verbs (liftlifted, clipclipped). Now, there is no problem adding
A plausible reason why universals of this sort hold has to do inlectional morphology to the output of derivational morphol-
with the logic of features. It is assumed that more marked, seman- ogy: Words like fossil-ize-d, friend-ship-s, and clip-ping-s are per-
tically complex categories are built up out of simpler categories. fectly possible. But the reverse order is not allowed: he process
For example, the category dual shares a semantic feature [Group] of having a solution turn into more than one crystal is not to
with the category plural but adds a feature such as [Minimal], *crystal-s-ize and the state of having many friends is not *friend-s-
which it shares with singular (Harley and Ritter 2002). It stands ship, nor is a *clip-ped-ing something that was clipped out in the
to reason, then, that a language will not have morphemes that past. Also bad are words like *lift-ed-able and *care-s-ful (having
realize a more complex feature bundle like [Group, Minimal] many cares). A similar constraint says that inlectional morphol-
(dual) without also having morphemes that realize the simpler ogy cannot be found inside a compound word: One can have
feature bundles [Group] (plural) and [Minimal] (singular) that doghouse but not *dogshouse (a large house intended for more
this bundle properly contains. It seems likely that this vision can than one dog); one can have a pickpocket but not a *pick-ed-
be extended to the full range of nominal and verbal inlectional pocket (a thief who has already done his dirty work). One can cal-
categories, although many details remain to be worked out. culate what these words should mean, and in some cases one can
One of the most general universals of morphology is that the imagine uses for the word; nevertheless, the examples are at best
order of morphemes in a complex word is almost always rigidly highly marked and unlikely to be used. Similar restrictions can
ixed. Almost all languages allow the words of a sentence to be be observed in many other languages, although a limited range
rearranged to some degree for stylistic or pragmatic efect. of counterexamples has occasionally been pointed out. here are
his is particularly true of a language such as Mohawk, in which also some unresolved questions about how exactly to deine the
any word order is usually possible. In contrast, no language diference between derivational and inlectional morphology,
allows the morphemes in a complex word to be freely rearranged which need to be clariied to make this generalization meaning-
in this way. For example, the following Mohawk word consists ful and applicable in all cases (see, for example, Anderson 1982).
of 11 distinct morphemes, but any other ordering of these mor- Nevertheless, there is little doubt that an important universal
phemes is ungrammatical: characteristic of morphological systems lurks here.
Some iner-grained universal restrictions have been discov-
(1) Wa-sha-ko-t-yat-awi-tsher-ahetkv-t-v
ered. One idea, supported in many studies, is that the order of
FACT-he-her-self-body-put.on-NOML-be.ugly-make-for-
morphemes in a complex word relects the scope of those mor-
PUNC
phemes the order in which they were composed for syntax and
He made the thing you put on the torso [i.e., a shirt or dress]
semantics (Baker 1985, 1988; Bybee 1985; Cinque 1999; Rice
ugly for her.
2000). An example from derivational morphology is the follow-
here are occasional examples that might seem like exceptions to ing pair from Quechua (Baker 1988):
this rule, when two aixes can come in diferent orders to express
(2) a. Mikhu-naya-chi-wa-n.
diferent semantic scopes (as will be seen), or when a morpheme
eat-want-make-1sO-3sS
is displaced from its expected position in order to respect con-
It makes me feel like eating.
ditions of phonological well-formedness. But even such devia-
b. Mikhu-chi-naya-wa-n.
tions, which are strongly motivated by semantic or phonological
eat-make-want-1sO-3sS
concerns, stand out as being rather unusual. A striking univer-
I feel like making someone eat.
sal of morphology, then, is that morpheme order is ixed for a
language, and is not permitted to vary for pragmatic or stylistic he suixes chi to make and naya to want can attach to a verb
reasons. structuralist linguists and descriptive linguists stem in either order, but there is a systematic diference in mean-
commonly capitalize on this property of morphology by using ing. If make attaches before want, the combination means
the device of a template to describe the morphological structure to want to make someone eat, whereas if want attaches
of words in a given language: A set number of morphological slots before make, the combination means to make someone want
(position classes) are identiied for each word class, and every to eat. he order in which the aixes attach in Quechua matches
the order in which the words are combined syntactically in the
528
Morphology, Universals of
corresponding English paraphrases, and this in turn relects how Cinque (1999) presents a more ine-grained approach of this
the meanings are composed semantically in both languages. Not kind, in which some 30 distinct inlectional categories are identi-
all languages that have similar aixes allow both of the orders ied, each of which is shown to attach to a verb in a set order rel-
shown in (2), but it is generally true that the orders that are used ative to all the others. (It is possible that the ban on inlectional
correspond systematically to the order of interpretation for pur- morphology coming inside of derivational morphology is a spe-
poses of syntax and semantics. cial case of this mirror principle/relevance principle, though that
Compositional ordering efects of this kind are rather wide- is not obvious in all cases.)
spread and apply to diferent types of morphology. One can see In some of the more recent literature, however, there have
something similar in English in the domain of compounding. For been hints that the ixedness of morpheme order might be even
example, ethics committee proposal refers to a proposal by or for more restricted than one would expect, given considerations of
the ethics committee, whereas ethics proposal committee refers semantic compositionality alone. For example, Larry Hyman
to a committee in charge of formulating an ethics proposal. As in (2003) discusses examples like (5) from Chichewa:
Quechua, the diferent morpheme orders relect diferent orders
(5) Alenj a-ku-tks-its-il-a mkz mthko. (*a-ku-tks-il-its-a)
of semantic composition. he only diference between the two
Hunters 3.PL-PROG-stir-make-APPL-FV woman spoon
cases is that chi and naya are bound aixes that must attach to
he hunters are making the woman stir with a spoon.
verb roots, whereas proposal and committee are roots that can be
used as words in their own right in English. he observation that At issue here are the suixes its causative and il applicative,
morpheme orders must directly relect the order of syntactic/ which (in this case) adds the meaning of doing an action with
semantic composition is sometimes called the mirror principle a particular instrument. From the semantic point of view, one
(Baker 1985); Bybee (1985) refers to a similar idea as the principle would think that these aixes should be able to attach to the verb
of relevance. in either order, giving diferent compositional meanings. One
Much the same constraint seems to hold of inlectional mor- could start with the base verb stir, add the applicative aix il to
phology as well, except that in this domain, there are very few get stir with a spoon, and then add the causative to get make
cases in which the order of morphemes can be reversed to give a someone [stir with a spoon]. Alternatively, one could add the
semantic efect. For example, Joseph Greenberg (1963) showed causative aix to the base verb irst to get a causative action
that when both number marking and case marking are attached make someone stir and then add the applicative aix to create
to a noun root, the number marking almost always attaches [make someone stir] with a spoon. In the irst case, it would be
before the case marker does: the woman who is using the spoon to stir; in the second case, it
would be the hunters who are using the spoon to impose their
(3) a. adam-lar-a (man-PL-DAT) to the men (Turkish)
will on the woman. Yet only the second aix order is possible,
b. *adam-a-lar
and this form is ambiguous (or perhaps vague) concerning
he reason is presumably because the plural operator is deined the two imaginable meanings.
semantically over the meaning of the noun itself, whereas the Hyman himself points to a historical explanation of the
function of the case marker is to relate the noun phrase as a absence of a second form in (5). He shows that the same restricted
whole to the rest of the sentence in which it appears. hus, in ordering holds true for a wide range of Bantu languages spoken
(3a) the order of the aixes relects the natural order of the in sub-Saharan Africa and that it is a special case of a more far-
semantic composition, just as in the examples in (2). In contrast, reaching template, which stipulates not only that applicative
(3b) is bad because the relevant semantic operators do not must follow causative but also that the reciprocal suix can only
combine that way. he morphological universal in (3) can thus follow both of these, and the passive suix can only follow all of
be related to the fact that in languages like English, the plural the others. He claims that this particular template was inherited
marker must attach directly to the noun, not to the prepositional from Proto-Bantu by most of the descendant languages. But this
phrase that contains the noun (to the boy+s, not *to+s the boy). sort of historical explanation might not be general enough. First,
Joan Bybee (1985) applies the same kind of reasoning to verbs it begs the question of why the relevant aixes had to attach in
(see also Bybee, Perkins, and Pagliuca 1994). When verbs bear this particular order in the ancestor language. Second, it turns
multiple inlectional aixes, they almost always come in a ixed out that most non-Bantu languages also allow the causative aix
order: An aspect marker attaches irst, a tense marker attaches to attach before applicative but not vice versa as well. Example
outside an aspect marker, mood markers attach outside both (6) shows that this is true for classical Nahuatl (spoken in Mexico
tense and aspect, and subject agreement markers attach last of [Launey 1981, 197]); it also holds for Mohawk (northeastern
all (though the position of agreement is a bit more variable than United States), Hiaki (southwestern United States), Shipibo
the others). Example (4) thus shows a typical morpheme order; (Peru), Mapudungun (Chile), and many others.
other orders are rare or nonexistent.
(6) Ti-ne:ch-in-tlacua-l-ti:-li-a in no-pil-hua:ntoto:n.
(4) aku-wye-a-y-mi. (Mapudungun) 2sS-1sO-PL-eat-caus-appl the my-chidren
arrive-PERF(aspect)-FUT(tense)-IND(mood)-2sS(agreement) You made my children eat for me. (*Ti-ne:ch-in-tlacua-li-tia)
You will have arrived.
Something more general seems to be at work here.
Notice that the English auxiliary verbs appear in essentially the In a similar vein, Gabriella Caballero and her colleagues
same relative order, suggesting that this, too, can be attributed (2006) have recently argued that the morpheme order noun-
to domain-general facts about semantic composition. Guglielmo verb is universally preferred to the order verb-noun whenever a
529
Morphology, Universals of Motif
noun and a verb combine to form a single verb. his holds true MOTIF
regardless of the speciic mode of combination, whether it is
Motif is a unit of measurement and content analysis. It is applied
the result of syntactic noun incorporation, productive morpho-
to expressive culture, especially literatures and certain branches
logical compounding, or idiosyncratic lexical combination. he
of the arts, such as painting, sculpture, and music. he system-
order of morphemes in the Mohawk example in (7) is thus typical
atic application of the term was established in the early 1900s
in this respect:
through the work of the Finnish School and its attempts to use
(7) Wa-ke-nakt-a-hninu- (not: hninu-nakt buy-bed) an international folktales history (time) and dispersal in social
Fact-I-bed-0-buy-PUNC and physical space (geography) to reconstruct the tales orig-
I bought a bed. inal form (urform, archetype), place of birth, and other related
matters of diffusion. his approach is also known as the his-
his universal order is occasionally overridden by syntactic
toric-geographic method for its reliance on objective veriiable
ordering principles in particular languages (like Mapudungun).
criteria, rather than speculative hypotheses. For the Finnish
But noun-verb order is always preferred by the morphology, and
School, two key concepts became indispensable research instru-
it emerges more strongly in the more lexicalized, purely morpho-
ments: tale-type, a problematic term signifying a full folktale
logical constructions, where contamination from syntactic fac-
known cross-culturally, and motif, a smaller unit designating a
tors is minimal.
detail contributing to the formation of the plot. Motif complex/
Taken together, studies like these hint that there might be a
cluster/sequence and episode are other related measurement
universal morphological template, roughly of the form noun-
units. hough arising from the works of the Finnish School,
verb-causative-applicative-passive. his template appears to be
perceived by many as useful only in comparative studies, and
a force (though not an irresistible one) that is at work inluencing
shackled by problems of name interpretation and linkage to the
the morpheme orders of all languages. Moreover, this order does
currently unfashionable quest for origins, the usefulness of these
not seem to reduce to historical factors or semantic composition.
terms as tools of data identiication and objective analysis tran-
Why this should be is unknown at this point. However, it seems
scends these limitations (El-Shamy 1997, 235).
likely that more morphological universals of this sort will be dis-
he most salient attributes of a motif are its endurance (con-
covered in the future as linguists recover from their impression
tinuity in time) and recurrence within a community (continuity
that morphology is primarily the domain of the idiosyncratic and
in space). Continuity in time and space are basic requirements
the language particular.
for traditionality. In 1925, Arthur Christensen argued that a motif
Mark C. Baker persists in tradition according to a psychological law which is
not easily explicable. his empirical characteristic was left unex-
WORKS CITED AND SUGGESTIONS FOR FURTHER READING plained (Bdker 1965, 2013). From the perspective of the bearer
of lore and the author or composer of elite art, certain traditional
Anderson, Stephen. 1982. Wheres morphology? Linguistic Inquiry
themes possess cognitive salience (impressiveness) that make
13: 571612.
Baker, Mark. 1985. he mirror principle and morphosyntactic explan-
them stand out and grab a persons attention. his salience (log-
ation. Linguistic Inquiry 16: 373415. ical or afective) may be due to frequency of occurrence (repeti-
. 1988. Incorporation: A heory of Grammatical Function Changing. tion), meaningfulness, structure, uniqueness, ego involvement,
Chicago: University of Chicago Press. and so on, properties that make such themes easily perceived,
Bybee, Joan. 1985. Morphology: A Study of the Relation Between Meaning learned, retained, and recalled (El-Shamy 1997).
and Form. Amsterdam: John Benjamins. he concept of motif is a close parallel to that of theme. he
Bybee, Joan, R. Perkins, and W. Pagliuca. 1994. he Evolution of two terms are often used interchangeably. However, theme has
Grammar: Tense, Aspect, and Modality in the Languages of the World. dominated in the study of elite literature, whereas motif has
Chicago: University of Chicago Press. been more common in the study of folklore. Research in literary
Caballero, Gabriella, Michael Houser, Nicole Marcus, Teresa
themes has commonly been pursued because of its interpret-
McFarland, Anne Pycha, Maziar Toosarvandani, Suzanne Wilhite,
ive potentialities and its intrinsic congruency with the history
and Johanna Nichols. 2006. Nonsyntactic ordering efects in syn-
tactic noun incorporation. Manuscript, University of California at
of ideas (Jost 1988, xv). Additionally, literary authorities have
Berkeley. considered it an eicient counteragent against primarily aes-
Cinque, Guglielmo. 1999. Adverbs and Functional Heads: A Cross- thetic movements, such as progressive Universalpoesie. It is
Linguistic Perspective. New York: Oxford. also argued that the motif is intellectual by nature. It expresses
Greenberg, Joseph. 1963. Universals of Language. Cambridge, MA: MIT a process of reasoning about mens conduct of life and, as a con-
Press. sequence, does not concern itself with the analysis of individual
Harley, Heidi, and Elizabeth Ritter. 2002. A feature-geometric analysis of characters or extraordinary happenings (Jost 1988, xvii).
person and number. Language 78: 482526. he viewpoint outlined here addressing elite literature stands
Hyman, Larry. 2003. Suix ordering in Bantu: A morphocentric
at some variance with the folkloristic usage of the term motif and
approach. In Yearbook of Morphology 2002, ed. Geert Booij and Jaap
the perceived scope of its applicability. Introducing his Motif-
van Marle, 24582. Dordrecht, the Netherlands: Kluwer Academic
Index, Stith hompson explained that his system is built around
Publishers.
Launey, Michel. 1981. Introduction la Langue et la Littrature the interest of students of the traditional narrative and would
Azteques. Vol. 1. Paris: LHarmattan. address a certain type of character, action, as well as atten-
Rice, Keren. 2000. Morpheme Order and Semantic Scope. Cambridge: dant circumstances of the action (1955, I.11). (See, respectively,
530
Motif Movement
for examples: W10, Kindness; T72.2.1, Prince marries scorn- Dundes, Alan. 1964. he Morphology of North American Indian Folktales.
ful girl and punishes her; R216, Escape from ship while captors Folklore Fellows Communications, no. 195. Helsinki: Academia
quarrel). Scientiarum Fennica.
Inluenced by anthropology, hompsons system may be El-Shamy, Hasan. 1995. Folk Traditions of the Arab World: A Guide to
Motif Classiication. 2 vols. Bloomington: Indiana University Press.
compared to the analytical-classiicatory devices of culture ele-
. 1997. Psychologically-based Criteria for Classiication by Motif
ment, culture complex, and culture institution with culture ele-
and Tale-Type. Journal of Folklore Research 34.3: 23343.
ment being the smallest identiiable component of culture. In Garry, Jane, and Hasan El-Shamy, eds. 2005. Archetypes and Motifs in
congruence with his companions in the Historic-Geographical Folklore and Literature: A Handbook, Armonk, NY: M. E. Sharpe.
School, hompson saw folk literature (especially narratives) as Jost, Franois. 1988. Introduction. In Dictionary of Literary hemes
analyzable in terms of motifs, episodes (or motif complexes/ and Motifs, ed. Jean-Charles Seigneuret, xv-xxiii. Westport,
sequences), and full narrative plots constituting tale-types. A CT: Greenwood.
motif, though considerably more intricate, is comparable to Murdock, G. P., Clelland S. Ford, and Alfred E. Hudson. 1938. Outline of
culture element; culture complex is comparable to episode; and Cultural Materials. New Haven, CT: Institute of Human Relations, Yale
culture institution is comparable to tale-type. For hompson, University.
motifs are those details out of which full-ledged narratives are Sperber, Hans, and Leo Spitzer. 1918. Motiv und Wort, Studien zur
Literatur-und Sprachpsychologie. Leipzig: O. R. Reisland.
composed (1955, I.10).
hompson, Stith. 19558. MotifIndex of Folk Literature. 6 vols. Revised
Explaining the rationale for his motif system and its main
ed. Bloomington: Indiana University Press.
objective, hompson declares that it emulates what the sci-
entists have done with the worldwide phenomena of biology
(1955, I.1011). In this respect, the underlying principle for
MOVEMENT
motif identiication and indexing is comparable to that devised
by anthropologists at Yale for categorizing culture materials Movement is an operation posited in theoretical syntax, in which
in terms of 78 macrounits (1088) and 629 subdivisions thereof words, phrases, and perhaps also morphemes are relocated from
used to establish he Human Relations Area Files (HRAF); one part of a sentence to another. Movement is generally invoked
these iles, begun almost contemporaneously with the irst pub- in cases in which a phrase has a combination of properties asso-
lication in the 1930s of hompsons Motif-Index, may be viewed ciated with distinct positions in the sentence. Consider (1):
as an unprinted index. Twenty-three divisions make up the spec-
(1) Whom did you see?
trum of sociocultural materials covered in hompsons Motif-
Index, each treated in an independent chapter (e.g., B. ANIMALS; Approaches to syntax that posit movement typically claim that
C. TABU; F. MARVELS; X. HUMOR). hese cardinal themes are the word whom in (1) has moved from the end of the sentence,
divided into 1,730 subdivisions (El-Shamy 1995). where the direct object would ordinarily be, to the beginning.
Because hompsons Motif-Index seeks global coverage, his explains why, for instance, whom bears the accusative case;
numerous geographic regions and national entities did not whom has properties associated with direct objects, because it
receive adequate attention. Consequently, signiicant ields of has occupied the direct objects position.
human experience are missing or sketchily presented. Major One type of argument for movement tries to establish that
expansions are ofered in ensuing works, for example (note: the an apparently empty position must be illed, and that the logical
sign indicates addition to hompsons system): F70, Ascent iller is a moved phrase. For example, a verb like ix must take a
to other planets (worlds) by space ship (lying saucer); J70, direct object, which is typically in immediately postverbal posi-
Teaching (training) by cruel example; P610, Homosociality tion in English:
[]; P770, Markets: buying, selling, trading; X580, Humor
(2) a. You ixed the car.
concerning misers and miserliness (El-Shamy 1995; cf. Birkhan,
b.*You ixed.
Lichtblau, and Tuczay 2005).
c.*You ixed yesterday the car.
An ofshoot of the motif system is the concept of motifeme,
a hybrid of folklore and linguistics modeled after morphological However, a question like (3a) is well formed (unlike (3b):
analyses of folktales. In this system, an abstract unit of action or
(3) a. What did you ix?
state is labeled motifeme, and its manifestations are motifs. he
b. *When did you ix?
variety in which a given motifeme is manifested is termed allo-
motif (Dundes 1964). he ill-formedness of (3b) shows that the required transitivity of
ix is not suspended in questions; ix must have a direct object in
Hasan El-Shamy
(3a). he contrast in (3) suggests that the direct object is what,
since what is present just in the well-formed (3a). Example (2c)
WORKS CITED AND SUGGESTED FURTHER READING
shows that the direct object in English must typically be imme-
Birkhan, Helmut, Karin Lichtblau, and Christa Tuczay. 2005. Motif-Index diately postverbal; we can preserve this generalization in (3a)
of German Secular Narratives from the Beginning to 1400. 6 vols. Berlin by positing movement of what from postverbal position to the
and New York: W. de Gruyter. beginning of the sentence.
Bdker, Laurits. 1965. Folk Literature (Germanic). Vol. 2 of International
Other arguments for movement try to show that a moved
Dictionary of Regional European Ethnology and Folklore.
phrase has occupied positions in the sentence that it no longer
Copenhagen: Rosenkilde and Bagger.
occupies. For instance, James McCloskey (2000) discusses a
531
Movement
West Ulster dialect of English in which the questions in (4a) and (9) a. What did he say that he wanted all?
(4b) are synonymous: b. What did he say all that he wanted?
c. What all did he say that he wanted?
(4) a. What all did you buy?
b. What did you buy all? he examples in (9) difer with respect to how far all travels with
c. *What did all you buy? what: not at all (9a), only to the intermediate site (9b), or to the
beginning of the clause (9c).
The position of all is not completely free in this dialect, as
Another controversial type of movement is involved in the
the ill-formed (4c) shows. In fact, all can only appear in posi-
Chinese and Japanese wh-questions in (10a) and (10b), both of
tions that what has occupied. In (4b), all appears in the direct
which have the same meaning as the English (10c):
object position; this is the position that what occupies before
moving to the beginning of the sentence. Theories that posit (10) a. John mai-le sheme?
movement can account for the distribution of all in this dia- John buy Perf what
lect; the phrase what all can leave the word all behind when b. John-wa nani-o kaimasita ka?
it moves. John TOP what ACC bought Q
Another argument for movement is based on the phenom- c. What did John buy?
enon of reconstruction, in which moved phrases are treated by
In English, what is moved to the beginning of the clause, but in
certain semantic dependencies as though they had not moved.
Japanese and Chinese, the corresponding words can appear in
Space constraints prevent further discussion here (cf. Romero
noninitial positions, just where they would be if they were not
1997; Fox 2000).
wh-phrases (wh-in-situ). On one approach to these data, Chinese
Syntacticians distinguish several subtypes of movement. he
sheme and Japanese nani, like their English translation what,
examples thus far have all involved wh-movement, which forms
undergo wh-movement to the beginning of the clause; unlike
questions by moving certain phrases to the beginning of the
what, however, these words move in a way that does not afect
clause. Another type, head-movement, derives (5b) from (5a), via
where they are pronounced (covert movement).
movement of is:
Some arguments for covert movement center on the inter-
(5) a. He is leaving. action of wh-in-situ with established constraints on movement
(cf. Huang 1982; Richards 2009). For instance, wh-movement
b. Is he ___ leaving? in many languages is unable to pass out of an embedded inter-
rogative clause; we say that interrogative clauses are islands for
wh-movement. hus, (11a) is well-formed, but (11b), with wh-
A third type is sometimes called NP-movement; examples
movement out of an interrogative clause, is not:
include movement of John in (6a) (compare the roughly syn-
onymous [6b]), and movement of Mary from object to subject (11) a. What does Mary think that John bought __ ?
position in (7):
b. *What does Mary wonder [who bought __ ]?

(6) a. John seems __ to be happy.
b. It seems that John is happy.

Japanese wh-in-situ exhibits a similar constraint; the sentences
in (11) have the Japanese translations in (12) with a similar con-
trast in grammaticality:
(7) Mary was promoted __. (12) a. Mary-wa John-ga nani-o katta to omoimasu ka?
Mary TOP John NOM what ACC bought that thinks Q
Other types of movement are more diicult to detect. For b. * Mary-wa [dare-ga nani-o katta ka] siritagattei-
instance, much work argues that wh-movement in (8) is not a masu ka?
single move from the end of the sentence to the beginning, but Mary TOP who NOM what ACC bought Q wonders Q
stops in at least one intermediate landing site (successive-cyclic
movement): Norvin Richards
(8) What did he say that he wanted __? Fox, Danny. 2000. Economy and Semantic Interpretation. Cambridge,
MA: MIT Press.
One argument for this type of movement comes from West Ulster Huang, James. 1982. Logical relations in Chinese and the theory of
English. We saw that this dialect allows all to be stranded in posi- grammar. Ph.D. diss., Massachusetts Institute of Technology.
McCloskey, James. 2000. Quantiier loat and wh-movement in an Irish
tions formerly occupied by a wh-moved phrase. he all-strand-
English. Linguistic Inquiry 31: 5784.
ing facts for this dialect in (9) show that wh-movement has an
Richards, Norvin. 2009. Wh-questions. In he Oxford Handbook
intermediate landing site:
of Japanese Linguistics, ed. Shigeru Miyagawa and Mamoru Saito.
532
Music, Language and
Romero, Maribel. 1997. he correlation between scope reconstruc- yield, as Brown notes, analogies between music and language,
tion and connectivity efects. In Proceedings of WCCFL 16, 35166. helpful and suggestive analogies, to be sure, but which do not
Stanford, CA: CSLI. constitute arguments for a shared cognitive basis.
Musicalist Representation of Linguistic Structure

MUSIC, LANGUAGE AND he studies just alluded to are representative of recent scholar-
Connections between music and language have been a peren- ship in that they attempt to ground subjective judgments with
nial concern of scholars, poets, music and literary theorists, and respect to musical structure on the hard foundation provided by
musicians going back to antiquity. he basis of this interest lies linguistic science. While this has been the prevailing direction
in certain commonalities that are intuitively understood to lie at of inluence, it has at times extended in the opposite direction.
the heart of the two capacities, but which become complex and Most notably, musical notation for several centuries constituted
problematic when one attempts to elucidate their precise nature. the only efective means for visually representing the structure
Confusion on the question of the interaction of music and lan- of audible sound. Among the acoustical phenomena rendered
guage is not surprising since the underlying basis of music and visible and thereby amenable to a structural analysis were speech
language as independent objects has been poorly understood sounds of English carefully transcribed into a modiied form of
until recently. Nor is it surprising, given traditional disciplin- musical notation by Joshua Steele in his 1775 Essay Towards
ary divisions, that scholars have tended to focus on developing Establishing the Melody and Measure of Speech.
descriptive and explanatory frameworks (see descriptive, In a recent review of the work, Jamie Kassler (2005) credits
observational, and explanatory adequacy) for each, Steele as being among the irst to identify linguistic supraseg-
which take for granted their status as independent rather mentals the tier of linguistic structure computed indepen-
than common faculties. With the exception of Jean-Jacques dently of and mapped onto phonemic segments. tone, the
Rousseaus ([1763] 1997) Essai sur lorigine des langues, which hierarchically related sequence of pitch locations assigned to
claims an ancestral proto-language from which language and voiced segments, is one such suprasegmental and is relatively
music both derive, most attempts at engaging the question have naturally represented in musical notation. Steele also recog-
stressed that music and language not only serve distinct ends, nized that unlike musical pitch, which tends to be discrete, the
one mainly aesthetic and the other mainly communicative, but target pitches of speech are consistently connected by contin-
also access distinct underlying psychological means. uous glissandi or slides, represented in his scores by diagonal
he umbrella of cognitive science has provided the context line segments of various types attached to note stems, shown in
for a renewed discussion of some of these points of comparison. Figure 1.
Perhaps most striking is the reemergence of arguments for an he other linguistic suprasegmental identiied by Steele,
evolutionary precursor in the form of what Stephen Brown (2000) stress, emerges somewhat obliquely from his transcriptions.
refers to as musilanguage whose essential characteristics are One of Steeles important insights was to have recognized that a
identiied in Stephen Mithen (2005) by the acronym hmmmm particular type of musical accent, the metrical accent, is associ-
(holistic, manipulative, multimodal, musical, and mimetic). ated with linguistic stress. hus, for example, the initial beat of
hese approaches are somewhat controversial not only in their the musical measure is metrically strong, and the most stressed
endorsement of what might be called a neo-Rousseauvian per- syllables of a text assigned to a tune (generally the stressed
spective but also in their assumption of a signiicant overlap in syllables of polysyllabic words Peter, going, mistake, and coming
some of the cognitive structures that underlie both music and in Figure 1) are assigned to what he refers to as the ictus position.
language. he best-known explorations of the common ground, Finally, and most signiicantly, Steele recognized that metrical
Leonard Bernsteins he Unanswered Question, Deryk Cookes accent is not an objective feature of the musical event but is a
he Language of Music, and Joseph Swains Musical Languages psychological attribute inherited from its temporal location. A
have not suggested any speciic shared mechanisms. Rather, strong position will be perceived as such regardless of whether the
they and others have applied certain aspects of the descriptive event occupying the position is objectively accented in the form
apparatus and general methodologies of linguistic theory to of higher pitch, amplitude, or length. Indeed, it may be heard as
strong even when it is vacant occupied by a rest. Metrical accent
Figure 1.
533
Music, Language and
the same representation, namely, the grid, would emerge as the

optimal means for representing linguistics stress. Indeed, met-
rical stress theory, the dominant explanatory framework within
the generativist paradigm would be deined by the grid represen-
tation, one variant of which is shown in example 3 (from Halle
Figure 2. and Vergnaud 1987) (Figure 3).
A comparison of the grids in examples 2 and 3 reveals two
essential diferences between linguistic and musical structure.
* level 3
First, while the stress grid projects syllables onto higher metri-
* level 2
cal levels, the bottom level of the musical grid indicates not
** * level 1
** ** * level 0 actual musical events (i.e., notes) but, rather, temporal loca-
Ticonderoga tions. As a consequence, empty metrical locations, such as those
in example 2, which are a necessary component of any reason-
Figure 3.
able description of musical meter, are excluded from the stress
grid. Secondly, as Jackendof and Lerdahl (1983) show, musical
is therefore, in Steeles words, a subjective mental sensation structure imposes strict requirements on the geometric form
deriving from a sense of pulsation giv[ing] the mind an idea that grids may assume, limited to what they refer to as a small
of emphasis independent of any actual increment of sound or class of well-formed structures. In contrast, there are no a priori
even of any sound at all (1775, 117). In recognizing the abstract constraints on the form taken by the stress grid. he successive
character of meter, he anticipated twentieth-century cognitiv- positions projected onto line 1 of example 3 would be ruled out
ist approaches that view linguistic stress, along with most other as a potential metrical structure in music, where strong positions
salient characteristics of language, as mental constructs, pho- need to be separated by at least one position on level 0. his vio-
nological rather than phonetic, psychologically real but lation of musicalist well-formedness does not, however, prevent
only obliquely related to the acoustical or physiological surface example 3 from accurately characterizing the pattern of second-
form. ary and primary stress for the word in question.
he asymmetries in the two forms of representation are, it
The Grid Representation would seem, necessary for a description of the output of each
he measure of speech referred to in Steeles title the pat- system: As mentioned, the projection of musical meter requires
terned occurrence of strong and weak metrical positions is an underlying temporal periodicity that is neither intuitively
represented in his transcriptions by a three-level hierarchy obvious nor empirically demonstrable in language except as a
appearing below the staf in example 1: Heavy, light, and light- statistical regularity (see Patel and Daniele 2003 for discussion).
est locations within each measure are assigned a triangle, three In addition, the asymmetry relects essential diferences in the
dots, and two dots, respectively. his would be the irst, and for character of the basic elements of musical versus linguistic struc-
many years one of the few, attempts to make explicit the under- ture. he assignment of stress is a formal computation efected
lying form corresponding to the way that meter is mentally con- on syllables from the rich phonemic inventory of particular
structed by listeners. When this objective would be reinitiated languages. In contrast, the computation of musical meter what
in the 1970s, most notably within the generative theory of Ray is known in the music perception literature as beat induction
Jackendof and Fred Lerdahl (1983), the representation would can be efected on a highly impoverished musical input. As has
take the form of the metrical grid shown in example 2 from been shown repeatedly, a listener will unproblematically assign
Mozarts Symphony 40 (Figure 2). It will be noticed that this a metrical structure, even when the events to which it is assigned
example omits the conventional notational means for indicat- appear as series of pitchless claps, clicks, or drumbeats. he sorts
ing the metrical hierarchy barlines, the beaming of eighth notes, of subtle variation in timbre and pitch characteristic of the pho-
and the time signature. It can do so since these are indicated nemic repertoire may tip the balance between competing met-
with greater precision by the grid, which identiies the relative rical interpretations when these appear in a musical context;
prominence of particular locations by their inclusion at suc- however, they are in themselves insuicient for the inference of
cessive horizontal tiers, referred to as higher levels of the grid. meter.
Relatively strong positions at the measure, half note, and quar-
ter notes are represented by columns appearing above these Rhythmic Structure in Language and Music
locations, while weak positions at the eighth-note level appear It is worth noting that the uncoupling of a musicalist interpre-
only at the lowest level of the grid. tation from metrical grids in their application in most variants
hat metrical structure is a fundamental component of of metrical stress theory is, in some respects, inconsistent with
music or, to put it informally, that music frequently has a the grid notation as it was proposed in work by Mark Liberman
beat is, of course, self-evident to most listeners. hat normal ([1975] 1979), undertaken concurrently with Jackendof and
linguistic utterances are rhythmic in anything like the same Lerdahl (1983). Here, the intention was explicitly musicalist,
sense is less apparent and remains a subject of some controversy namely, to relate the metrical structure of simple childrens
within linguistics. For this reason, it might appear surprising songs and chants to the syntactic and phonological structure
that as phonologists confronted a range of data provided by a of the words and phrases assigned to them. In contrast to most
cross section of the worlds languages, it became apparent that approaches, cognitivist and traditional, which view musical and
534
Music, Language and
L (2) and language used to be intertwined, they parted ways long ago.
L (1) Musical meter and meter of verse texts cannot be equated; musi-
L (0) cal theories of meter need no resurrection (2002, 39).
a) * Thirteen men Tarlinskaya probably represents a consensus position among
scholars of metrics in doubting that there is signiicant evidence
for the inluence of musicalist rhythm on the structure of literary
L (2)
L (1) verse. An ofshoot of generative metrics has been able to avoid
L (0) this problem by taking as its primary empirical domain texts
b) Thirteen men that are unambiguously intended as functioning with a musical
context, namely, lyrics of familiar strophic song forms. he basis
Figure 4.
of this work is the recognition that average listeners encounter-
ing unfamiliar texts for a familiar strophic song will sometimes
linguistic computations as independent and self-contained, efect considerable modiications in the structure of the original
Libermans objective was to establish an equivalence between to accommodate the text. hus, as noted in Halle and Lerdahl
the underlying representation of song and speech, a connection (1993), those minimally competent in the relevant linguistic and
that was understood by Liberman to be in some ways a very musical idiom will delete 3 of the original 10 notes of the song
deep one (1975, 81). his hypothesis, whatever its ultimate heu- he Drunken Sailor when they encounter the 7-syllable text
ristic or conceptual value, has not been inluential within the Keel haul him til hes sober, while augmenting the original
ield of linguistics or in music theory. melody with 3 additional notes when confronted with the 13 syl-
Two partial exceptions should, however, be mentioned. lables Scrape the hail of his chest with a hoop iron razor. hese
While word-level stress provides evidence for the disassociation strikingly uniform intuitions constitute the core data of what
of metrical structure in language and music, higher levels of lin- Bruce Hayes (in press) designates as the textsetting problem,
guistic structure provide some evidence for a musicalist inter- for which he proposes an optimality theoretic solution. It
pretation of linguistic performance. In particular, phrasal stress, remains to be seen whether this work will validate Liebermans
unlike word stress, is not phonologically austere and requires initial insight of the deep connection between linguistic and
for its computation, in addition to morphological, syntac- musical structure, or whether the relevant intuitions will apply
tic and pragmatic factors, the quasi-musical considerations solely to the narrow artistic domain with which these analyses
of what is referred to as phrasal euphony. Most conspicuous are concerned.
among these is the stress clash resulting from stressed syllables
from two words appearing adjacent to each other within the Conclusion: An Internalist Perspective on Language
same phrase. he unacceptable form in a) triggers the applica- and Music
tion of the rhythm rule, which achieves euphony by retracting One possible explanation for the discrepancies in the forms
leftward the irst of the two syllables involved in the stress clash taken by linguistic and musical representations, even when they
to produce the acceptable form b). While not a validation of the are supericially similar, is that this discontinuity simply relects
musicalist view, the terminology that is adopted by linguists, as the fact of the matter. hat is, no signiicant overlap in the empir-
well as the mechanisms by which this particular phenomenon is ical domains of music and language exists beyond the fact that
explained, is suggestive of a shared basis underlying the compu- both are, in an important sense, products of our minds, which
tation of phrasal stress and the assignment of musical meter. make use of our psychological capacities for structuring the
A second point of contact, as noted by Jamie Kassler (2005), external world. But it should be recognized that however close or
is the musicalist interpretation of metrical structure incor- distant the ultimate relationship, granting a signiicant psycho-
porated into certain approaches to formal prosody and in its logical basis to musical structure is itself testimony to the inlu-
transformational variant, generative metrics. he stated goal ence of linguistics, namely, the recognition by modern linguists
of these approaches is to deine the abstract structure of lines of the status of language as grounded not in the external reality of
of texts composed in a poetic meter. But even here, the adop- speech its acoustic and physiological structure (E-language)
tion of a musicalist view as representing a signiicant aspect of but in the underlying psychological mechanisms that give rise to
the relevant empirical domain remains controversial. Although linguistic behavior (i-language).
poets routinely invoke the music of poetry and the rhythms In contrast, musical scholarship has remained largely a
of verse and appeal to explicitly musical terms such as phras- structuralist enterprise, devoted primarily to describing
ing, staccato, harmony, and so on, it remains an open question the external tokens of music, most commonly musical scores.
whether poetic rhythm has any relationship to musical rhythm Approaches that take as primary the unconscious knowledge
as musicians understand as the term. he reaction to Derek that listeners (and composers in their capacity as listeners)
Attridges Rhythms of English Poetry (1982) sheds some light on access in making sense of what they hear and compose are decid-
these questions. Attridge posits a scansion that assigns syllables edly peripheral within the ield. Consequently, confusions as to
to alternating beat and ofbeat positions (indicated by b and o, what a theory of music is a theory of arise more or less routinely.
respectively), some of which can remain vacant, most notably at Insofar as traditional structuralist theories are seen as ofering
the end of lines (see verse line). he general approach and the the only empirically sound and intellectually satisfying accounts
representation of empty positions, in particular, is criticized by of musical form, then linguistic and musical scholarship, aside
Marina Tarlinskaya for failing to recognize that though music from occasional points of convergence, are likely to continue on
535
Narrative, Grammar and
their separate paths. If, on the other hand, the posing of interest- a principled extension or, rather, a more or less metaphorical
ing questions and a viable theoretical framework relies crucially extrapolation of concepts of grammar developed within the
on viewing musical works as psychologically based natural language sciences.) Such higher-order grammars take the form
objects in the Chomskyan sense, then in this respect, what we of rule systems designed to capture the basic units of narrative
understand about language has a great deal to ofer our under- and specify their distributional patterns in a more or less clearly
standing of music. deined corpus of narrative texts (see corpus linguistics).
his tradition of research can be traced back to the early prece-
John Halle
dent set by Vladimir Propp ([1928] 1968), who analyzed a corpus
of 100 folktales into a inite number of structural constituents
that he termed functions (or character actions deined in terms
Attridge, Derek. 1982. he Rhythms of English Poetry. New York: of their sequential position within an unfolding plot) and identi-
Longman. ied rules for their patterning in the corpus he studied.
Bernstein, Leonard. 1976. he Unanswered Question: Six Talks at In the 1970s and 1980s, spurred by the (re)discovery of Propp
Harvard. Cambridge: Harvard University Press.
by structuralist narratologists, as well as by the attempt to develop
Brown, Stephen. 2000. he musilanguage model of music evolution.
automated systems for story understanding and story generation,
In he Origins of Music, ed. N. L. Wallin, B. Merker, and S. Brown,
271300. Cambridge, MA: MIT Press.
theorists tried to create story grammars with the widest possible
Cooke, Deryk. 1959. he Language of Music. Oxford: Clarendon. scope. Gerald Prince (1973) drew on Chomskyean transforma-
Halle, John, and Fred Lerdahl. 1993. A generative textsetting model. tional generative grammar in an efort to develop a grammar of
Current Musicology 55: 326. stories with greater descriptive adequacy than the one pre-
Halle, Morris, and Jean-Roget Vergnaud. 1987. An Essay on Stress. sented by Tzvetan Todorov in a 1969 book entitled Grammaire
Cambridge, MA: MIT Press. du Dcamron. Another narratologist, homas G. Pavel (1985),
Hayes, Bruce. Textsetting as constraint conlict. In Toward a proposed a move grammar that drew not on Chomskyean the-
Typology of Poetic Forms, ed. Jean-Louis Aroui and Andy Arleo. ory but rather on Propps foundational work, analyzing narra-
Amsterdam: Elsevier. In press. tives into problems and moves performed by characters seeking
Jackendof, Ray, and Fred Lerdahl. 1983. A Generative heory of Tonal
to bring about their solution. Meanwhile, in a contribution to the
Music. Cambridge, MA: MIT Press.
cognitive-psychological strand of story grammar research, J. M.
Kassler, Jamie. 2005. Representing speech through musical notation.
Journal of Musicological Research 24: 22739.
Mandler (1984, 22) argued that stories have an underlying, or
Liberman, Mark. [1975] 1979. The Intonational System of English. base, structure that remains relatively invariant in spite of gross
New York and London: Garland. diferences in content from story to story. his structure consists
Mithen, Stephen. 2005. he Singing Neanderthals: he Origins of Music, of a number of ordered constituents that include a setting and
Language, Mind and Body. London: Weidenfeld and Nicolson. an episode, which is in turn decomposable into a beginning that
Patel, A. D., and J. R. Daniele. 2003. An empirical comparison of rhythm causes a development that causes an ending.
in language and music. Cognition 87: B35B45 Given that another entry in this encyclopedia covers story
Rousseau, Jean-Jacques. [1763] 1997. Essai sur lorigine des langues. grammars in greater detail, the remainder of the present dis-
Paris: H. Champion. Reproduction of the Neuchtel manuscript. cussion will focus on the second broad approach to studying
Steele, Joshua. 1775. An Essay Towards Establishing the Melody and
the relations between narrative and grammar. he aim of this
Measure of Speech to Be Expressed and Perpetuated by Peculiar Symbols.
approach is to provide not a grammar of stories but, rather, a
London: J. Almon.
Swain, Joseph. 1997. Musical Languages. New York: Norton.
principled account of how stories exploit grammatical resources
Tarlinskaya, Marina. 2002. Verse text: Its meter and its oral rendition. in narrative-pertinent (or even narrative-speciic) ways. Here,
In Meter, Rhythm, and Performance, ed. C. Kueper, 3955. Frankfurt the emphasis shifts from a grammar of narrative to the functions
am Main: Peter Lang. of grammar in narrative or, put another way, to narrative uses
of grammatical structures. his second approach is arguably
the predominant one in current linguistically oriented research
N on narrative, in part because of critiques of the story grammar
enterprise by theorists such as P. N. Johnson-Laird (1983), who
contends that settings, reactions, moves, and other basic units
posited by story grammarians are not speciied clearly enough
NARRATIVE, GRAMMAR AND
to be construed as elements of a grammar, strictly deined. hey
Analysts of stories have developed two broad strategies for are, rather, heuristic constructs based on a prior, unstated gloss
studying the relations between narrative and grammar. he irst or interpretation of the narrative.
emphasizes grammar of narrative, and the second grammar in Grard Genettes ([1972] 1980) tripartite model of narrative
narrative. structure, which encompasses 1) the story (= the basic sequence
Drawing on deinitions of grammar as a model of the catego- of states, actions, and events recounted), 2) the text on the basis
ries and processes underlying intuitive knowledge of (or com- of which interpreters reconstruct that story, and 3) the act of nar-
petence in) a language, theorists working in ields such as ration that produces the text, provides a framework for study-
cognitive psychology, artiicial intelligence research, and nar- ing grammar and narrative. Speciically, analysts can focus on
ratology have proposed story grammars, that is, gram- the role of grammar in narrative viewed under proiles 2 and
mars of narrative. (A key question is whether this work involves 3 story grammars being a perhaps quixotic attempt to model
536
Narrative, Grammar and
proile 1 via grammatical paradigms. Studied as both textual Diferent distributions of verb types in texts cue interpreters to
structure and narrational process, narrative can be analyzed as a reconstruct storyworlds in which these aspectual values may play
discourse genre that draws in distinctive ways on the same stock a more or less prominent role. Contrast the emphasis on accom-
of grammatical resources used diferently in other forms of dis- plishments in sports broadcasts and hard-boiled detective ictions
course, such as lists, scientiic descriptions, lyric poems, and so with the emphasis on mental states in the modernist novel of con-
on. Although a range of grammatical resources from lexical sciousness or narratives of the self told in therapeutic settings.
relations and phrase structure to discourse anaphora, What is more, in ways that M. A. K. Hallidays (1994) func-
gapping (see ellipsis), and topicalization are all poten- tional grammar helps illuminate, patterns of verb selection
tially relevant for the study of grammar in narrative, in this brief assign more or less static or dynamic roles to participants in sto-
discussion I focus on just two elements of morphosyntax verbs ryworlds. From a functionalist perspective, and in parallel with
and deictic expressions (see deixis) and map out some of cognitive linguistic research on how grammar relects
their narrative functions at both the level of text and the level of underlying perceptual and conceptual processes used to make
narration. sense of the world (see also cognitive grammar ), verbs
encode construals of experience in terms of processes of vari-
Functions of Verbs in Narrative ous types; in turn, each such process type speciies preferences
Verbs perform crucial functions in narratively organized dis- for assigning roles to the participants involved. For example, the
course. At the level of narration, the selection of a particular material process type, encoded in verbs like put or get, assigns
tense for the primary or matrix narrative can be used to indi- the roles of actor and goal to participants: e.g., She [actor] kicked
cate the relation between event time and narration time, as the ball [goal]. By contrast, the mental process type, encoded
when past tense verbs are used to signal retrospective narra- in verbs like saw, felt, and thought, assigns to participants the
tion. Further, shifts in verb tense can be used to mark especially roles of senser and phenomenon: e.g., He [senser] saw the sunrise
salient episodes, as when (in English-language narratives) sto- [phenomenon]. Although the functionalist approach originally
rytellers engage in shifts between the simple past tense and the focused on process types and participant roles at the level of
conversational historical present as a strategy for underlining the the clause, aspects of the approach can be scaled up to account
signiicance of the events being recounted (Wolfson 1982). he for discourse-level patterning in narrative. hus, Herman
selection of particular verbal moods can also be used to posi- (2002) suggests that storytelling genres can be characterized
tion a tellers account on the continuum stretching between the as preference rule systems in which variable preference
realis and irrealis modalities. At issue is a scale that ranges rankings obtain for diferent kinds of process types, yielding,
from expressions indicating a speakers full commitment to the in turn, preferred and dispreferred role assignments. Whereas
truth of a proposition about the narrated world to more or less epics preferentially rely on material processes, with participants
hedged expressions, which indicate varying degrees of noncom- slotted in the roles of actor and goal, psychological novels prefer
mitment. In a foundational contribution to the study of stories mental over material processes, and with them the participant
told in face-to-face communicative interaction, William Labov roles of senser and experiencer.
(1972) argued that one of the identifying features of properly nar-
rative clauses (i.e., clauses that cannot be reordered in discourse Deixis in Narrative
without changing the original semantic interpretation of the Deictic expressions such as I, here, and now that is, expressions
narrative that they convey) is their reliance on past tense indica- with interpretations that depend on who utters them to whom
tive verbs. By contrast, subjunctive and other nonindicative ver- in what communicative circumstances (see pragmatics)
bal moods are used by storytellers to evaluate (signal the point are another part of the grammatical system exploited by nar-
of) the narrated events. In a series of studies, however, David ratives in distinctive ways. At the text level, deictic expressions
Herman found that tellers of supernatural tales in face-to-face serve to locate narrators and characters in time and space vis-
interaction regularly use nonindicative verbal moods (not went, -vis objects, events, and situations in the storyworld, whose
but would go or used to go) to accomplish fuzzy or strategically space-time coordinates often do not match those of the current
inexact reference to the spatiotemporal positions and behavior moment of narration.
of ghosts (cf. Herman 2002, 335). Consider, for example, the irst two sentences of Ernest
Meanwhile, at the level of text, verbs play a key role in the pro- Hemingways 1927 story Hills Like White Elephants: he hills
cess that can be characterized as storyworld (re)construction across the valley of the Ebro were long and white. On this side
that is, the use of textual cues to encode or interpret the special there was no shade and no trees and the station was between
class of mental models that support narrative understanding two lines of rails in the sun. he preposition across in the irst
(Herman 2002). For one thing, both conversational narratives sentence and the demonstrative pronoun this in the second sen-
and more complex literary narratives rely on verbs like come tence must both be interpreted in light of the assumed position
and go (and cognate forms) to map characters paths of motion of the vantage point from which events are being narrated a
through space and time in a process sometimes correlated with vantage point that here overlaps with that of the characters.
patterns of alliance or conlict, as well as with internal or psy- David A. Zubin and Lynne E. Hewitt (1995) propose the notion
chological growth, as in the classical Bildungsroman, or novel of of deictic shift to account for such displaced or transposed
development. Verbs and verb phrases also express Aktionsarten, modes of deictic reference, which must be anchored in the sto-
or aspectual values, including states (I was a hiker), activities ryworld evoked by the text, rather than in the world(s) that the
(I hiked), and accomplishments (I hiked up the mountain). text producer or the text interpreter occupies when producing or
537
Narrative, Grammar and Narrative, Neurobiology of
decoding these textual signals. his model builds on a number Mandler, J. M. 1984. Stories, Scripts, and Scenes. Hillsdale, NJ: Lawrence
of prior theoretical frameworks, including Karl Bhlers account Erlbaum.
of Deixis am Phantasma (= imaginary relocation to the alterna- Pavel, homas G. 1985. he Poetics of Plot. Minneapolis: University of
tive sets of space-time coordinates implied by utterances about Minnesota Press.
Prince, Gerald. 1973. A Grammar of Stories. he Hague: Mouton.
ictional or imaginary situations) and Kte Hamburgers argu-
Propp, Vladimir. [1928] 1968. Morphology of the Folktale. Trans. Laurence
ment that only ictional narrative can provide direct access to the
Scott, rev. Louis A. Wagner. Austin: University of Texas Press.
consciousness or I-originarity of another to felt, experien- Wolfson, Nessa. 1982. he Conversational Historical Present in American
tial knowledge of the world as presented via someone elses van- English Narrative. Dordrecht, the Netherlands: Foris.
tage point on events. Zubin, David A., and Lynne E. Hewitt. 1995. he deictic center: A the-
In natural-language narratives told in contexts of face-to- ory of deixis in narrative. In Deixis in Narrative: A Cognitive Science
face communication, deictic expressions can serve other func- Perspective, ed. Judith F. Duchan, Gail A. Bruder, and Lynne E. Hewitt,
tions as well. In particular, when stories are told on-site, that 12955. Hillsdale, NJ: Lawrence Erlbaum.
is, where the events being recounted are purported to have
taken place, deictic references can cue recipients to map fea-
tures of the here-and-now circumstances of narration onto the NARRATIVE, NEUROBIOLOGY OF
space-time environment evoked by the narrative text. hus, in
Despite cognitive neurosciences long-standing interest in the
the narrative discussed in Herman (2007), Monicas use of per-
processing of individual words and sentences, most neuro-
son deixis her references to I and we create a referential link
biological enquiry into the comprehension and production of
between Monica as the teller in the here and now and Monica
more holistic and contextual forms of narrative discourse has
as the coexperiencer (with her friend Renee) of the supernatu-
come only recently. Chief among the obstacles to such studies
ral encounter she tells about. More than this, however, Monica
has been a tension between scientiic control and ecological
refers deictically to spatial features of the current communica-
validity: Experimental control demands repeatable and rather
tive context, as indicated by the items in bold in the following
predictable stimuli and constrained responses, whereas free-
partial transcript:
ranging narrative discourse eschews these properties. In addi-
Because she is telling her story on-site, Mary can use deictic tion, an analytical focus on mapping individual brain regions
expressions to recruit from features of the current environment to individual phonological, morphological, and local
contextual properties has upstaged examination of the ways in
Monica: we walkin up the hill,
which the construction of meaning may be subserved by network
this way, coming up through here. interactions among many brain regions. With newly broadened
[] experimental paradigms, though, and with novel analytical meth-
And Im like on this side and Renees right here. ods addressing functional interactions, neuroscience promises
to add a biological dimension to cognitive psychological theories
and thereby orient her interlocutors vis--vis the storyworld; of narrative discourse.
those features provide spatiotemporal coordinates for the situa- Central to cognitive neuropsychological explorations of nar-
tions and events of which Mary is giving an account. rative has been the realization of shared structure between the
More generally, on the basis of fundamental cognitive abili- explicit narratives of literature and the implicit narratives of
ties studied in research on conceptual blending, storytellers everyday mental representations: Information processing
like Mary exploit aspects of grammar to build complex mapping in both these spheres depends on the representational power
relationships between two mentally projected worlds: the world conferred by narrative abstraction. In the everyday world no
evoked by the narrative text and world in which the act of narra- less than the literary, disconnected percepts gain meaning and
tion unfolds. separability from time, place, and action insofar as they become
transformed into representative mental texts, stories whose dis-
David Herman
tinct scenes contain recognizable characters that act in coherent
WORKS CITED AND SUGGESTIONS FOR FURTHER READING plots and evince meaningful themes. A neurobiological perspec-
tive only reinforces these observations from human thought and
Genette, Grard. [1972] 1980. Narrative Discourse: An Essay in Method. behavior: Neurophysiological study of brain dynamics reveals
Trans. Jane E. Lewin, Ithaca, NY: Cornell University Press.
that human cognitive architecture may be engineered to
Halliday, M. A. K. 1994. An Introduction to Functional Grammar. 2d ed.
represent its processing in series of discrete frames somewhat
London: Edward Arnold.
Herman, David. 2002. Story Logic: Problems and Possibilities of Narrative.
analogous to those of cinema (Freeman 2006), and cognitive
Lincoln: University of Nebraska Press. neuroimaging has begun to reinforce a neuropsychological
. 2007. Storytelling and the sciences of the mind: Cognitive nar- view of thought as an activity of constructing blended spaces
ratology, discursive psychology, and narratives in face-to-face interac- between narrative schemata. he central role of narrative
tion. Narrative 15: 30634. scripting in cognition was explored in connection with early
Johnson-Laird, P.N. 1983. Mental Models. Cambridge: Harvard University eforts at constructing symbolic systems capable of artiicial
Press. intelligence, and was latent in literary criticism even before the
Labov, William. 1972. he transformation of experience in narrative syn- cognitive revolution of the late twentieth century. What is
tax. In Language in the Inner City, 35496. Philadelphia: University of
new about this connection between thought and narrative is its
Pennsylvania Press.
538
Narrative, Neurobiology of
explicit elaboration in light of cognitive neuroscience, connect- to anteriorly, progressing toward the anterior medial temporal
ing literature and philosophy with psychobiological information lobe and its memory-related structures. hese physiological
and constraint. results in normal volunteers agree very well with the locations of
Neuropsychologically, comprehension has been studied more lesions that impair comprehension in aphasia patients.
completely than production, and this entry focuses on compre- An important question is whether the N400 relects the com-
hension. Narrative organization is implemented by interacting parison of new information against the context maintained in
and not entirely separable processes of perceptual organization, working memory, or the encoding and integration of this
attention, and memory. Perceptual organization is the process information into the context, or both these processes in combi-
that binds separate physical stimuli into coherent higher-order nation. In addition to these associations with working memory,
objects within a scene, replacing, for example, a horizontal plane N400 amplitude seems correlated with the diiculty of retriev-
and four perpendicular posts with the single entity of a table. ing related information from long-term memory: Words that
Attention, a group of many subprocesses, focuses processing are used rarely, for instance, evoke greater N400s than do com-
on those parts of a scene deemed relevant to the current script mon words, and semantic incongruities that introduce unre-
or story schema. Memory encoding, maintenance, and lated categories (buttersand) evoke greater N400s than
retrieval, by holding in mind the higher-order representations within-category violations of congruity (butteroil). N400
of what one has seen before and what one expects to see next, amplitude may thus relect the complexity of constructing
inform attention and perceptual organization with the context of blended spaces between the semantic space signiied or evoked
this story (Gerrig and McKoon 2001). Although much is known by a term and the space established by its context. It remains an
about the neural substrates of these processes individually, their open question as to what extent similar processes may under-
signiicance in narrative processing lies in their interactions. An lie the construction of more complex and temporally extended
understanding of these interactions subserving the comprehen- blends during the comprehension of complex discourses. It is
sion or production of narrative discourse might best begin at the interesting to note that negative voltages with timing similar
beginning, with a discussion of contextual integration at the level to the N400 are evoked by all manner of nonlinguistic stimuli,
of individual words. such as pictures and drawings, suggesting that all forms of
he principal physiological index of a words integration into semantic evaluation may involve processes akin to those active
its context is the N400 (Kutas and Federmeier 2000), a negative during narrative comprehension or, more abstractly, that the
voltage produced by the brain in response to a word, and maxi- computations involved in all forms of cognition may have nar-
mal about 400 milliseconds after the word is presented. he N400 rative character.
relects a truly textual process largely independent of the partic- he stronger N400 in the left hemisphere seems more driven
ular sensory mode of representation: Although there are some by category structure and afected by the retrieval of informa-
more subtle efects of sensory modality, the N400 arises no mat- tion from long-term semantic memory, whereas the right
ter whether the word is read from a page or heard from a speaker. hemisphere may be more driven by broader contextual inte-
he N400 is thought to relect a process or processes of contex- gration and afected by the retrieval and/or updating of working
tual integration since its amplitude varies parametrically with a memory. he left hemisphere is, therefore, most afected by the
words predictability; the canonical method of evoking a large sense of a word considered individually or in relation to its local
N400 is to present a word whose semantics violate contextual context, and the right hemisphere by the broader context of the
expectation, for instance, At breakfast we ate toast with sand. narrative (Gernsbacher and Kaschak 2003). his computational
he initial words in this sentence, and the syntax into which distinction of more local semantic evaluation by the left hemi-
theyre arranged, prime activations for appropriate breakfast sphere and more extended contextual evaluation by the right
foods. he ongoing construction of meaning is then challenged hemisphere may map fairly directly onto an anatomical dis-
by the non sequitur sand, eliciting a large N400 response. he tinction of small and more isolated dendritic arbors in the left
N400 arises no matter whether the conlicting context is estab- hemisphere and larger and more overlapping dendritic arbors in
lished by a surrounding sentence, as in this brief example, or the right (Jung-Beeman 2005), although evoked potentials sug-
simply by a single adjacent word, or by an entire discourse. For gest that this relation may be more a product of left hemisphere
instance, the following discursive context reduces the N400 in specialization for local processing than of any complementary
the example above: We camped on the beach. A stif wind blew right hemisphere specialization for broader context (Coulson
of the dunes into all our supplies. At breakfast we ate toast with and Van Petten 2007). Cognitively, the pattern is relected in the
sand. hus, words that are semantically related to their sur- right hemispheres involvement in strongly context-dependent
roundings or are otherwise contextually expected evoke small constructions, for instance, those involving frame shifts, such as
N400s, whereas words that cannot be predicted from context metaphor or humor (see verbal humor, neurobiology
and which, by inference, supply new information with which the of). Activation of the right hemisphere strengthens as one pro-
context must be updated evoke large N400s. ceeds from the level of individual words to sentences to entire
Anatomically, electromagnetic source localization and func- discourses and as a narratives contextual complexity builds
tional neuroimaging place the generators of the N400 primarily from its beginning to its resolution (Xu et al. 2005).
in the superior temporal lobe and temporo-parietal junc- In addition, in comparison to words and sentences, discourse
tion (Van Petten and Luka 2006). hese sources are mainly in the uniquely activates medial prefrontal cortex, the temporo-parietal
left hemisphere but have some contribution from the right. junction, and the precuneus (at the junction of posterior parietal
Across the period of the N400 response, they proceed posteriorly and anterior occipital lobes), as well as subcortical regions
539
Figure 1.
(caudate nucleus and dorsomedial thalamus) that communicate frontal activation relects a more general association with com-
with prefrontal cortex (Xu et al. 2005). (See Figure 1.) he involve- plex social narratives perhaps related to contextual selection of
ment of these regions likely relects discourses demands to imag- details that build coherence within a narrative and that engage a
ine scenes perceptually and especially visually, to place scenes and works suggestion structure to make it relevant to personal
events in spatial relation and temporal sequence, to take up and to experience and self-representation (Ferstl and von Cramon
shift between spatial perspectives and personal points of view, 2002). Such contextual selection may instantiate the prefrontal
and to emote and empathize. In particular, the precuneus seems cortexs more general involvement in the inhibition of responses
associated with visual spatial perception and attention, medial deemed inappropriate to the current behavioral and cognitive
prefrontal cortex and its linked subcortical nuclei with perception context. In contrast, an experimentally based argument has been
of events in sequence and context, the temporo-parietal junction made for a more selective association of temporo-parietal junc-
with theory of mind, and medial temporal lobe structures with tion with the late-developing, belief-oriented variety of theory of
emotion and memory. Narrative representation can be viewed as mind (Saxe and Powell 2006).
an emergent property of interactions among these and other struc- It remains an open question in evolutionary psychol-
tures subserving a broad array of cognitive processes. ogy as to what extent theory of mind may be a modular cogni-
One of the most discussed capacities involved in narrative tive capacity independent of other aspects of cognition, versus to
comprehension and production is theory of mind. heory of what extent it may depend on developmental specialization aris-
mind was irst characterized as the general ability to understand ing in the interaction of earlier-maturing, more general capaci-
or to model the thoughts and beliefs of other people. However, ties for social perception and executive function; recent views
more recent neuroscientiic results show that a great deal of on genes and language suggest that human cognitive adap-
such social attribution can be accomplished using principally tation for narrative discourse may combine these modular and
perceptual mechanisms. hese social perceptual capacities are generalist perspectives, perhaps by putting to novel uses a large
computationally and developmentally prior to theory of mind collection of small modules specialized for cognitive processes
and include specialized representations for qualities that typify that are applicable to language but not necessarily restricted to
agency, such as autonomous movement and direction of gaze. language (Bookheimer 2002). Proponents of the modular view
Such perceptual qualities underlie the attribution of volitional have often pointed to autism and language, or more spe-
mental states (she/he/it wants or she/he/it wants to) and ciically to autisms impairments in social communication, as
perceptual mental states (she/he/it sees) attributions that evidence for a modular dysfunction of theory of mind. However,
form a ubiquitous shorthand in narrative descriptions even in many people with autism pass theory of mind tests; the dys-
the case of plainly mechanical and nonsubjective entities (for function of medial prefrontal cortex found in imaging studies of
example, the computer doesnt see the network, or the printer autism seems consistent with a more general deicit in automati-
wants attention). heory of mind in its most speciic sense is cally engaging contextual evaluation and self-representation,
essential only for the attribution of belief (she/he/it thinks/ and in any case, such abnormalities within speciic regions in the
believes/knows), is associated with activation of brain regions autistic brain may be relections of a more fundamental disrup-
distinct from those subserving more elementary forms of social tion in the information transfer and integration between regions.
attribution, and is distinguished from these more elementary Behaviorally observed deicits in theory of mind may thus stem
forms by its appearance at a later stage in child development, from a more general perturbation of narrative processing, and
at or near four years of age. his developmental connection may appear especially prominent only because theory of mind is
is signiicant: heory of mind seems to arise from simpler pro- so frequently applied in everyday social interaction.
cesses that deal in sensory and especially visual data. Although In addition to those regions uniquely activated by discourse
early studies associated theory of mind most strongly with the processing, most other brain regions involved in language
medial frontal cortex, later work has suggested that this medial or higher-order cognition become more heavily recruited by
540
discourse than by individual sentences or words. In particu- that, has thus far been realized in multisubject functional neu-
lar, the middle frontal gyrus, on the dorsolateral surface of the roimaging. Contributing to this limited resolution is a potentially
prefrontal cortex, seems involved in placing sentential or other high degree of variation in detailed functional anatomy across
discursive elements in temporal, causal, or logical sequence individual subjects. A further challenge is an intersubject vari-
(Gernsbacher and Kaschak 2003; Mar 2004). his prefron- ability in information transfer between brain areas that actually
tal sequencing and coordination of ideas seems analogous to relects individual diferences in cognitive style: For instance, dif-
the more concrete executive sequencing and coordination of ferent individuals may make more or less use of working mem-
body movements implemented in more posterior regions of ory in comprehending and producing narrative, and there are
the frontal cortex. In a computational sense, therefore, narra- indications that these diferences may be relected in functional
tive comprehension can be viewed as an elaboration of motor connectivity with cooperating structures in the medial temporal
control or, conversely, motor control itself can be said to lobe related to long-term memory.
have narrative character, in the sense that it fundamentally his connectivity frame holds out the potential for a rap-
involves sequencing and relating actions in contexts. his rela- prochement between connectionism and explicitly represen-
tion between language and action gives crucial context to evo- tational, modularist views of language and narrative processing,
lutionary biologys eforts to explain the phylogenesis of such since modular functions may reside not so much in any par-
an abstract cognitive capacity, in that the roots of this capacity ticular anatomical locus as in the incoming and outgoing links
may rest in the very concrete domain of motor control. Along among these loci: In this sense, the more closely the localization
the same lines, the anterior inferior frontal gyrus seems distin- problem is examined, the more ill-posed it may become. he
guished from neighboring cortex by an involvement in selecting aforementioned characterizations in terms of regional functional
semantic relations in communication with semantic retrieval mapping may, therefore, be understood as a irst approach to a
processes in the temporal lobe, whereas posterior regions of the description in terms of regional functional interaction. It is in this
inferior frontal gyrus are more immediately bound up with the sense that the new neuroscientiic study of discourse processing
more concrete, sequencing-related details of syntax and pho- is approaching an understanding of narrative connectivity in
nology (Bookheimer 2002; Jung-Beeman 2005). terms of neural connectivity.
In general, the neural implementation of narrative compre- his entry is current as of 15 August 2007.
hension seems to take advantage of individual capacities for
movement, sensory perception, and emotion, activating these WORKS CITED AND SUGGESTIONS FOR FURTHER READING
systems in an internal simulation of the events evoked by the
Bookheimer, Susan. 2002. Functional MRI of language: New approaches
narrative. he neural implementation of narrative understanding
to understanding the cortical organization of semantic processing.
thus depends crucially on embodiment. At the most concrete
Annual Review of Neuroscience 25: 15188.
level, that of simulating movements, this process of comprehen- Coulson, Seanna, and Cyma Van Petten. 2007. A special role for the right
sion engages the mirror neuron system (see mirror systems, hemisphere in metaphor comprehension? ERP evidence from hemi-
imitation, and language) in the ventrolateral frontal lobe ield presentation. Brain Research 1146: 12845.
and the supplementary motor area in the dorsomedial frontal Ferstl, Evelyn C., and D. Yves von Cramon. 2002. What does the fronto-
lobe (Wilson, Molnar-Szakacs, and Iacoboni 2008). median cortex contribute to language processing: Coherence or theory
Cognitively, narrative understanding seems to emerge from of mind? NeuroImage 17.3: 15991612.
the interaction of many specialized subsystems. Neurally, there- Freeman, Walter J. 2006. A cinematographic hypothesis of cortical
fore, a full description of narrative processing must encompass not dynamics in perception. International Journal of Psychophysiology
60.2: 14961.
only the individual brain regions engaged, but also the dynamics
Gernsbacher, Morton Ann, and Michael P. Kaschak. 2003. Neuroimaging
with which these regions connect and interact over a wide vari-
studies of language production and comprehension. Annual Review
ety of narrative processes and subprocesses. hese analyses of
of Psychology 54: 91114.
functional connectivity are just beginning (Karunanayaka et al. Gerrig, Richard J., and Gail McKoon. 2001. Memory processes and
2007) via techniques including structural equation modeling, experiential continuity. Psychological Science 12.1: 815.
dynamic causal modeling, and model-free multivariate meth- Jung-Beeman, Mark. 2005. Bilateral brain processes for comprehending
ods such as partial least squares and independent components natural language. Trends in Cognitive Sciences 9: 51218.
analysis and initial work has demonstrated information low Karunanayaka, Prasanna R., Scott K. Holland, Vincent J. Schmithorst,
from regions of the temporal lobe adjoining the auditory cortex Ana Solodkin, E. Elinor Chen, Jerzy P. Szalarski, and Elena Plante.
(often described as wernickes area, though the anatomical 2007. Age-related connectivity changes in fMRI data from children
deinition of this term has never been precise) to higher-order listening to stories. NeuroImage 34.1: 34960.
Kutas, Marta, and Kara D. Federmeier. 2000. Electrophysiology reveals
processing in the inferior frontal lobe (brocas area) and near
semantic memory use in language comprehension. Trends in
the temporo-parietal junction. From these areas, the network of
Cognitive Sciences 4: 46370.
interactions becomes more complex, with the inferior frontal lobe Mar, Raymond A. 2004. he neuropsychology of narrative: Story compre-
projecting to the dorsolateral prefrontal cortex, temporo-parietal hension, story production, and their interrelation. Neuropsychologia
junction, and medial frontal cortex, implementing wide-rang- 42: 141432.
ing efects of core language processing on complex semantics, Saxe, Rebecca, and Lindsey J. Powell. 2006. Its the thought that
sequencing and coordination of ideas, and social attribution. counts: Speciic brain regions for one component of theory of mind.
he data now in hand suggest functional specialization and Psychological Science 17: 6929.
subdivision of brain regions beyond the anatomical resolution
541
Narrative, Scientiic Approaches to
Van Petten, Cyma, and Barbara J. Luka. 2006. Neural localization of their formal devices are limited in number while also being
semantic context efects in electromagnetic and hemodynamic stud- universal in nature. In a way, the Russian Formalists took up
ies. Brain and Language 97.3: 27993. where Aristotle had left off in his Poetics and his Rhetoric on
Wilson, Stephen M., Istvan Molnar-Szakacs, and Marco Iacoboni. 2008. the study of the architecture of narrative. But their work, as in
Beyond superior temporal cortex: Intersubject correlations in narra-
the case of Aristotle, did not go beyond the boundaries of clas-
tive speech comprehension. Cerebral Cortex 18.1: 23042.
sification and typology.
Xu, Jiang, Stefan Kemeny, Grace Park, Carol Frattali, and Allen Braun.
2005. Language in context: Emergent features of word, sentence, and
he research program developed by the French Structuralists
narrative comprehension. NeuroImage 25.3: 100215. in the 1960s and 1970s in the guise of a narratology draws on the
work of the Russian Formalists and that of Vladimir Propp con-
cerning the structural analysis of fairy tales (a group included
NARRATIVE, SCIENTIFIC APPROACHES TO in the Aarne-hompson Index of folktale types). he program
also consists mainly in a taxonomy, but its clariication of con-
Knowledge is acquired in many ways and may exhibit many cepts and criteria of classiication is much more advanced and
degrees of generality, certitude, and power one can use to pre- sophisticated.
dict the future. It can be the product of direct observation and he deinition of narrative proposed by H. Porter Abbott is
a small number of general assumptions, or the result of a very simple, capacious, and suiciently precise for our purpose here.
elaborate and long chain of hypotheses and deductions. It can Abbott says: Narrative is the representation of an event or a series
possess a rich factual content or be almost devoid of it, but it of events (2002, 13). And he explains: he diference between
must always lead back to factual observations. Knowledge, in its events and their representation is the diference between story
most developed form, called science, rests on the basic assump- (the event or sequence of events) and narrative discourse (how
tion that the whole universe is structured and functions accord- the story is conveyed) (2002, 15). From this point of view, narra-
ing to laws that hold without exceptions, in a precise way, and tive can be studied without regard to the medium through which
throughout all time. Sciences main tool is mathematics, which the event or events are represented. Because of this property,
is as universal as science but is not itself a science; it is sciences narrative in texts, in ilms, in comic books, and so on can be stud-
language (or logic). In its turn, science (and sometimes its lan- ied by the means furnished by narratology, with a small amount
guage, mathematics) is used as a tool by many intellectual activi- of adjustments in each case. As the most fundamental taxonomy,
ties and ields of study that do not have the status of a science. narratology deals with formal universals, and in that capacity it
Such is the case of narrative, a cognitive endeavor wherein the is indispensable for a research program seeking to establish the
methods used and the classiications arrived at are still far from scientiic study of narrative.
yielding scientiic (factual) observations and scientiic (factual) Since the mid-1990s there has been a renewed interest in the-
predictions. Classiications are an important step in science; sci- matics and its empirical study with the purpose of determining
entists generally proceed by identifying and isolating a group of how narratives relate to human universals. he multidisciplinary
phenomena that seem related, formulating hypotheses about research by Max Louwerse, Willie van Peer, and Donald Kuiken
their main characteristics, and trying to connect them by means should be mentioned here, as well as the imaginative and solid
of a theory. When they succeed, a branch of science becomes research program being developed by Patrick Colm Hogan on the
established. basis of his identiication of three narrative universal structures
he study of narrative as such has not followed this path far or genres (romantic, heroic, and sacriicial tragicomedic) gener-
enough to become a scientiic discipline. But during the last 15 ated by emotion prototypes. Another group of approaches con-
years, more and more scholars have specialized in the study of cerns the study of the universal features of the relation between
narrative have been using indings from neurobiology, cogni- reader or audience and narrative. Here, empirical work, such as
tive science, and evolutionary psychology to enlarge our under- that being realized by David Miall and Kuiken, as well as Deirdre
standing of narrative and to ground it in the architecture of the Wison and Dan Sperber, to name but a few once again, is also
human brain. contributing to a scientiic approach to very complex relations.
The first attempts to establish narrative as an autonomous Evolutionary psychology and cognitive science are tools used by
scientific discipline took place in Russia just before the 1917 these researchers in the wake of Joseph Carroll and Robert Storey.
Revolution and continued until about 1930, when they were And with respect to the universal features of rhythm, metered
stopped by Stalins regime in the Soviet Union. The Russian speech, onomatopaeia, and prosody, the long and patient work
Formalists, as the researchers were called, made salient con- of Reuven Tsur has led to many fruitful results. he same is true
tributions to their field such as the search for literariness, of the much more recent research relating music, neurobiology,
or the formal properties defining the literary text; the dis- and the emotions.
tinction between plot (syuzhet) and story (fabula); the con- Indeed, a common experiential link among these groups
cept of ostraniene, translated into English by the neologism of approaches is the emotions. Recent neuroscience has given
estrangement; the notion that a literary text is a system, as solid evidence to the hypothesis that emotions, too, are univer-
is literature itself; and the setting of boundaries between the sal. his is a very important inding, since emotions are a central
study of a text itself and the scientifically irrelevant study of concern in the study of narrative. he discovery of a mirror neu-
its production or its reception. These contributions were con- ron system in humans (after its discovery in macaque monkeys
ceived within a research program that acknowledges the fact in 1996) has opened the way to a multitude of new explorations
that narratives are infinite in subject and presentation, but concerning the brain and our social behavior, including the
542
Narrative, Scientiic Approaches to Narrative Universals
production and reception of narrative. he mirror neuron sys- literary prosody is available online at: http://www.arsversiicandi.net/
tem in humans has been consistently reported as being related backissues/vol1/essays/tsur.html.
to imitation, action observation, intention understanding, and Wilson, Deirdre, and Dan Sperber. 2006. Relevance theory. In
understanding of the emotional states of others, to mention a Handbook of Pragmatics, ed. Gregory Ward and Laurence R. Horn,
60732. Oxford: Blackwell.
few of the human faculties that are essential for the right percep-
tion of ictional narratives and, of course, for the survival and
evolution of the human race. And there seems to be even more NARRATIVE UNIVERSALS
to it. In 2004, Vittorio Gallese, Christian Keysers, and Giacomo
In one Arawak story, a tiger sees a great hunter in the forest,
Rizzolatti published a paper in which they explore the possibil-
changes herself into a woman, and marries him. he two have a
ity that the mirror neuron system, by providing us with an expe-
happy married life until the good wife suggests that they visit the
riential (precognitive) insight into other minds, could provide
hunters family. She warns him that he must not reveal her ori-
the irst unifying perspective of the neural basis of social cogni-
gin to anyone. he hunter, however, tells his mother the secret.
tion (2004, 401). he mirror neuron system is apparently the
Feeling ashamed in front of the community, the woman changes
basic mechanism that allows us to grasp the intentions of others
back into a tiger and returns to the forest. he poor husband
and to experience similar emotions (empathy). As these authors
would often go into the bush and call his wife, but there never,
say in their article, A crucial element of social cognition is the
never came a reply (Roth 1915, 2034).
brains capacity to directly link the irst- and third-person expe-
At irst glance, the story might seem strange and clearly not
riences of these phenomena (i.e., link < I do and I feel > with < he
universal. A man marries a tiger and must keep it a secret from
does and he feels >) (ibid., 396).
his mother? Claude Lvi-Strauss analyzes the story in relation
he question arises: Would all art then be nothing but a spe-
to a complex of South American myths (see homologies and
ciic and specialized activity aimed at iring the mirror neurons
transformation sets) that recount the decline from a golden
in a certain direction? Would the will to style displayed by
age. He also connects the tale with cannibalism (1973, 259). Lvi-
authors, composers, ilm directors, and so on be nothing but the
Strausss analysis does implicitly link the tale with other tradi-
deliberate use of syntax, semantics, pragmatics, and phonemics
tions, simply by referring to a decline from a golden age, for this
to trigger the mirror neuron system in speciic and predeter-
topic is found in diferent cultures. In the Judeo-Christian tradi-
mined ways in order to elicit speciic and predetermined insights
tion, the story of the Fall is a case in point. However, in some ways,
and emotions in the reader and the audience?
this link only makes the story more alien. Pairing Adam and Eve
Frederick Aldama with a hunter and a tiger seems to highlight cultural diference
and pairing the eating of an apple with cannibalism seems to put
WORKS CITED AND SUGGESTIONS FOR FURTHER READING the Arawak at quite a distance from anything Western.
Abbott, H. Porter. 2002. he Cambridge Introduction to Narrative. On the other hand, there is something deeply familiar about
Cambridge: Cambridge University Press. the story. It tells of a couple joined by attachment (witness the
Aristotle. 1984. he Rhetoric and Poetics of Aristotle. Trans. Rhys Roberts husbands pathetic search at the end of the story), sufering con-
and Ingram Bywater. New York: McGraw-Hill. lict due to the husbands divided loyalties to his mother and his
Carroll, Joseph. 2004. Literary Darwinism: Evolution, Human Nature, wife; it treats shared secrets, feelings of betrayal and shame, and a
and Literature. New York: Routledge. concern about social origins. Moreover, there is nothing in the tale
Gallese, Vittorio, Christian Keysers, and Giacomo Rizzolatti. 2004. A uni- itself that suggests cannibalism, despite Lvi-Strausss analysis.
fying view of the basis of social cognition. Trends in Cognitive Science In short, it is a story that seems both strange and familiar, both
8.9: 396403.
culturally particular and imbued with cross-cultural concerns.
Hogan, Patrick Colm. 2003. he Mind and Its Stories: Narrative Universals
and Human Emotion. Cambridge: Cambridge University Press.
Louwerse, Max, and Don Kuiken, eds. 2004. he Efects of Personal
Cultural Construction, Universality, and Narrative
Involvement in Narrative Discourse Processes. Philadelphia: Lawrence here is a common view in cultural studies the interdisciplinary
Erlbaum. area of the humanities and social sciences devoted to analyzing
Louwerse, Max, and Willie van Peer, eds. 2002. hematics: Interdisciplinary culture that practices, dispositions, artifacts, communicative
Studies. Amsterdam: John Benjamins. actions are socially constructed. his is to say that they are
Miall, David. 2006. Literary Reading: Empirical and heoretical Studies. not innate or biologically determined but result from cultural
New York: Peter Lang. developments. In this view, anything from individual emotions
Miall, David, and Donald Kuiken. 1994. Foregrounding, defamiliariza- to political structures might not be analyzed in terms of rela-
tion, and afect response to literary stories. Poetics 22: 389407.
tively constant human predispositions, but rather in terms of the
Propp, Vladimir. 1968. Morphology of the Folk Tale. Austin: University of
historically contingent organizations and imperatives of social
Texas Press.
practice or performance. Additionally, writers in cultural studies
Rizzolati, Giacomo, Vittorio Gallese, Leonardo Fogassi, and Luciano Fadiga.
1996. Action recognition in the premotor cortex. Brain 119: 593609. commonly understand social construction as widely variable. In
Storey, Robert. 1996. Mimesis and the Human Animal: On the Biogenetic the most extreme versions, this variation may be seen as limited
Foundations of Literary Representation. Evanston, IL: Northwestern by little beyond the laws of physics.
University Press. An alternative view, often associated with evolutionary
Tsur, Reuven. 1997. Poetic rhythm: Performance patterns and their approaches to culture, takes a wide range of social practices to
acoustic correlates. Versiication 1: 1997. his electronic journal of be very narrowly constrained by genetic propensities. hese
543
Narrative Universals
propensities are thought to have resulted from adaptations that hese processes have adaptive value because they approximate
are speciically social. While writers in cultural studies tend to see the function of giving us access to other peoples states of mind.
social practices as quite variable, writers adopting this approach However, since they are mere approximative mechanisms, they
tend to see societies as manifesting a wide range of univer- are fallible a point that is highly consequential both in life and
sals. Language study has been one area in which universalism in stories. Consider narratives that focus particular attention on
has been prominent, though there has been some disagreement the relative opacity of others intentions (e.g., between lovers in
as to the precise evolutionary origins of language (see cases of misguided jealousy). If these recur cross-culturally (as
biolinguistics). they do), they do so not because adaptive mechanisms succeed
narratology has incorporated both tendencies. A com- but because they fail in certain systematic ways.
mon view in cultural studies is that narrative is socially con- nongenetic universals may also arise due to patterns in
structed and can vary widely from culture to culture. In contrast, childhood development that are not genetically programmed.
some researchers drawing on models from linguistics and psy- For example, we seem to have innate predispositions to emo-
chology have argued that there are remarkably consistent nar- tional attachment. However, our attachment responses are not
rative patterns across cultures. here are, however, diferences wholly hardwired. hey are shaped in some crucial ways by
between the study of narrative and the sorts of study that have childhood experiences, as a number of writers have stressed fol-
occupied linguists for example, the study of syntax. While lowing John Bowlby. here appear to be some crucial parameters
writers in cognitive linguistics have viewed syntactic prin- in our early childhood experiences that have lasting efects on
ciples as resulting from general cognitive structures and pro- the quality and durability of our attachments in later life. While
cesses, a common view within the ield is that there are some cultures may vary in the degree to which one or another parent-
aspects of cognitive architecture that are specially devoted to ing/attachment style predominates, it is inevitably the case that
syntactic processing (see autonomy of syntax). he case for every society has variation in parenting/attachment styles. hus,
an autonomy of narrative is much weaker. It seems much more insofar as narratives cross-culturally focus attention on social
likely that narrative results from the interaction of various cog- emotion, we would expect to ind cross-cultural expression of
nitive structures and processes that are not specially devoted to the same basic attachment styles.
narrative. As such, narrative is a less likely candidate for simple he preceding example indicates that there are two prob-
evolutionary analysis. Put diferently, if narrative results from lems with the usual framing of the division between those who
our cognitive abilities to draw causal inferences, to attribute claim that there are cross-cultural universals and those who
intentions, to imagine counterfactual or hypothetical situations, deny that claim. First, universality does not entail innateness. A
to adopt varying physical points of view, to simulate experiences, pattern may be universal without being genetically determined.
and so on, then it is less likely that there is any single adaptive Second, social construction does not entail cultural diference.
function for cross-cultural narrative patterns (comparable to the hough they include a genetic component, attachment styles are
commonly posited communicative function for language). socially constructed in that they result from the childs experi-
he point is consequential for a number of reasons, relating ence of parenting. Yet it seems likely that similar divisions of, for
to narrative and to other areas of study in the language sciences. example, secure and insecure attachment will recur everywhere,
Speciically, if narrative patterns are unlikely to be genetically even if they do so in diferent proportions. hese considerations
coded in any detail, one of two conclusions may be drawn: One suggest that common dichotomies regarding universality and
may simply see this as further evidence for the culturalist posi- cultural construction are false. he point has consequences for
tion, further reason to believe that narrative may vary across our understanding of universality in a range of areas, not only
cultures with few limitations. However, the existence or nonex- narrative.
istence of universals is an empirical issue. It cannot be decided Indeed, the point goes further. Research in group dynamics
a priori. If one believes that the evidence supports conclusions and elsewhere (see network theory; self-organizing
of universality, then one is likely to draw a diferent conclusion systems; pragmatics, universals in) suggests that many
from the indirectness of the relation between narrative and patterns may arise through convergent development (independ-
adaptation. Certainly, some universal patterns will derive more ent processes in diferent societies that give rise to parallel prac-
or less directly from aspects of cognitive architecture (e.g., causal tices) for example, patterns in the ways social networks operate
attribution) that are deined by genetic programs resulting from to deine in-groups and out-groups, intragroup inequality, inter-
selective pressure. Others will derive from commonalities in the acting subgroups, and so on. Given the importance of group
physical environment. But that is not all. antagonism, social hierarchy, and subgroup divisions for any
Sticking close to biology, we may note that some cross-cul- society and, thus, their importance for the lives of individual
tural patterns are likely to derive from the fact that adaptations agents we might expect narratives to emplot these relations fre-
are mechanisms, not functions which means that there are quently. Insofar as these relations derive from group dynamics,
cases where the mechanism fails. More exactly, the genetic pre- cross-cultural patterns of such emplotment would not result
dispositions that serve us so well in daily life do so because they from genetic predispositions per se but from convergent social
set out relatively simple procedures that approximate advanta- developments.
geous functions. For example, although evolutionary psy- With these points in mind, we might return to the story of
chologists often refer to our ability to read minds, we do not the tiger woman or jaguar woman, in Lvi-Strausss version.
directly know other peoples intentions. Rather, we engage in It is, as it turns out, framed by attachment (along with sexuality)
complex processes of simulating and inferring those intentions. and group opposition. Moreover, there are hints that the group
544
Narrative Universals
opposition may point either toward in-group hierarchy or toward force of arms. However, Layla is married to another man. When
in-group/out-group antagonism. Alexandra Aikhenvald reports Layla and Majnun die, they are reunited in paradise.
that at least among some Arawaks, members of one low-prestige
subgroup are referred to as people of jaguar (2006, 12). Walter Romance and Prototypes
Roth cites an Arawak proverb that identiies tigers with enemies Although this is only a tiny selection of narratives, it is signiicant
(1915, 367). Moreover, the story relies on, indeed elaborates on, in part due to their prominence in distinct narrative traditions.
the failure of mind reading, which is precisely what allows the (On complications of establishing this distinctness in literary
secret and the issue of spousal loyalty to arise in the irst place. study, see areal distinctness and literature.) We may
So there is certainly commonality here, commonality that makes already begin to see the ways in which these narratives may
cognitive sense. But is that all there is to it? After all, we knew that share certain prototypical characteristics. I say prototypi-
there was some commonality already. Is there any greater univer- cal characteristics because these examples do not suggest a set
sality to this story or to narrative patterns more broadly? of necessary and sufficient conditions but a gradient of
Attachment combined with sexual desire (as in pair bonding more or less standard cases. Speciically, one common sort of
or marriage) points toward a set of stories that recur across cul- narrative that recurs cross-culturally tends toward a prototype
tures. Perhaps we will get a better understanding of the issues if involving the following elements.
we consider some other stories of this sort, particularly paradig- We begin with two lovers. heir mutual interest combines
matic works from other traditions. (In a brief entry, we cannot sexuality and attachment. However, they face inhibition.
consider many stories, or other evidence for narrative universals. hat inhibition is frequently a matter of conlict with author-
For a range of cases, and for references to other accounts of nar- ity usually parental or religious or group division, or both.
rative universality, see Hogan 2004.) (he priming of religious igures due to the prototype may
explain their surprising presence as helpers in some stories.)
Four Romances he group division is itself regularly one of in-group hierarchy
To begin, lets consider what is almost certainly the paradigm of or in-group/out-group antagonism. Works that do not involve
romantic narrative in the English-speaking world Romeo and such a conlict commonly suggest it, as in akuntal. After a
Juliet. Romeo and Juliet fall in love. However, they are prevented brief union, the lovers are separated often, one is conined
from uniting by the group antagonism of their parents. With the to home while the other is sent away. In tragic versions, one or
help of a friar, they are briely united, but then Romeo is exiled both die. In comic versions, the separation may be associated
and Juliet is conined to the home. Juliet is to be married to a with death. During this separation, one lover proves himself
rival by her father. She fakes her death to escape this fate. Romeo (or, less often, herself) worthy of the beloved, sometimes by
returns, kills his rival, then commits suicide just at the moment defeating the rival (who may ultimately die). his demonstra-
when he might have been united with Juliet. Juliet, too, kills her- tion may overturn the disapproval of the parents or society. In
self, but after their deaths, the families are reunited. the end, the lovers are reunited and the conlicting families are
Now consider the Romance of the Western Chamber, Chinas reconciled.
most popular love comedy, both on stage and in print, begin- Of course, individual stories must vary this pattern. However,
ning in the twelfth century (Idema 2001, 800). Chang and Ying- as a standard case, it appears to be remarkably consistent across
Ying fall in love. Chang goes of to take the imperial exams. cultures and across time. he Arawak story varies the pattern
Meanwhile, a rival comes to marry Ying-Ying with her mothers more than the others we have considered. But it remains rec-
approval. Chang succeeds in the examination and returns to ognizable. he diferences are largely a matter of order. he
elope with Ying-Ying. He is successful due to the help of a monk. hunter proves himself worthy of the beloved through his suc-
he rival commits suicide (see Idema 2001, 798800). cessful hunting right at the outset (the point is related to Arawak
he Recognition of akuntal, the most revered work of cultural practices in which potential bridegrooms must prove
Sanskrit drama, begins when Duyanta and akuntal fall in themselves; see Roth 1915, 31516). he conlict occurs after
love. Duyanta worries that they cannot marry due to caste marriage, rather than before, and it is in part the fault of the
(thus, an internal group hierarchy). akuntal worries that they man for violating the trust of his wife and preferring his mothers
cannot marry due to her fathers disapproval. Both turn out to interests over hers, no matter how briely. Is there a reason for
be mistaken. hey are united. However, akuntal violates her these diferences? he comic form of the romantic plot involves
obligations to a holy man, who curses her with separation from suggestions of death or unending separation of the lovers
Duyanta. In consequence, akuntal is exiled, while Duyanta prior to their ultimate reunion in part because this intensiies
remains at home, sufering conlict with the demands of an earl- the inal joy of their union. he point is a simple matter of the
ier wife (thus, a rival). Duyanta defeats an army of demons psychology of emotion the joy of an outcome is intensiied by
in battle and is subsequently reunited with akuntal and the diiculty of achieving the outcome (see Ortony, Clore, and
their son. Collins 1988, 73) and by the gradient of change from a previous
In the Arab and Muslim world, few stories have been as popu- emotional state. he same point holds for the Arawak story, but
lar and inluential as that of Layla and Majnun. Layla and Majnun in reverse. Here, the atypically early union, the apparently hap-
fall in love, but Laylas father refuses the marriage. Majnuns pily ever after condition of the couples married life, and the
father tries to cure Majnun of his love madness through religion, hunters subsequent tragic error serve to intensify the pathos of
but Majnun only calls on God to make him worship Layla more. the conclusion.
Majnun wanders the desert, eventually trying to win Layla by
545
Narrative Universals Narratives of Personal Experience
Understanding Narrative Universals narrative universals would seem to have consequences for our
Perhaps surprisingly, the explanation of the Arawak storys dif- understanding of universals elsewhere and for our understand-
ference from the cross-cultural prototype begins to suggest why ing of the place of both biology and social construction in an
there is a cross-cultural prototype to begin with. Cross-culturally, account of universals.
there are two common purposes of narrative verbal art the
Patrick Colm Hogan
communication of emotionally satisfying experiences (roughly,
a psychological purpose) and the treatment of thematically sig-
niicant issues, often ethical or political (roughly, a social pur-
pose). he explanation of narrative universals bears importantly Aikhenvald, Alexandra. 2006. A Grammar of Tariana. Cambridge:
on these two elements.
Bowlby, John. 1982. Attachment and Loss. New York: Basic Books
Narratives involve sequences of action engaged in by inten-
Hogan, Patrick Colm. 2004. he Mind and Its Stories: Narrative Universals
tional agents pursuing goals that we share and that engage
and Human Emotion. Cambridge: Cambridge University Press.
us emotionally. One thing that cross-cultural patterns sug- . Afective Narratology: he Emotional Structure of Stories.
gest is that these narrative goals are much more limited, and Lincoln: University of Nebraska Press. In press.
much more cross-culturally widespread, than one might have Idema, Wilt. 2001. Traditional dramatic literature. In he Columbia
imagined. For example, they include union with a partner in History of Chinese Literature, ed. Victor Mair, 785847. New
an enduring relationship that is both sexual and founded in York: Columbia University Press.
attachment thus, romantic love (on other happiness goals, the Lvi-Strauss, Claude. 1973. From Honey to Ashes (Introduction to a
related emotions, and the associated narrative structures, see Science of Mythology: 2). Trans. John and Doreen Weightman. New
Hogan 2004). he precise development of narratives results in York: Harper & Row.
Ortony, Andrew, Gerald Clore, and Allan Collins. 1988. he Cognitive
part from the means necessary to intensify emotional experi-
Structure of Emotions. Cambridge: Cambridge University Press.
ences such as creating a relatively sharp change from separa-
Roth, Walter. 1915. An Inquiry into the Animism and Folk-Lore of the
tion anxiety to reunion, enhancing conlict by involving people
Guiana Indians: hirtieth Annual Report of the Bureau of American
who themselves have attachment bonds (e.g., parents and chil- Ethnology to the Secretary of the Smithsonian Institution: 19081909.
dren), and so on. Washington, DC: Government Printing Oice, 103453.
Again, the development of romantic narratives also crucially
includes real social concerns. Most obviously, these involve in-
group/out-group divisions and group hierarchies, which pre-
NARRATIVES OF PERSONAL EXPERIENCE
sumably result from group dynamics. But group organization he study of narrative extends over a broad range of human
does not delimit the entire social world. Individual biological activities: novels, short stories, poetic and prose epic, ilm, folk-
endowments, developmental idiosyncrasies, and experiential tale, interviews, oral memoirs, chronicles, histories, comic strips,
accidents in later life guide personal ailiations. here is, in con- graphic novels, and other visual media. hese forms of commu-
sequence, no way of guaranteeing that personal ailiations will nication may draw upon the fundamental human capacity to
conform to the principles of group hierarchization or in-group/ transfer experience from one person to another through oral
out-group antagonism. Societies are, then, condemned to face narratives of personal experience.
conlicts between interpersonal attachments and the segrega- A focus on spontaneous recounting of experience was greatly
tions imposed by social organization. Romantic plots tell the stimulated by the development of sociolinguistic research in
story of that conlict. the 1960s, designed to capture the closest approximation to
In sum, there seem to be signiicant narrative universals the vernacular of unmonitored speech. Narratives of personal
(many, of course, statistical; others absolute). hese uni- experience were found to reduce the efects of observation to a
versals arise from a complex interaction of factors, including minimum (Labov 2001). Since then it has appeared that such
biological endowment (e.g., in basic emotional responses), pat- narratives are delivered with a similar organization in a wide vari-
terns in childhood development, and convergent developments ety of societies and cultures as, for example, in the Portuguese of
arising through group dynamics. In this way, narrative univer- ishermen in northeastern Brazil (Maranho 1984). he follow-
sals are in part derived from biological adaptations. However, ing discussion of oral narratives is based on the initial analysis of
they are no less derived from social constructions, which are William Labov and Joshua Waletzky (1967), as developed further
themselves universal. An understanding of narrative universals in the suggested reading.
is important for at least three reasons: 1) Narratives are a cen- he discussion irst treats the structural organization of nar-
tral part of human life everywhere. Understanding narratives is rative (temporal organization, orientation, coda), then turns to
therefore crucial to understanding the human mind and human the evaluative component and inally to the construction of nar-
experience. 2) he precise narrative universals we discover tell rative as a folk theory of causality instrumental to the assignment
us some surprising things about human society. For example, of praise and blame.
it is striking that most romantic plots develop our sympathy for
the lovers, not for the society. his suggests not only that certain Structural Organization
sorts of conlict are inevitable in society but also that we share A narrative is deined here as one way of recounting past events,
a deep sympathy with individuals or couples working against in which the order of narrative clauses matches the order of
social hierarchization and group antagonism a surprising and events as they occurred. Example (1) is a minimal narrative orga-
in many ways hopeful fact. Finally, 3) the complex nature of nized in this way:
546
Narratives of Personal Experience
(1) a. Well, this man had a little too much to drink fact realized. Frequently, such evaluative clauses are concen-
b. and he attacked me trated in an evaluation section, suspending the action before
c. and a friend came in a critical event and establishing that event as the point of the
d. and she stopped it. narrative.
Evaluative clauses vary along a dimension of objectivity. At
he same events could have been reported in the non-narrative
one extreme, narrators may interrupt the narrative subjectively
order c,d,a,b as in (2), which employs a variety of grammatical
by describing how they felt at the time:
devices within a single clause.
(8) a. I couldnt handle any of it
(2) A friend of mine came in just in time to stop this person who
b. I was hysterical for about an hour and a half
had had a little too much to drink from attacking me.
In a more objective direction, narrators may quote themselves
Narrative structure is established by the existence of temporal
(I said to myself his is it), or with more credibility, cite a third
juncture between two independent clauses. Temporal juncture
party witness, as in (5). At the other extreme, objective events
is said to exist between two such clauses when a change in the
speak for themselves, as in the account of a plane developing
order of the clauses produces a change in the interpretation of
motor trouble over Mexico City:
the order of the referenced events in past time. hese are nar-
rative clauses. Narrative clauses respond to a potential question (9) And you could hear the prayer beads going in the back of the
what happened then? and form the complicating action of the plane.
narrative.
Evaluation provides justiication for the narratives claim on
A narrative normally begins with an orientation, introduc-
a greater portion of conversational time than most turns of talk,
ing and identifying the participants in the action: the time, the
requiring an extended return of speakership to the narrator until
place, and the initial behavior. The orientation section pro-
it is inished (Sacks 1992). Evaluation thus provides a response
vides answers to the potential questions who? when? where?
to the potential question So what? (Spanish Y que?; French Et
what were they doing? In the minimal narrative (1), the first
alors?).
clause (a) is the orientation. More information is usually
Narratives of personal experience normally show great vari-
provided:
ation in the length of time covered by the clauses in the orien-
(3) a. my son has awell, it was a fairly new one then. tation, complicating the action and evaluation sections, ranging
b. Its a 60 cc Yamaha. from decades to minutes to seconds. Sequences of clauses of
c. and it could move pretty good. equal duration may be termed chronicles; these are not designed
d. his fella and I were going down the road together to report and evaluate personal experience.
he end of a narrative is frequently signaled by a coda, a state-
Reportability and Credibility
ment that returns the temporal setting to the present, precluding
A reportable event is one that itself justiies the delivery of the
the question and what happened then?
narrative and the claim on social attention needed to deliver it.
(4) a. And you know the man who picked me out of the water? Some events are more reportable than others. he concept of
b. Hes a detective in Union City, reportability or tellability (Norrick 2005) is relative to the situation
c. and I see him every now and again. and the relations of the narrator with the audience. At one end of
the scale, death and the danger of death are highly reportable in
almost every situation. At the other end, the fact that a person ate
Evaluation a banana for lunch might be reportable only in the most relaxed
Most adult narratives are more than a simple reporting of events.
family setting. Most narratives are focused on a most reportable
A variety of evaluative devices are used to establish the evalua-
event. Yet reporting this event alone does not make a narrative; it
tive point of the story (Polanyi 1989). hus, we ind that narra-
only forms the abstract of a narrative.
tives, which are basically an account of events that happened,
For a narrative to be successful, it cannot report only the most
frequently contain irrealis clauses negatives, conditionals,
reportable event. It must also be credible if the narrative is not
futures which refer to events that did not happen or might have
to be rejected as a whole by the listener. here is an inverse rela-
happened or had not yet happened:
tionship between reportability and credibility: he more report-
(5) And the doctor just says Just that much more, he says, and able, the less credible. Narrators have available many resources
youd a been dead. to enhance credibility. In general, the more objective the evalua-
tion, the more credible the event.
(6) Ill tell you if I had ever walloped that dog Id have felt some
bad. Narrative Preconstruction
When a narrator has made the decision to tell a narrative, he or
(7) a. And he didnt come back.
she must solve the fundamental and universal problem: Where
b. And he didnt come back.
should I begin? he most reportable event, which will be desig-
Irrealis clauses serve to evaluate the events that actually did nated henceforth as e0, is most salient, but one cannot begin with
occur in the narrative by comparing them with an alternate it. Given the marked reportability of e0 and the need to estab-
stream of reality: potential events or outcomes that were not in lish its credibility, the narrator must answer the question How
547
Narratives of Personal Experience Narratology
did this (remarkable) event come about? he answer requires a insertions of pseudoevents and removing them, detecting dele-
shift of focus backwards in time to a precursor event e-1, which is tions and replacing them, and exchanging excuses for the action
linked to e0 in the causal network in which events are represented excused. It is then possible to approximate the original chain of
in memory (Trabasso and van den Broek 1985). In traversing this events on which the narrative is based. A useful exercise is to
network in reverse, the causal links found may be event-to-goal, develop a complementary sub rosa case in the interests of the
goal-to-attempt, or attempt-to-outcome. he process will con- antagonist. he comparison of these two constructions deepens
tinue recursively to e-2, e-3, and so on. until an ordinary, mun- our understanding of how narrative skills are enlisted to trans-
dane event e-n is reached, for which the question Why did you form the social meaning of events without violating our commit-
do that? is absurd, since en is exactly what we would expect the ment to a faithful rendering of the past.
person to do in the situation described. he event en is, of course,
William Labov
the orientation. hus, a narrator telling of a time he was on shore
leave in Buenos Aires begins,
(10) a. Oh, I was settin at a table drinkin.
Gofman, Erving. 1981. Forms of Talk. Oxford: Blackwell.
Labov, William. 2001. he Social Stratiication of English in New York City.
TRIGGERING EVENTS. Given the mundane and nonreportable 2d ed. Cambridge: Cambridge University Press.
character of the orientation, it follows that the irst link in the . 2003. Uncovering the event structure of narrative. Georgetown
University Round Table 2001. Ed. Deborah Tannen and James Alatis,
causal chain is a triggering event, which drives the narrative
6383. Washington, DC: Georgetown University Press. his article
along the chain toward the most reportable event. hus, (10) is
develops the search for the events underlying the narrative.
followed by (11):
. 2004. Ordinary events. In Sociolinguistic Variation: Critical
(11) b. an this Norwegian sailor come over Relections. ed. C. Fought, 3143. Oxford: Oxford University Press. An
c. an kep givin me a bunch o junk about how I was sittin exploration of the evaluative efect of inserting ordinary events into
narrative.
with his woman.
. 2006. Narrative preconstruction. Narrative Inquiry 16: 3745. A
How ordinary situations like (10) can give rise to the report- fuller development of this topic.
able and violent events that followed is a mystery that narrative Labov, William, and Joshua Waletzky. 1967. Narrative analysis. In Essays
analysis can only contemplate, since they are part and parcel of on the Verbal and Visual Arts, ed. J. Helm, 1244. Seattle: University
the contingent character of history. of Washington Press. Repr. Journal of Narrative and Life History 7
(1997): 338.
Maranho, Tulio. 1984. he force of reportive narratives. Papers in
The Transformation of Experience Linguistics 17 (3): 23565.
he participants in many narratives include protagonist, Norrick, Neal R. 2005. he dark side of tellability. Narrative Inquiry
antagonist, and third-party, witnesses, of which the irst is the 15: 32344.
most complex. Elaborating on Gofman (1981, 1445), one can Polanyi, Livia. 1989. Telling the American Story. Cambridge, MA: MIT
identify many egos present: the self as original author of the Press.
narrative and its immediate animator; the self as actor; the self Sacks, Harvey. 1992. Lectures on Conversation. Vols. 1 and 2. Ed. Gail
as generalized other (normally as you); the anti-self as seen Jeferson. Oxford: Blackwell.
by others; and the principal, the self in whose interest the story Trabasso, T., and P. van den Broek. 1985. Causal thinking and the
is told. hat interest is normally advanced through a variety representation of narrative events. Journal of Memory and Language
of techniques that do not require any alteration in the truth- 27: 122.
fulness of the events reported. he re-creation of the causal
network involves the assignment of praise and blame for the
NARRATOLOGY
critical events and their outcomes. Most narratives of conlict
involve linguistic devices that contribute to the polarization he French term narratologie (formed in parallel with biology,
of protagonist and antagonist, though within the family, other sociology, etc. to denote the study of narrative) was coined by
linguistic forms lead to the integration of participants. he Tzvetan Todorov in his 1969 book Grammaire du Dcamron.
devices used to adjust praise and blame include most promi- he early narratologists participated in a broader structuralist
nently the deletion of events, an operation that can often be revolution that sought to use Saussurean linguistics as a pilot
detected by close reading. Key elements in further manipula- science for studying diverse forms of cultural expression, which
tion are the grammatical features of voice: active versus pas- structuralist theorists characterized as rule-governed signifying
sive, but also zero causatives that assign agency (He drove practices or languages in their own right (see structuralism;
through town with a chaufeur) or verbs that imply the exer- Culler 1975). Likewise, narratologists such as Todorov, Roland
tion of authority and resistance to it (My dad let me go with Barthes, Claude Bremond, Grard Genette, and Algirdas Julien
him). Other narrative devices function to increase the impres- Greimas, adapted Ferdinand de Saussures distinction between
sion of agency: pseudoevents that may not correspond to any la parole and la langue to construe particular stories as individ-
physical event (I turned to him and, I took this girl and, I ual narrative messages supported by an underlying semiotic
started to hit him but). code (see semiotics). And just as Saussurean linguistics privi-
Narrative analysis can show how the prima facie case is built leged code over message, focusing on the structural constituents
to further the interests of the principal. his involves detecting and combinatory principles of the semiotic system of language,
548
Narratology
rather than on situated uses of that system, structuralist narratol- between nuclei and catalyzers in his 1966 Introduction to the
ogists privileged narrative in general over individual narratives, Structural Analysis of Narratives (Barthes [1966] 1977). Renamed
emphasizing the general semiotic principles according to which kernels and satellites by Chatman (1978), these terms refer to core
basic structural units (characters, states, events, actions, etc.) are and peripheral elements of story content, respectively. Delete or
combined and transformed to yield speciic narrative texts. add to the kernel events of a story and you no longer have the
In this brief overview, I trace in further detail some of the same story; delete or add to the satellites and you have the same
developments from which structuralist narratology took rise story told in a diferent way. Related to Tomashevskiis work on
and outline key contributions by early theorists. I also review free versus bound motifs, Viktor Shklovskiis early work on plot
limitations of the structuralist approach to narrative inquiry as a structuring device established one of the grounding assump-
limitations that manifested themselves as story analysts began tions of structuralist narratology: namely, the fabula-sjuzhet or
to engage more fully with recent research in the language sci- story-discourse distinction (see story and discourse), that
ences, among other areas of study. To map the evolution of the is, the distinction between the what and the how, or what is being
ield, I draw a distinction between classical and postclassical told versus the manner in which it is told.
approaches to narratological analysis (cf. Herman 1999). Classical Another important precedent was furnished by Vladimir
narratology encompasses the tradition of research, rooted in Propps Morphology of the Folktale ([1928] 1968), whose irst
Russian Formalist literary theory as well as earlier precedents, English translation appeared in 1958. Propp distinguished
that was extended by structuralist narratologists starting in the between variable and invariant components of higher-order nar-
mid-1960s and reined and systematized up through the early rative structures more speciically, between changing dramatis
1980s by scholars such as Mieke Bal, Seymour Chatman, Wallace personae and the unvarying plot functions performed by them
Martin, Gerald Prince, and others. he Anglo-American tradition (e.g., act of villainy, punishment of the villain, etc.). In all, Propp
of scholarship on ictional narrative can also be included under abstracted 31 functions, or character actions deined in terms of
the rubric of classical approaches, though for reasons of space, their signiicance for the plot, from the corpus of Russian folktales
this discussion focuses mainly on the Formalist-structuralist tra- that he used as his data set; he also speciied rules for their distri-
dition. Postclassical narratology, meanwhile, designates frame- bution in a given tale. His approach constituted the basis for later
works for narrative research that build on this classical tradition accounts of narrative structure. For instance, extrapolating from
but supplement it with concepts and methods that were unavail- what Propp had termed spheres of action, Greimas ([1966]
able to story analysts during the heyday of structuralism. In 1983) sought to create a typology of general behavioral roles to
developing postclassical approaches, which not only expose the which particularized actors in narratives could be reduced. He
limits but also exploit the possibilities of older models, theorists initially identiied a total of six roles (which he termed actants)
of narrative have drawn on a range of ields, from gender theory, underlying individual narrative actors: subject, object, sender,
philosophical ethics, and comparative media studies to socio- receiver, helper, and opponent.
linguistics, the philosophy of language, and cognitive science.
Given the focus of the present encyclopedia, I concentrate here Establishing the Field: Structuralist Narratology
on productive synergies between postclassical narratology and I have already begun to discuss how the structuralist narratolo-
research in the language sciences. gists built on Russian Formalist ideas to help consolidate what I
am referring to as the classical tradition of research on narrative.
The (Recent) Prehistory of Narratology As originally conceived (cf. Barthes [1966] 1977), the new science
he Russian Formalists authored a number of pathbreaking of narratology aimed to be not a school or method of literary crit-
studies that served as foundations for narratological research. icism that is, not a way of interpreting novels or other specii-
Crucially, the Formalists sought to create a stylistics suit- cally literary narratives but, rather, a transmedial investigation
able for larger verbal structures found in prose narratives of all of stories of all kinds, naturally occurring as well as artistically
sorts, from Leo Tolstois historically panoramic novels to tightly elaborated, verbal (spoken or written) as well as image based,
plotted detective novels to (Russian) fairy tales. his widened painted as well as ilmed. It also aimed to be transcultural and
investigative focus would prove to be a decisive development transgeneric, investigating everything from legends and fables
in the history of modern-day narratology. he new focus helped to epics and tragedies. Ethnographic and sociological impulses,
uncouple theories of narrative from theories of the novel, shifting relecting the linguistic, anthropological, and folkloristic bases
scholarly attention from a particular genre of literary writing to for structuralist analysis of narrative, reveal themselves, when
all discourse or, in a broader interpretation, all semiotic activities Barthes writes: All classes, all human groups, have their narra-
that can be construed as narratively organized. he Formalists tives, enjoyment of which is very often shared by men with difer-
thus set a precedent for the transgeneric and indeed transmedial ent, even opposing, cultural backgrounds ([1966] 1977, 79).
aspirations of French structuralist theorists such as Bremond Narratologys grounding assumption is that a common, more
and Barthes, who came later. or less implicit model of narrative explains peoples ability to
Not only was the general orientation of Formalist research recognize and interpret many diverse productions and types
narratologically productive; more than this, speciic Formalist of artifacts as stories; the same model allows them to compare
concepts were taken over more or less directly by structuralist an anecdote with a novel or an opera with an epic. In turn, the
story analysts. For example, in distinguishing between bound raison dtre of narratological analysis is to develop an explicit
(or plot-relevant) and free (or non-plot-relevant) motifs, characterization of the model underlying peoples intuitive
Boris Tomashevskii provided the basis for Barthess distinction knowledge about stories, in efect providing an account of what
549
Narratology
constitutes humans narrative COMPETENCE. Hence, having the storyworld. In singulative narration, there is a one-to-one
conferred on linguistics the status of a founding model ([1966] correspondence between these frequency rates; in repetitive
1977, 82), Barthes identiies for the narratologist the same object narration, events are recounted more often than they occur;
of inquiry that (mutatis mutandis) Saussure had speciied for the and in iterative narration, events that happen more than once
linguist: the code or system from which the ininity of narrative are recounted fewer times than the frequency with which they
messages derives and on the basis of which they can be under- actually occur.
stood as stories in the irst place. For all the gains it achieved by drawing on linguistics as a pilot
Narratologists like the early Barthes used structuralist linguis- science (or, rather, as a metaphor for disciplinary practice), how-
tics not just to identify their object of analysis but also to elabo- ever, structuralist narratology was also limited by the linguistic
rate their method of inquiry. In this connection, the adaptation of theories it treated as exemplary. Barthes unintentionally reveals
structuralist-linguistic concepts and methods was to prove both the limits of structuralist narratology when he remarks that a
enabling and constraining. On the positive side, the example of narrative is a long sentence, just as every constative sentence is in
linguistics did provide narratology with a productive vantage a way the rough outline of a short narrative, suggesting that one
point on stories, afording terms and categories that generated inds in narrative, expanded and transformed proportionately,
signiicant new research questions. For example, the linguistic the principal verbal categories: tenses, aspects, moods, persons
paradigm furnished Barthes with what he characterized as the ([1966] 1977, 84). By contrast, post-Saussurean language theory
decisive concept of the level of description (Barthes [1966] has underscored that certain features of the linguistic system
1977: 8588). Imported from grammatical theory, this idea sug- conversational implicatures, discourse anaphora, pro-
gests that a narrative is not merely a simple sum of propositions tocols for turn-taking in conversation (see adjacency pair),
but, rather, a complex structure that can be analyzed into hierar- and so on emerge only at the level beyond the sentence. In
chical levels in the same way that a natural-language utterance other words, attempting to bring to bear on narrative texts a
can be analyzed at the level of its syntactic, its morphological, code-centered linguistics that ignores distinctive features of lan-
or its phonological representation. Barthes himself distin- guage in use, the early narratologists lacked crucial resources for
guishes three levels of description. At the lowest or most gran- the analysis of stories. he problem, then, is not with the origi-
ular level are basic meaning-bearing elements that he termed nal intuition of the narratologists namely, that linguistics can
functions, which can be mapped out both distributionally and serve as a pilot science for narratological research. he problem,
in terms of paradigmatic classes; then come characters actions rather, is with the particular linguistic concepts they used to lesh
that collocate to form narrative sequences; and inally there is out that intuition.
the level of narration, or the proile that narrative assumes when
viewed as a communicative process. Beyond Structuralism: Postclassical Narratology and the
Likewise, Genette ([1972] 1980) drew on a broadly gram- Sciences of Language
matical paradigm in using the categories of tense, mood, and Ironically, the narratologists embraced Saussures structural-
voice to characterize the relations among the story (= the basic ist linguistics as their point of reference just when its deicien-
sequence of states, actions, and events recounted), the text on cies were becoming apparent in the domain of linguistic inquiry
the basis of which interpreters reconstruct that story, and the itself. he limitations of the Saussurean paradigm were thrown
act of narration that produces the text. Indeed, Genettes work into relief, on the one hand, by emergent formal (e.g., genera-
in the area of narrative temporality constitutes one of the truly tive-grammatical) models for analyzing language structure
outstanding achievements in the ield. Developing distinctions (see generative grammar). On the other hand, powerful
that bear an interesting resemblance to Hans Reichenbachs tools were being developed in the wake of Ludwig Wittgenstein,
(1947) discriminations among event time, reference time, and J. L. Austin, H. P. Grice, John Searle, and other post-Saussurean
speech time, Genette focuses on two kinds of temporal rela- language theorists interested in how contexts of language use
tionships: 1) that between between narration and story and bear on the production and interpretation of socially situated
2) that between text and story. In connection with the irst, utterances. heorists working in this tradition began to question
Genette distinguishes between simultaneous, retrospective, what they viewed as counterproductive modes of abstraction and
prospective, and intercalated modes of narration; in connec- idealization in both structuralist linguistics and the Chomskyan
tion with the second, he develops the categories of duration, paradigm that displaced it. Indeed, the attempt by later narrative
order, and frequency. Duration can be computed as a ratio scholars to incorporate ideas about language and communica-
between the length of time that events take to unfold in the tion that postdate structuralist research has been a major factor
world of the story and the amount of text devoted to their nar- in the advent of postclassical models for research on stories and
ration, with speeds ranging from descriptive pause to scene to storytelling. To put the same point another way, one reason for
summary to ellipsis. Order can be analyzed by matching the the shift from classical to postclassical narratology has been an
sequence in which events are narrated against the sequence ongoing efort to move from using linguistics as a metaphor for
in which they can be assumed to have occurred, yielding narrative research to using linguistic models in the actual prac-
chronological narration, analepses or lashbacks, and pro- tice of narratological inquiry.
lepses or lashforwards, together with various subcategories he following are just some of the domains of narratological
of these nonchronological modes. Finally, frequency can be research in which theorists have begun to import concepts and
calculated by measuring how many times an event is narrated methods from the modern-day sciences of language, in an efort
against how many times it can be assumed to have occurred in to build models with greater descriptive and explanatory power
550
Narratology
than those developed by the classical narratologists. In each of a greater awareness that narratology is itself one of the sciences
these domains, story analysts are working to adopt more narra- of language speciically, the domain of inquiry whose focal con-
tive-appropriate tools from the language sciences that is, tools cern is narratively organized sign systems across all media and
that can throw light on how narrative, as a distinctive kind of lan- communicative settings. Once a subdomain of literary study,
guage use, constitutes a cognitive and communicative resource narratology is now coming into its own as the comprehensive
by means of which human beings make sense of themselves, one science of narrative-pertinent phenomena originally envisioned
another, and the world. by the structuralist narratologists.
Evidence of this reconiguration of the ield can be found in
Narrative Comprehension: To explore aspects of narrative
narratologists reengagement with natural-language narratives
processing, story analysts have drawn on a range of theoreti-
told in contexts of face-to-face interaction. Although William
cal frameworks that were unavailable to the structuralist nar-
Labov and Joshua Waletzky (1967) developed their model for the
ratologists, including artiicial intelligence research (work on
analysis of narratives told in contexts of face-to-face communica-
knowledge representations), accounts of mental models,
tion just as structuralist narratologists were proposing their key
cognitive linguistics, and research on text processing.
ideas, and although the Labovian model has been extraordinarily
For example, Catherine Emmott (1997) presents a powerfully
inluential in social-scientiic research for some four decades, ini-
integrative theory of narrative comprehension as a process of
tially there was little interaction between sociolinguistic research
using textual cues to build and update complex mental rep-
on storytelling and other traditions of narrative scholarship. But
resentations that she terms contextual frames, which contain
now there is increasing interest in building an integrative theory
information about narrative agents, their situation in time
that can accommodate both the study of written, literary narra-
and space, and their relationships with one another. Mark
tives and the analysis of everyday storytelling (Fludernik 1996;
Turner (2003), meanwhile, relates story comprehension to
Herman 2002). At the same time, among researchers concerned
more general cognitive processes that involve conceptual
with face-to-face narrative communication, there has been a shift
blending.
analogous to the one I have characterized as a transition from
Speech and hought Representation: Narratologists have
classical to postclassical approaches. Precipitating this shift is
drawn on ields ranging from dialectology, pragmatics,
the recognition that the Labovian model captures one important
discourse analysis (linguistic), and historical
subtype of natural-language narratives namely, stories elicited
linguistics to study aspects of speech representation in
during interviews but does not necessarily apply equally well
narrative, including dialect representations and ictional
to other storytelling situations, such as informal conversations
portrayals of scenes of conversational interaction. Likewise,
between peers, he-said-she-said gossip, conversations among
to study representations of characters mental function-
family members at the dinner table, or, for that matter, written,
ing, analysts have begun to work toward a rapprochement
literary texts (see conversation analysis).
between narratological theory and ideas from cognitive and
his convergence of sociolinguistic, discourse-analytic, and
social psychology, research on emotion, cognitive linguistics,
narratological research suggests that narratology is now coming
and other frameworks for inquiry.
into its own as a bona ide member of the language sciences. As
Focalization heory: Initially given impetus by Genettes
such, its chief aim is to enhance our understanding of stories not
([1972] 1980) attempt to reformulate theories of narrative
only as a means of artistic expression or a resource for communi-
perspective or point of view in more rigorous terms, focal-
cation but also as a fundamental human endowment.
ization theory has in recent years taken on an increasingly
interdisciplinary proile. Manfred Jahn (1996) has drawn on David Herman
the cognitive science of vision to propose a powerful account
of the perspective-marking features of narrative. Meanwhile,
David Herman (2009) uses ideas from cognitive grammar Barthes, Roland. [1966] 1977. Introduction to the structural analysis
to propose reinements to Genettes theory. of narratives. In Image Music Text, trans. Stephen Heath, 79124.
Quantitative, Corpus-Based Research: Story analysts have New York: Hill and Wang.
begun to work with large text corpora (see corpus lin- Chatman, Seymour. 1978. Story and Discourse: Narrative Structure in
Fiction and Film. Ithaca, NY: Cornell University Press.
guistics ) to study whether the distributional facts support
Culler, Jonathan. 1975. Structuralist Poetics: Structuralism, Linguistics,
accounts proposed by earlier narratologists on the basis of
and the Study of Literature. Ithaca, NY: Cornell University Press.
their own readerly intuitions. On the one hand, hypothesis-
Emmott, Catherine. 1997. Narrative Comprehension: A Discourse
driven approaches to corpus study use a top-down method, Perspective. Oxford: Oxford University Press.
attempting to map assumed categories of structure onto spe- Fludernik, Monika. 1996. Towards a Natural Narratology. London:
ciic texts or corpora to test the validity of prior theories. On Routledge.
the other hand, bottom-up approaches, seeking to reduce Genette, Grard. [1972] 1980. Narrative Discourse: An Essay in Method.
theoretical presuppositions to a minimum, work to induce Trans. Jane E. Lewin. Ithaca, NY: Cornell University Press.
categories and models from surface features that can be Greimas, Algirdas-Julien. [1966] 1983. Structural Semantics: An Attempt
identiied through automated analysis of narrative corpora. at a Method. Trans. Danielle McDowell, Ronald Schleifer, and Alan
Velie. Lincoln: University of Nebraska Press.
It is not just that narratologists have begun to engage more fully Herman, David. 1999. Introduction. In Narratologies: New Perspectives
with concepts and methods from the language sciences, how- on Narrative Analysis, ed. David Herman, 130. Columbus: Ohio State
ever; more than this, the ield is currently being revolutionized by University Press.
551
Nationalism and Language
. 2002. Story Logic: Problems and Possibilities of Narrative. whole set of ethnolinguistic nationalisms in Europe immedi-
Lincoln: University of Nebraska Press. ately afterwards. hough the historical diferences between the
. 2009. Cognitive approaches to narrative analysis. In Cognitive various social movements cannot be elided, they were inspired
Poetics: Goals, Gains, and Gaps, ed. Geert Brne and Jeroen Vandaele, by a number of German post-Kantian idealist thinkers. J. G.
79118. Berlin: Mouton de Gruyter.
Herders assertion in 1768 that each national language forms
Jahn, Manfred. 1996. Windows of focalization: Deconstructing and
itself in accordance with the ethics and manner of thought of
reconstructing a narratological concept. Style 30.3: 24167.
Labov, William, and Joshua Waletzky. 1967. Narrative analysis: Oral ver-
its people (2002, 50) was an important articulation of the link
sions of personal experience. In Essays on the Verbal and Visual Arts, between language and nation; by the time that William von
ed. June Helm, 1244. Seattle: University of Washington Press. Humboldt gave his deinition of a nation in 1836 (a body of men
Propp, Vladimir. [1928] 1968. Morphology of the Folktale. Trans. Laurence who form language in a particular way [1988, 153]), the connec-
Scott, rev. Louis A. Wagner. Austin: University of Texas Press. tion appeared almost axiomatic. In 1808, J. G. Fichte spelled out
Reichenbach, Hans. 1947. Elements of Symbolic Logic. New the political signiicance of linguistic nationalism by arguing that
York: Macmillan. wherever a separate language is found, there a separate nation
Turner, Mark. 2003. Double-scope stories. In Narrative heory and the exists, which has the right to take charge of its independent
Cognitive Sciences, ed. David Herman, 11742. Stanford, CA: CSLI. afairs and to govern itself (1968, 49). he implications of the
doctrine were realized in the role that it played in national inde-
NATIONALISM AND LANGUAGE pendence campaigns conducted by Greeks, Czechs, Hungarians,
Bulgarians, Ukrainians, Finns, Norwegians, Afrikaners, and
While the link between language and nationality is often pre-
the Irish. Some postcolonial activists today, the Kenyan writer
sented as though it developed at some primordial point in the
Ngg wa hiongo, for example, use the same model of linguistic
past, its appearance is, in fact, quite recent. his is hardly surpris-
nationalism in their contemporary struggle, not so much against
ing inasmuch as the conception of the nation itself is relatively
colonialism but in order to counter the legacy of colonial rule.
modern. hus, the idea that language is the medium by which
Andersons account of the nation as an imagined commu-
nationality is established, that language is the key to the nation,
nity drew attention to the constructedness of the concept by
has to be traced historically. Two distinct but related contexts
pointing to its precise historical origins. Yet the role of language
may serve as examples concerning how and why the connection
in the imagining of the community of the nation is also one that
was made.
arises at particular moments in history and serves speciic func-
In the sixteenth century, the Tudor monarchy sought to exer-
tions; it is neither transhistorical nor general. It is also worth
cise its dominion over Ireland, a colony which had been nomi-
noting that the conception of language underpinning this act of
nally under English rule since 1169 but which had never quite
imagination is one that has been criticized. hus, M. M. Bakhtin,
been successfully subjugated. Part of its centralizing project was
in the important essay Discourse in the Novel, points to the fact
the imposition of English upon the whole of the island of Ireland
that national languages are produced by various types of insti-
on the ground that the use of the native language, Gaelic, along
tutional forces intellectual (linguistic theorizing), educational
with other cultural factors such as behavior and dress, led the
(grammars and dictionaries), political (legislation) which act
Irish to think of themselves as being of sundry sorts, or rather of
centripetally in order to create a determinate, ixed, and know-
sundry countries (Statutes 1786, 28H8.cxv) rather than as mem-
able form. As part of this process, the realities of heteroglossia
bers of one polity united under the English crown. his stress on
(see dialogism and heteroglossia) social diference
the signiicance of linguistic diference, embodied in the Act for
inscribed in language by means of variation past and present
the English Order, Habit and Language (1537), formed the basis
have to be banished. Historians, such as E. J. Hobsbawm (1990),
of the English policy of linguistic colonialism in Ireland, but, of
have noted the historical signiicance of such linguistic selection
equal importance, it heralded the connection between language
and ranking, while linguistic anthropologists have drawn atten-
and national identity. In his 1617 Itinerary, Fynes Moryson,
tion to the fact that the homogeneous language of nationalism is
an English adventurer in Ireland, articulated the lesson that the
as imaginary as the community that accompanies it (Irvine and
colonialists learned from their struggle to impose English lan-
Gal 2000).
guage and order: [C]ommunion or diference of language hath
he extent to which such insights will have an impact in
always been observed a special motive to unite or alienate the
political and linguistic thought remains to be seen. It is certainly
minds of all nations. And in general all nations have thought
the case, however, that the postulated relationship between lan-
nothing more powerful to unite minds than the community of
guage and nation is now treated much more skeptically. At the
language (Moryson [1617] 1903, 213). Under speciic historical
reactionary edge of forms of linguistic nationalism, there are still
conditions the clash between an early modern nation-state and
those who argue for the purity of language as a way of guar-
one of its colonies linguistic diference came to signify national
anteeing the integrity of the nation. But the very fact that the
diference through the operation of military and discursive
vast majority of nations past and present have been multilingual
power. he link established in this context served as a portent
communities including a number of those whose very entrance
of a more general connection that appeared later in Europe and
into history depended on an emphasis on their supposed mono-
beyond.
lingualism radically undermines the ideological case for lin-
Although his seminal account of nationalism identiies its ori-
guistic nationalism.
gins in the New World in the late eighteenth and early nineteenth
centuries, Benedict Anderson also discusses the appearance of a Tony Crowley
552
Natural Kind Terms
WORKS CITED AND SUGGESTIONS FOR FURTHER READING Various terms and phrases can be cited as examples of
Anderson, Benedict. 1991. Imagined Communities: Relections on the
non-natural kind terms. Student with a long nose who visited
Origin and Spread of Nationalism. London: Verso. Malaysia denotes a kind whose deining properties are not
Bakhtin, M. M. 1981. he Dialogic Imagination: Four Essays. Ed. Michael related together in any lawlike regularity, and is, therefore, of
Holquist, trans. Caryl Emerson and Michael Holquist. Austin: University no use for the understanding of nature. A term like nonhuman
of Texas Press. designates a group that is too heterogeneous. Another example
Barbour, Stephen, and Cathie Carmichael, eds. 2002. Language and often cited is that of artiicial kind terms, such as pencil or apart-
Nationalism in Europe. Oxford: Oxford University Press. ment. But this is perhaps problematic: It seems to presuppose
Fichte, J. G. [1808] 1968. Addresses to the German People. Ed. G. Armstrong that humans, with their artifacts, constitute a kingdom within
Kelly. New York: Harper. a kingdom. But if Homo sapiens sapiens is a natural kind, and
Herder, J. G. [1768] 2002. Philosophical Writings. Ed. and trans. Michael
as such part of nature, then terms useful for describing its life
N. Forster. Cambridge: Cambridge University Press.
and behavior for example, apartment should perhaps count
Hobsbawm, E. J. 1990. Nations and Nationalisms since 1780.
as NKTs.
Humboldt, William von. [1836] 1988. On Language: On the Diversity Recent philosophical discussion has concentrated on the
of Human Language Construction and Its Inluence on the Mental meaning of NKTs. Until the 1960s, philosophers spoke of these
Development of the Human Species. Ed. Michael Losonsky, trans. Peter terms as if they were synonymous with a group of identifying
Heath. Cambridge: Cambridge University Press. descriptions of the kinds. he statement that some liquid is water,
Irvine, Judith T., and Susan Gal. 2000. Language ideology and linguis- say, would then be synonymous with the statement that it has (at
tic diferentiation. In Regimes of Language: Ideologies, Polities and least most of) the properties that would be used, for instance, in a
Identities, ed. Paul Kroskrity, 3583. Oxford, UK: James Currey. good, scientiically informed dictionary to characterize water.
Joseph, John. 2004. Language and Identity: National, Ethnic, Religious.
his description theory of NKTs is problematic. According to
London: Palgrave.
it, if a scientist asks a child for a glass of water, what the scien-
Moryson, Fynes. [1617] 1903. Shakespeares Europe: Unpublished
tist means by water is very diferent from what the child means
Chapters of Fynes Morysons Itinerary. Ed. C. Hughes. London: Sherratt
and Hughes. by it, and the latter cannot even understand the former. But
Ngg Wa hiongo. 1986. Decolonising the Mind: he Politics of Language this is unacceptable, for luent communication is a criterion for
in African Literature. London: James Currey. understanding.
he Statutes at Large Passed in the Parliaments Held in Ireland. 1786 he most inluential theory of the meaning of NKTs nowa-
1801. 20 vols. Dublin. days, essentialism, was developed during the 1970s by Saul
Kripke (1980) and Hilary Putnam (1975). Both claimed that the
meaning of an NKT is determined not by descriptions but by
NATURAL KIND TERMS
ostensive reference to samples. Natural kinds are assumed to
Natural kind terms (NKTs) are, to use Platos ancient metaphor, have essential properties, and the NKT means something having
those terms that carve Nature at her joints; they are the terms the same essential properties as (most of) these samples, although,
that correspond to unities and diversities in nature (Phaedrus, as a rule, when introducing an NKT, people would be ignorant of
265e266b). hey therefore enable lawlike generalizations, these essential properties.
descriptions of natural patterns, and explanations of natural Kripke also claimed that NKTs are rigid, but this seems con-
phenomena. fused. First, an NKT say, tiger is not rigid in the sense of des-
From this characterization of NKTs it is clear that science ignating the same particulars in every possible world, since
strives to use such terms in its classiication and explanation in diferent possible worlds there exist diferent tigers. Secondly,
of nature. It is also clear that, as a rule, NKTs are developed it is not rigid in the sense that if it designates a particular in one
together with the growth of our knowledge of nature, and they possible world, it designates it in every possible world in which it
both result from a better understanding of phenomena and exists: he queen bee is presumably a natural kind, but whether
advance that understanding. For instance, the biblical classiica- larvae develop into queen bees depends on how they are fed. So
tion of plants into grass, the herb yielding seed and fruit tree an insect that is a queen bee might not have been one, and queen
yielding fruit whose seed is in itself (Genesis 1:11) is no longer bee designates it only in some of the possible worlds in which it
used in botany, which classiies some trees together with some exists. Lastly, and perhaps more importantly, if what was meant
grass as angiosperms, the lowering plants, in contrast to some in calling NKTs rigid is that they preserve their meaning across
other trees, which are gymnosperms. he same point is illus- possible worlds, then this is true of non-NKTs as well, such as
trated by the recent scientiic controversy over the deinition of student with a long nose who visited Malaysia, and it would
planet: Scientists aimed at forming a concept that would relect trivialize the meaning of rigidity (cf. Schwartz 2002).
and allow a better understanding of the diferent characteristics A hypothetical example supporting essentialism that many
and origins of bodies orbiting the sun. found convincing was developed by Putnam. He asks us to imag-
he most common examples of NKTs are names of sub- ine a remote planet identical to ours (Twin Earth), apart from
stances. Gold, water, alcohol, and metal are names of natural the fact that instead of water, that is, H2O, it contains a super-
kinds of matter; Homo sapiens sapiens, primates, mammals, ani- icially identical liquid of an entirely diferent composition, say
mals, and eukaryotes are names of natural kinds of organisms. XYZ. (Let us ignore the fact that such a liquid would not quench
But often enough one inds names of natural phenomena, such our thirst, and so wouldnt even be supericially like H2O.) Since
as heat or pain, counted among these terms as well. Twin Earths liquid is supericially indistinguishable from water,
553
Necessary and Suficient Conditions Negation and Negative Polarity
we would mistake it for water; but then, the argument continues, his debate can be understood as being about the necessary
because of its diferent essential properties (composition, in this and suicient conditions for the correct predication of Taitu-
case), it is not water. And thus we are supposed to conclude that as-afraid. While some claim that a particular coniguration of
the essential properties, even if unknown to us, determine the changes in the autonomic nervous system are both necessary
meaning of our NKTs. and suicient for predicating of Taitu that she is afraid, others
Nonetheless, this example is problematic. Ever since the respond that while such changes might be necessary conditions
composition of water was discovered, it is among waters known they do not amount to suicient conditions, for such changes can
and deining properties. Accordingly, after this discovery, we be brought about by the administration of drugs. hus, the truth
wouldnt consider XYZ water because we would know it isnt, and conditions for the predication of Taitu-as-afraid must involve
so this case does not support the claim that essential unknown more than the existence of these bodily changes, for example,
properties are sometimes involved in NKTs meanings. On the Taitu also holding the belief such that she is subject to a threat and
other hand, no case has been made for the claim that we would the desire to act in a way that diminishes or absents that threat.
have been mistaken if, before that discovery, we had considered Furthermore, some might (and do) hold that conditions such as
XYZ water. We would have been mistaken had we then claimed patterned changes in the autonomic nervous system might be
that Twin Earths liquid has the same unknown composition as suicient for a particular token, such as Taitu being afraid on
Earths water; but the moot point is whether the claim that it is this occasion, but not for all applications the term afraid. his
water would then have implicitly involved such an additional latter claim might then draw on Ludwig Wittgensteins discus-
claim. Examination of actual similar cases does not support sion of family resemblance and hold that all correct appli-
Putnams contention; moreover, essentialism has been shown to cations of a term are not necessarily conditional on something
be problematic in additional respects as well (cf. Ben-Yami 2001, being common to all.
and additional references there). So despite its current popular-
Phil Hutchinson
ity, essentialism remains far from established.
Hanoch Ben-Yami
NEGATION AND NEGATIVE POLARITY
Negation is a linguistic, cognitive, and intellectual phenomenon.
Ben-Yami, H. 2001. he semantics of kind terms. Philosophical Studies Ubiquitous and richly diverse in its manifestations, negation is
102: 15584.
fundamentally important to all human thought. As Laurence R.
Kripke, S. 1980. Naming and Necessity. Oxford: Blackwell.
Horn and Yasuhiko Kato put it:
Putnam, H. 1975. he meaning of meaning. In Mind, Language and
Reality: Philosophical Papers, II: 21571. Cambridge: Cambridge Negative utterances are a core feature of every system of human
University Press. communication and of no system of animal communication.
Schwartz, S. P. 2002. Kinds, general terms, and rigidity. Philosophical Negation and its correlates truth-values, false messages, contra-
Studies 109: 26577.
diction, and irony can thus be seen as deining characteristics of
the human species. (2000, 1)
NECESSARY AND SUFFICIENT CONDITIONS
Cognitively, negation is elementary of-line thinking; it involves
If we take two conditions, A and B, and we take A to be a nec- some comparison between a real situation lacking some partic-
essary and suicient condition (or set, thereof) for B, then con- ular element and an imaginal situation that does not lack it. he
dition B cannot hold in the absence of condition A. However, if particular element in focus anchors and contextualizes the nega-
A is merely a necessary condition (or set, thereof) for condition tive element (which, being constrained by grammar, frequently
B, then the presence of condition A does not entail condition B. doesnt provide enough information for a listener to determine
Further, if A is merely a suicient condition (or set, thereof) for B, what its focus is intended to be). here are many diferent con-
then it is possible that condition B holds in the absence of condi- versational and written strategies for indicating and interpreting
tion A. focus elements, and even more for modulating them.
In the philosophy of language, necessary and suicient con- Formally (see logic and language), a functor called by
ditions have been employed by some philosophers in response logicians negation is the only signiicant monadic functor; its
to the question: What are the conditions for the correct applica- behavior is described by the most basic axiom of logic, the Law
tion of a word? of Contradiction ( (pp), NKpNp, also known as he Law of
Take our employment of the words fear and afraid. To say of Non-Contradiction), which asserts that no proposition is both
a person that he or she is afraid, we want to know how we might true and not true. Pragmatically (see pragmatics), negation
judge whether that application is correct. We are here asking for provides, among many other concepts, the basic cancelation test
the truth conditions: hat is, what needs to hold such that our for presupposition, as well as the fundamental observations that
predication of Taitu-as-afraid is true? One answer might be that underlie theories of politeness.
Taitu must hold a belief such that she is subject to a threat and a In natural language, negation functions as an operator,
consequent desire to act in a way that will diminish that threat. along with quantiiers (see quantification ) and modals
Alternatively, one might argue that predicating of Taitu that she (see modality ); operators are more basic and have more
is afraid is conditional upon a particular sensation, or patterned properties than ordinary predicates or functors. In particu-
change in the autonomic nervous system, being elicited in Taitu. lar, operators have a scope; that is, there is always some other
554
Negation and Negative Polarity Network Theory
element either assumed or verbally present in the discourse Besides NPIs, English also has positive-polarity items
to which a negative, modal, or quantiier refers. hat linked ele- (would rather, sorta) that dont occur in negative polarity con-
ment is said to be the focus or to be in the scope of the negative texts; possible polarity items (tell time) that can occur only
(or modal; quantiiers are said to bind rather than focus on within the scope of a possible-type modal; and combinations,
another element). like the impossible polarity item fathom that require both nega-
Negation produces signiicant complexities and occasional tive scope and a modal.
ambiguities when it interacts with other scope operators, because
John M. Lawler
the scopes can get twisted about. Every boy didnt leave is ambig-
uous, depending on the relative scope of the negative didnt and WORKS CITED AND SUGGESTIONS FOR FURTHER READING
the quantiier every (rather like Every boy read some book, where
two diferent quantiiers produce ambiguity). Negation com- Atlas, Jay D. 1996. Only noun phrases, pseudo-negative generalized
quantiiers, negative polarity items, and monotonicity. Journal of
bines in idiosyncratic ways with modals; for example, in You
Semantics 13.4: 265328. Negative polarity iniltrates logic.
may not go, and thats inal! the deontic may not means not
Baker, C. L. 1970. Double negatives. Linguistic Inquiry 1: 16986.
possible, but in his may not be the place, the epistemic may Horn, Laurence R. 1969. A presuppositional analysis of only and even.
not means possibly not. Chicago Linguistics Society: CLS 5: 97108.
Every language develops its own idiomatic sets of negative . 1989. A Natural History of Negation. Chicago: University of
elements, and its own rules for using them. English negative phe- Chicago Press. Horns revision and extension of his 1972 dissertation.
nomena are by far the best studied; examples include syntactic he classical neo-Gricean analysis.
constructions (his is it, isnt it? Not any big ones, he didnt), vari- Horn, Laurence R., and Yasuhiko Kato. 2000. Introduction: Negation and
ation (so didnt I; aint got none), morphology (nt, -free, un-), polarity at the millennium. In Studies in Negation and Polarity, ed.
(morpho)phonology (do/dont), intonations (Riight), and lex- Laurence R. Horn and Yasuhiko Kato, 119. Oxford: Oxford University
emes sporting negation overt (never), incorporated (doubt, lack), Press. An excellent survey.
Israel, Michael. 2004. he pragmatics of polarity. In he Handbook of
calculated (few), entailed (prohibit), or presupposed (only).
Pragmatics, ed. L. Horn and G. Ward, 70123. Oxford: Blackwell.
Included also is a large, complex, and diverse system of nega-
Klima, Edward S. 1964. Negation in English. In he Structure of
tive polarity items (NPIs like ever in He didnt ever see it), which Language, ed. J. Fodor and J. Katz, 246323. Englewood Clifs,
felicitously occur only in the scope of some negative element NJ: Prentice Hall. he irst modern syntactic/semantic study.
(*He ever saw it). he details of what scope actually is, and of how Ladusaw, William A. 1980. Polarity Sensitivity as Inherent Scope Relations.
and which and why NPIs can occur within it, vary among speciic New York: Garland. he origins of the downward-entailment theory,
negative and NPI elements. using visual metaphors like scope and focus.
Negative polarity is a variety of negative concord (e.g., French Lakof, Robin. 1970. Some reasons why there cant be any some-any
Je ne regrette rien; literally I dont regret nothing; Yiddish Ix rule. Language 45: 60815.
hob nit kin gelt; literally I dont have no money), but instead Lawler, John M. 1974. Ample negatives. Chicago Linguistics Society: CLS
of negative concord, which uses negative elements in the focus 10: 35777. After Klima, negative polarity was developed extensively in
of another negative, negative polarity uses other, non-negative the generative semantics tradition (Horn, Lakof, Ross, Lawler,
and many others), largely using a negative polarity ield metaphor,
elements, which can sometimes pick up negativity by association
along with negative triggers and secondary triggering.
and occur without overt negative (could care less < couldnt care
Linebarger, Marcia. 1981. he grammar of negative polarity. Ph.D. diss.,
less). An interesting typological question is whether languages Massachusetts Institute of Technology. A then-orthodox generative
like English that lack signiicant negative concord develop more treatment, using rule-based metaphors like NPIs being licensed.
negative polarity phenomena to compensate. . 1991. Negative polarity as linguistic evidence. Papers from the
NPI is a term applied to lexical items, ixed phrases, or syn- Parasession on Negation. Chicago Linguistic Society: CLS 27: 16588.
tactic construction types that demonstrate unusual behavior McCawley, James D. 1993. Everything hat Linguists Have Always Wanted
around negation. NPIs might be words or phrases that occur to Know About Logic (But Were Ashamed to Ask). Chicago: University
only in negative-polarity contexts (fathom, in weeks) or have an of Chicago Press. Oxford: Blackwell. McCawleys modern generative
idiomatic sense in such contexts (not too bright, drink a drop); semantic analysis; for example, In natural language, negation is not
truth-functional.
or they might have a lexical afordance that only functions in
Ross, John R. 1973. Negginess. Paper delivered to the Winter Meeting of
such contexts (need/dare (not) reply); or a speciic syntactic rule
the Linguistic Society of America, San Diego.
might be sensitive to negation, like subject-verb inversion with
van der Wouden, Ton. 1996. Negative Contexts: Collocation, Polarity
adverb fronting in Never/*Ever/*Frequently have I seen such a and Multiple Negation. London and New York: Routledge. A useful
thing. University of Groningen dissertation.
he grammatical occurrence of NPIs in an utterance is prima Zeijlstra, Hedde, and Jan-Philipp Soehn, eds. 2007. Proceedings of the
facie evidence that it contains some sort of negation, and this Workshop on Negation and Polarity. Tbingen: University Collaborative
allows NPIs to function as indicators for various types of seman- Research Center. Recent evidence of polarity research expansion into
tic opposition and syntactic structure. his has turned out to be a other languages.
sensitive tool in other research areas of linguistics, and linguists
using NPIs have discovered many covert negative phenomena;
NETWORK THEORY
for instance, NPIs can also occur in questions (Have you ever
been there?), hypothetical clauses (Tell me if he ever arrives), and Network theory concerns itself with the study of elements, called
comparatives (Hes better than we ever expected). vertices (e.g., words), and their connections, called edges or links
555
Network Theory
Figure 1. A subset of a word association network appearing in

Steyvers and Tenebaum (2005, 50). Links go from the stimu-
lus to the response word. Reproduced by permission of the
Cognitive Society, Inc., copyright 2005.
(e.g., two words are connected if one word has been elicited by the small world phenomenon, high clustering, and heteroge-
the other in a word association experiment; see Figure 1). his neous degree distribution are common properties of linguistic
theory has many applications in language sciences and is the networks (Mehler 2008). Most models are based on the prefer-
outcome of intersecting work of mathematicians and physicists, ential attachment principle proposed by Albert Lszl Barabsi
who usually call it graph theory (Bollobs 1998) or complex and Rka Albert (1999): Vertices with many connections are
network theory (Newman 2003), respectively. more likely to become more connected in the future than those
One of the major contributions of physicists has been to with few connections (Steyvers and Tenenbaum 2001, 2005;
unravel the statistical properties of real networks (Newman Dorogovtsev and Mendes 2001; Motter et al. 2002).
2003), for example, the World Wide Web or protein interaction he challenges of the application of network theory are to
networks. Firstly, physicists discovered that practically all real explain the properties of these networks (most studies are merely
networks exhibited the small world phenomenon. he term descriptive); to incorporate deeper statistical techniques, for
small world comes from the observation that everyone in the example, degree correlation analysis (Serrano et al. 2007); and
world can be reached through a short chain of social acquain- to extend the studies to more languages (most studies are in
tances, although the number of people in the whole social net- English). For these reasons, it is too early to argue that the het-
work is huge. In the word association network, partially shown erogeneous degree distributions and other statistical patterns
in Figure 1, volcano is reached from ache through a chain of at constitute laws of language in the sense of absolute and
least four links, while only one link separates ire from volcano. statistical universals. When applied to syntactic networks,
Secondly, physicists found that many real networks had a het- network theory has helped to explain the origins of the properties
erogeneous degree distribution. Loosely speaking, this property of the syntactic dependency structure of sentences, for example,
means that there are vertices (words) with a disproportionately the exceptionality of syntactic dependency crossings (Ferrer
large number of connections (the so-called hubs). For instance, i Cancho 2006) and has provided new tracks for understand-
in the network partially shown in Figure 1, the ive words with the ing syntax at the large scale of syntactic organization (Ferrer i
highest degrees are food, money, water, car, and good (Steyvers Cancho, Sol, and Khler 2004), above the traditional sentence
and Tenenbaum 2005). Finally, another fundamental property of level (see syntax, universals of).
real networks is clustering; that is, roughly speaking, if two verti- In their pioneering application of network theory, Mark
ces are connected to the same vertex they are likely to be directly Steyvers and Joshua B. Tenenbaum (2001, 2005) studied the
connected as well. large-scale organization of various kinds of semantic networks
Network theory has contributed to the study of language in (e.g., word association networks, as in Figure 1) and proposed
three ways: a) by characterizing the statistical properties of lin- a simple model for explaining the small-worldness, high clus-
guistic networks, such as networks of word association (Steyvers tering and a heterogeneous degree distribution of semantic
and Tenenbaum 2005), thesauri (Sigman and Cecchi 2002), networks. Over time, new vertices (e.g. words) are added and
and syntactic dependencies (Ferrer i Cancho, Sol, and Khler attached to existing vertices according to two principles: a)
2004); b) by modeling the properties of these networks (Steyvers Barabsi and Alberts preferential attachment and b) diferentia-
and Tenenbaum 2005; Motter et al. 2002); and c) by proposing tion. Diferentiation means that a new vertex tends to mimic the
abstract models that provide a further understanding of the fac- connectivity pattern of an existing vertex.
ulty of language (Ferrer i Cancho, Riordan, and Bollobs 2005). Network theory has shed new light on the evolution of lan-
Although the systematic application of network theory to language by deining the necessary conditions for the existence of
guage is a young ield (starting in the early twenty-irst century) language (e.g., word ambiguity) and also by suggesting the
within quantitative linguistics, it can be concluded that possibility that language could have appeared for free as a side
556
Neurochemistry and Language
efect of communication principles (Ferrer i Cancho, Riordan, Design complexity is one of the hallmarks of adaptive function
and Bollobs 2005). and, thus, inquiry into the brain systems that support the lan-
guage faculty is mandatory if one wishes to understand the evo-
Ramon Ferrer i Cancho
lutionary history and potential adaptive functions of language.
Neurochemical, neurophysiological, and neuroanatomical
studies deine a widely distributed neural network that supports
Bibliography on linguistic and cognitive networks. Available speech and language functions. his network includes the motor
online at: http://www.lsi.upc.edu/~rferrericancho/linguistic_and_ and supplementary motor area (SMA) of the prefrontal lobes;
cognitive_networks.html. brocas area in the dorsal prefrontal region; wernickes
Bollobs, Bla. 1998. Modern Graph heory. New York: Springer. A help-
area in the medial temporal lobe; the anterior cingulate
ful introduction to graph theory.
gyrus and the subcortical basal ganglia; and the periaqeduc-
Barabsi, Albert Lszl, and Rka Albert. 1999. Emergence of scaling in
random networks Science 286: 50912.
tal gray matter (PAG). he anterior cingulate gyrus sends efer-
Dorogovtsev, Sergey, and Jos Fernando Mendes. 2001. Language as an ents directly onto the PAG central gray and appears to inluence
evolving word web. Proceedings of the Royal Society of London Series the initiation and voluntary control of vocalization. Destruction
B, Biological Sciences 268: 26036. of the central gray substance at the subcortical level or the SMA at
Ferrer i Cancho, Ramon. 2006. Why do syntactic links not cross? the cortical level can cause mutism. Patients with bilateral lesions
Europhysics Letters 76: 122834. within the cingulate area often undergo a period of mutism, fol-
Ferrer i Cancho, Ramon, Oliver Riordan, and Bla Bollobs. 2005. lowed by slow recovery during which speech is aprosodic and
he consequences of Zipfs law for syntax and symbolic reference. initiation of speech is rare. he anterior cingulate gyrus receives
Proceedings of the Royal Society of London Series B 272: 5615. eferents from the dopamine-rich supplementary motor area in
Ferrer i Cancho, Ramon, Ricard V. Sol, and Reinhard Khler. 2004.
the cortex and sends aferents, along with other dopaminergic
Patterns in syntactic dependency networks. Physical Review E
ibers coming from the basal ganglia, up and into the prefrontal
69: 051915.
Mehler, Alexander. 2008. Large text networks as an object of corpus lin-
regions. hus, all of these language-related areas are intercon-
guistic studies. In Corpus Linguistics: An International Handbook of nected via dopaminergic ibers and the prefrontal cortex.
the Science of Language and Society, ed. Anke Ldeling and Merja Kyt, he prefrontal cortex (PFC) constitutes approximately one-
32882. Berlin and New York: de Gruyter. third of the human cortex and is the last part of the human
Motter, Adilson E., Alessandro P. S. de Moura, Ying-Cheng Lai, and brain to become fully myelineated in ontogeny, with maturation
Partha Dasgupta. 2002. Topology of the conceptual network of lan- occurring in late childhood/early adolescence (Huttenlocher
guage. Physical Review E 65: 065102. and Dabholkar 1997). he PFC receives projections from the
Newman, Mark. 2003. he structure and function of complex networks. mediodorsal nucleus and encompasses primary motor cortex,
SIAM Review 45.2: 167256. A helpful introduction to complex network as well as premotor, supplementary motor, and the dorsal and
theory.
orbital sectors of the prefrontal (proper) lobes. All of these PFC
Serrano, Mari ngeles, Marian Bogu, Romualdo Pastor-Satorras,
areas are addressed by mesocortical dopaminergic projections
Alessandro Vespignani. 2007. Correlations in complex networks.
In Large Scale Structure and Dynamics of Complex Networks: From
and play a role in language functions.
Information Technology to Finance and Natural Science, ed. Guido Dopamine (DA) is manufactured in the pigmented neurons of
Caldarelli and Alessandro Vespignani, 3565. Singapore: World the substantia nigra (SN) and the ventral tegmental area (VTA).
Scientiic. here are three major ascending dopaminergic systems: the stri-
Sigman, Mariano, and Guillermo A. Cecchi. 2002. Global organization of ato-nigral tract, which ascends from the SN to the corpus stria-
the Wordnet lexicon. Proceedings of the National Academy of Sciences tum; the mesolimbic system, which ascends from the SN and
USA 99: 17427. medial VTA to limbic sites, including the cingulate gyrus; and the
Steyvers, Mark, and Joshua B. Tenenbaum. 2001. he large-scale struc- mesocortical system, which ascends from the anteromedial teg-
ture of semantic networks: Statistical analyses and a model of seman-
mentum and VTA to neocortical sites, including supplementary
tic growth. Available online at: http://arxiv.org/ftp/cond-mat/
motor area and prefrontal cortex (Nieoullon 2002; Girault and
papers/0110/0110012.pdf. his is the irst version of their 2005 journal
Greengard 2004). Important language regions are linked directly
article.
. 2005. he large-scale structure of semantic networks: Statistical to these dopaminergic frontal lobe structures. Brocas area, for
analyses and a model for semantic growth. Cognitive Science example, is in the frontal lobes, as is the SMA. Posterior
29.1: 4178. language sites, such as Wernickes area, the angular gyrus, and
the inferior and superior parietal lobules, are densely intercon-
nected via the superior and inferior longitudinal fasiculi, with
NEUROCHEMISTRY AND LANGUAGE
meso-prefrontal dopaminergic systems.
Why study a potential neurochemistry of language? here are Language-related semantic and working memory net-
two reasons, one practical, the other theoretical. First, pharma- works in humans can be modulated by dopaminergic stimulation
cologic treatments of various speech and language disorders (Williams and Goldman-Rakic 1995; Kischka et al. 1996; Luciana
depend on, and will be enhanced by, an understanding of how and Collins 1997; Jay 2003; Angwin et al. 2004). Dopaminergic
selective neurochemical networks facilitate, inhibit, or mediate activation may even support key components of the sentence
language functions. Second, understanding the adaptive func- comprehension system in patients with Parkinsons disease
tion of a given trait requires detailed knowledge of the design (PD) (Grossman et al. 2001). Dopaminergic agents may be efect-
speciications that mediate or implement the trait in question. ive treatment for nonluent aphasia. M. Albert and colleagues
557
Neurochemistry and Language
(1988), using an on/of design, reported improved luency (i.e., dopamine transporter reuptake, as in the striatum) is largely
and naming scores in a patient with nonluent aphasia treated nonexistent. In humans, the COMT gene contains a highly func-
with bromocriptine (a drug that stimulates selected dopamine tional and common variation in its coding sequence that appears
receptors in the brain). Fluency and naming scores returned to to be a unique human mutation because it has not been found
baseline after the drug was discontinued. S. R. Gupta and A. G. in great apes. his uniquely human change in dopaminergic
Mlcoch (1992) replicated the efect of improved luency scores functional capacity in the prefrontal cortex suggests that it may
after bromocriptine in two aphasic patients, but L. Sabe and have been a factor in the evolution of the human prefrontal cor-
colleagues (1995) and D. L. MacLennan and colleagues (1991) tex and thereby of human speech and language functions more
could not document any improvement in speech and language generally.
scores in nonluent aphasics who were treated with bromocrip- An inherited deicit in spoken grammatical language among
tine late in the recovery process. Y. Tanaka and D. L. Bachman several members of a family (family KE) in England has been
(2000) conducted a double-blind, crossover study with bromo- associated with a mutation in the forkhead box P2 (FOXP2) gene
criptine. hey administered the drug (57.5 mg/day for four on chromosome 7 (see genes and language). Persons with
weeks) to 10 patients with a Broca-type aphasia. Statistically sig- the FOXP2 mutation evidence underactivity in dopaminergic
niicant improvement (pre- to posttreatment) on naming and neural networks linking subcortical striatal networks with pre-
luency scores was documented in the mild aphasics, but not frontal cortical sites, including Brocas area during word-genera-
in the severely impaired aphasics. M. Bragoni and colleagues tion tasks. FOXP2 is subject to the efects of genomic imprinting,
(2000) used a double-blind, placebo-controlled design focused with relatively high expression from the paternal chromosome
only on chronic nonluent aphasics at a dosage of 30 mg/day, (Feuk et al. 2006). Such a pattern of gene expression evolves
with participants maintained at that dosage for three months. in the context of evolutionary conlict due to paternity uncer-
While signiicant gains in verbal luency were evidenced with tainty in polygynous mating systems, as is the case with most
bromocriptine, these indings are based on the performance of mammals including humans. Genetic conlict occurs between
only ive participants. Consistent, however, with the claim of asymmetrically related kin (i.e., between mothers and ofspring,
positive dopaminergic efects on luency is the fact that the dopa- and between siblings in the context of paternity uncertainty),
minergic drug levodopa (LD) has also demonstrated beneicial with genes that are paternally expressed in ofspring promoting
efects on speech luency in midstage patients with Parkinsons behaviors in ofspring that are designed to monopolize resources
disease (McNamara and Durso 2000). S. Knecht and colleagues from the mother and exclude resources going to siblings. he
(2004) showed that healthy volunteers given 100 mg of LD per FOXP2-related defect implies that some aspects of spoken lan-
day exhibited more rapid and more accurate learning of verbal- guage may have evolved under pressures of genetic conlict.
visual associations than a group of controls given a placebo. his conlict view of the evolution of language is consistent
Learning efects in this carefully controlled study could not be with recent indings linking handedness and cognitive/language
attributed to changes in arousal, autonomic function, motor deicits of schizophrenics (another disorder involving dopamin-
response times, afect, or response biases. ergic dysfunction) to the parent of origin efects (Francks et al.
Acetylcholine, one of the neurotransmitters that interacts 2003), as well as other evidence identifying potential imprinting
with dopaminergic systems at the level of the cortex, has also efects on genes that regulate dopaminergic systems of the lan-
been implicated in language functions. Tanaka, M. Miyazaki, guage-related areas of the prefrontal cortex. In short, investiga-
and Albert (1997) documented naming and comprehension tion of dopaminergic inluences on language functions leads us
improvement in luent aphasics using the cholinergic agent bife- into two seemingly disparate realms of inquiry: 1) the develop-
melane. hey built on the work of L. Moscowitch, P. McNamara, ment of rational pharmacotherapeutic strategies for treatment
and Albert (1991) who reported that an anticholinesterase agent of language disorders (e.g., dopaminergic drugs for luency
(which boosts cholinergic activity) improved language perform- disorders), and 2) reconstruction of the evolutionary conlicts
ance in eight luent semantic aphasics. Albert (2000) provides that led to the emergence of speech and language functions
an in-depth discussion of both dopaminergic inluences on themselves.
nonluent aphasia and cholinergic inluences on luent aphasia.
Patrick McNamara
His conclusion is that there appears to be a strong and consist-
ent efect of dopaminergic agents on verbal luency and mild
efects of cholinergic agents on naming and semantic memory.
Dopaminergic efects, however, are better documented than Albert, M. 2000. Towards a neurochemistry of naming and anomia. In
cholinergic efects. Language and the Brain, ed. Y. Grodzinsky, L. Shapiro, and D. Swinney,
One of the major regulatory genes that control dopamines 15765. San Diego, CA: Academic Press.
Albert, M., D. L. Bachman, A. Morgan, and N. Helm-Estabrooks. 1988.
metabolic pathways in the prefrontal cortex is the gene that
Pharmacotherapy for aphasia. Neurology 38: 8779.
codes for the enzyme catechol-O-methyltransferase (COMT).
Angwin, A. J., H. J. Chenery, D. A. Copland, W. L. Arnott, B. E. Murdoch,
Signiicant associations between COMT variations with varia- and P. A. Silburn. 2004. Dopamine and semantic activation: An
tions in prefrontal cognitive function have been identiied (Egan investigation of masked direct and indirect priming. Journal of the
et al. 2001; Joober et al. 2002). Studies in rats, knockout mice, International Neuropsychological Society 10.1: 1525.
and monkeys suggest that COMT is of particular importance Bannon, M. J., E. B. Bunney, and R. H. Roth. 1981. Mesocortical dopa-
with respect to intrasynaptic dopamine regulation in the pre- mine neurons: Rapid transmitter turnover compared to other brain
frontal cortex, where an alternative route of dopamine removal catecholamine systems. Brain Research 218.1: 37682.
558
Neurochemistry and Language Neuroimaging
Bragoni, M., M. Altieri, V. Di Piero, A. Padovani, C. Mostardini, and Nieoullon, A. 2002. Dopamine and the regulation of cognition and atten-
G. L. Lenzi. 2000. Bromocriptine and speech therapy in non-luent tion. Progress in Neurobiology 67: 5283.
chronic aphasia after stroke. Neuroscience 21.1: 1922. Sabe, L., F. Salvarezza, A. Garcia Cuerva, R. Leiguarda, and S. Starkstein.
Egan, M. F., T. E. Goldberg, B. S. Kolachana, J. H. Callicott, 1995. A randomized, double-blind, placebo-controlled study of bro-
C. M. Mazzanti, R. E. Straub, D. Goldman,and D. R. Weinberger. 2001. mocriptine in nonluent aphasia. Neurology 45: 22724.
Efect of COMT Val108/158 Met genotype on frontal lobe function Tanaka, Y., and D. L. Bachman. 2000. Pharmacotherapy of Aphasia. In
and risk for schizophrenia. Proceedings of the National Academy of Neurobehavior of Language and Cognition: Studies of Normal Aging
Sciences USA 98: 691722. and Brain Damage, ed. and M. Albert, L. Connor, and L. Obler, 15978.
Feuk, L., A. Kalervo, M. Lipsanen-Nyman, J. Skaug, K. Nakabayashi, Boston: Kluwer Academic.
B. Finucane, D. Hartung, M. Innes, B. Kerem, M. J. Nowaczyk, Tanaka, Y., M. Miyazaki, and M. Albert. 1997. Efects of cholinergic activ-
J. Rivlin, W. Roberts, L. Senman, A. Summers, P. Szatmari, V. Wong, ity on naming in aphasia. Lancet 350: 11617.
J. B. Vincent, S. Zeesman, L. R. Osborne, J. O. Cardy, J. Kere, S. W. hierry, A. M., J. P. Tassin, G. Blanc, L. Stinus, B. Scatton, J. Glowinski.
Scherer, and K. Hannula-Jouppi. 2006. Absence of a paternally 1977. Discovery of the mesocortical dopaminergic system: Some
inherited FOXP2 gene in developmental verbal dyspraxia. American pharmacological and functional characteristics. Advances in
Journal of Human Genetics 79: 96572. Biochemistry and Psychopharmacology 16: 512.
Francks, C., L. E. DeLisi, S. H. Shaw, S. E. Fisher, A. J. Richardson, Williams, G. V., and P. S. Goldman-Rakic. 1995. Modulation of mem-
J. F. Stein, and A. P. Monaco. 2003. Parent-of-origin efects on hand- ory ields by dopamine D1 receptors in prefrontal cortex. Nature
edness and schizophrenia susceptibility on chromosome 2p12-q11. 376: 5725.
Human Molecular Genetics 12: 322530.
Gardner, E. L., and C. R. Ashby, Jr. 2000. Hetereogeneity of the meso-
telencephalic dopamine ibers: Physiology and pharmacology. NEUROIMAGING
Neuroscience and Biobehavioral Reviews 24: 11528.
Neuroimaging technologies provide a major source of new data
Girault, J. A., and P. Greengard. 2004. he neurobiology of dopamine
about how the language system is organized in the brain. In par-
signaling. Archives of Neurology 61: 6414.
ticular, activation imaging approaches, in which brain activity is
Greener, J., P. Enderby, and R. Whurr. 2002. Pharmacological treatment
for aphasia following stroke. Cochrane Database Systematic Reviews monitored while subjects perform some language task, allow us
4: CD000424. to visualize various aspects of language processing in the normal
Gupta, S. R., and A. G. Mlcoch. 1992. Bromocriptine treatment of non- brain and test hypotheses about component language functions
luent aphasia. Archives of Physical Medicine and Rehabilitation or systems. Among the imaging techniques most commonly in
73.4: 3736. use for understanding language in the brain are structural mag-
Grossman, M., G. Glosser, J. Kalmanson, J. M. Morris, M. B. Stern, netic resonance imaging (MRI), functional MRI (fMRI), positron
H. I. Hurtig. 2001. Dopamine supports sentence comprehen- emission tomography (PET), electroencephalography (EEG),
sion in Parkinsons Disease. Journal of the Neurological Sciences, and magnetoencephalography (MEG).
184.2: 12330.
Huttenlocher, P. R., and A. S. Dabholkar. 1997. Regional diferences in
Structural Magnetic Resonance Imaging
synaptogenesis in human cerebral cortex. Journal of Comparative
he lesion deicit model, where one deduces the function of
Neurology 387.2: 16778.
Jay, T. M. 2003. Dopamine: A potential substrate for synaptic plasticity a brain region by observing what it cannot do when damaged,
and memory mechanisms. Progress in Neurobiology 69: 37590. marks the basis of our understanding of language organization
Joober, R., J. Zarate, G. Rouleau, E. Skamene, and P. Boksa. 2002. in the brain, originating with Paul Brocas work. Whereas Broca
Provisional mapping of quantitative trait loci modulating the acous- had to wait until his patients death to determine where the
tic startle response and prepulse inhibition of acoustic startle. lesion was located, structural or conventional MRI scanning
Neuropsychopharmacology 27: 76581. allows lesion-behavior correlation in vivo.
Kischka, U., T. Kammer, S. Maier, M. Weisbrod, M. himm, and he MRI scanner is essentially composed of a large, high ield
M. Spitzer. 1996. Dopaminergic modulation of semantic network acti- magnet that delivers magnetic pulses and records small changes
vation. Neuropsychologia 34: 110713.
in the magnetized atoms in your brain or body. hese signals are
Knecht, S., C. Breitenstein, S. Bushuven, S. Wailke, S. Kamping, A. Floel,
picked up by an antenna and, through several transformations, are
P. Zwitserlood, and B. Ringelstein. 2004. Levodopa: Faster and better
translated into pictures of the brain. By altering the direction, fre-
word learning in normal humans. Annals of Neurology 56.1: 206.
Luciana, M., and P. Collins. 1997. Dopaminergic modulation of working quency, and readout times of these magnetic perturbations, difer-
memory for spatial but not object cues in normal humans. Journal of ent MRI pulse sequences produce variations in the signals generated
Cognitive Neuroscience 9: 33047. by each tissue type. he diference between these signals is referred
MacLennan, D. L., L. E. Nicholas, G. K. Morley, and R. H. Brookshire. to as contrast, and using these variations, the radiologist can deter-
1991. he efects of bromocriptine on speech and language function mine what is normal brain tissue, what looks like a clot of blood or a
in a man with transcortical motor aphasia. In Clinical Aphasiology, fatty tumor, what tissue has had a disruption in the normal difusion
ed. T. E. Prescott, 14555. Boston: College Hill. of water molecules, and so on. MRI has excellent spatial resolution,
McNamara, P., and R. Durso. 2000. Language functions in Parkinsons which refers to the precision with which one can see details. Typical
disease: Evidence for neurochemistry of language. In Neurobehavior
MRI scans resolve 1 mm; thus, it is easy to locate even small brain
of Language and Cognition: Studies of Normal Aging and Brain
lesions that might explain a particular abnormality.
Damage, ed. L. Obler and L. T. Conner, 20112. New York: Kluwer
Academic.
In the past few years, the sophistication of structural MRI
Moscowitch, L., P. McNamara, and M. L. Albert. 1991. Neurochemical techniques has increased markedly, ofering new approaches
correlates of aphasia. Neurology 41 (Supplement 1): 410. for identifying structure-function correlations in the language
559
Neuroimaging
Figure 1. Top left: Original drawing by Broca.

Bottom left: preserved whole brain of Brocas
patient. Right: axial MRI slice through Brocas area
showing damage to insula, striatum, and underlying
white matter (Dronkers et al. 2007).
system. One approach involves warping scans from diferent

of tractography, where the WM tracts in and out of a language
patients together, showing areas, for example, where lesions
area have been mapped. his approach can identify inputs and
overlap among patients with the same language impairment. An
outputs to language regions, indicating possible mechanisms for
illustrative study comes from Bates et al. (2003). Here, the authors
distal efects of local lesions through disruption on the connect-
used a technique called voxel based morphometry. For each voxel
ing pathways.
(a three-dimensional pixel) in the brain, a t-test compares the
extent to which patients with a lesion encompassing that voxel Activation Imaging and Neurovascular Coupling
difer signiicantly from subjects whose lesions do not encom- Several brain imaging technologies take advantage of neuro-
pass that area on a language task. E. Bates and colleagues (2003) vascular coupling in identifying brain regions associated with
correlated lesion location with verbal luency (Color Plate 5). language performance. Neurovascular coupling refers to the
he areas of the brain showing the most signiicant diferences fact that when neurons increase their iring rate (because that
between groups are depicted in red, indicate the brain area that brain area is working harder), blood low increases to that
is most likely responsible for the deicit observed. Contrary to region. Typically, the correlation between blood low and neu-
Brocas report, reduced verbal luency was associated not pri- ronal activity is extremely high, although the blood low increase
marily with brocas area lesions but, rather, with lesions of is delayed in onset by several seconds and falls of gradually in
the insula and underlying white matter. Indeed, a recent MRI comparison to neuronal activity (Buxton et al. 2004). In several
study of Brocas patient conirmed involvement of these struc- clinical conditions it may be decoupled, such as in acute stroke.
tures (see Figure 1). Both PET and fMRI take advantage of neurovascular coupling to
Another approach in MRI analysis compares aspects of brain identify brain activity.
structure, such as gray matter thickness or sulcal position, to per-
formance on language tasks using voxel-based correlations. For POSITRON EMISSION TOMOGRAPHY. PET scanning is an imag-
instance, L. Lu and colleagues (2007) examined the relationship ing tool in which a radioactively labeled compound is injected
between the thickness of the cortex in the left inferior frontal into the body and taken up in the brain. Compounds such as
region and ability on a phonological processing task in children. glucose or water are labeled with positrons that rapidly decay;
he development of gray matter changes in this area correlated during the decay process the positrons hit other nuclei and are
with improving scores on phonology tasks, indicating a dynamic annihilated, which causes the emission of two photons that
relationship between emerging brain growth and language shoot of in opposite directions simultaneously. he PET scan-
development (Color Plate 6). ner is composed of a ring of detectors that detect these simul-
A third structural MRI approach examines the integrity of the taneous photons, and software reconstructs their originating
white matter underlying the cortical ribbon. Difusion tensor positions, revealing where the compound traveled. he resulting
imaging is an MRI approach that measures the difusion of water PET image is a blurry picture showing the amount of radioactive
molecules in the brain. White matter (WM) ibers tend to be bun- substance reaching every pixel in the brain. Diferent radioactive
dled together, lined up in parallel sheaths. Because water is more compounds measure various brain processes, each with a char-
likely to difuse in parallel to these white matter tracts as opposed acteristic half-life. For instance, 18-luorodeoxyglucose (18FDG)
to crossing them on the perpendicular, imaging techniques that has a half-life of about 45 minutes and measures glucose metab-
track difusion will tend to emphasize the direction of these iber olism. More useful for language research is the compound H215O,
tracts. Image-processing techniques can identify the unifor- radioactive water. Because of its short half-life (about 2 minutes),
mity of these iber directions in each voxel in the brain, indicat- H215O scans can be repeated after a delay of 1012 minutes, up to
ing whether the WM is intact. Color Plate 7 shows an example between 6 and 10 scans. For this reason, H215O has been used for
560
Neuroimaging
language activation studies, where subjects might receive sev- processes of nearly any complexity, resulting in many important
eral injections while performing one or more language tasks and indings. For instance, in the reading system, years of controversy
during control tasks (Color Plate 8). about whether reading involves a single system versus parallel
One of the irst PET studies to visualize language areas in vivo systems have been largely resolved. he physical presence of two
was that of S. E. Petersen and colleagues (1988). In this study, anatomically distinct pathways diferentially engaged by difer-
normal volunteers performed a series of language tasks ordered ent reading demands and subject groups lends strong credence
hierarchically: viewing a crosshair on a screen; seeing printed to the dual-route hypothesis (Pugh et al. 1996). Another interest-
words on a screen or hearing words over headphones, reading or ing set of indings explores the role of the right hemisphere
repeating heard words, or generating an action verb correspond- in many aspects of language processing, including afective
ing to a visually or auditorally presented noun. By subtracting prosody, metaphor analysis, and contextual processing (see
lower-level tasks from higher-order language tasks, the authors Bookheimer 2002 for a review). Within the frontal lobe there
isolated areas of the brain involved in word generation, while appear to be separate regions for processing aspects of expres-
removing unwanted efects of sensory stimulation. his sub- sive language, including phonological processing, syntax, and
tractive logic forms the basis of activation imaging experiments. semantic integration. hus, fMRI appears to have resolved an
While there are theoretical diiculties with assumptions of hier- ongoing debate over whether part of Brocas area is specialized
archical organization, cognitive subtraction models remain the for syntax, as opposed to secondary to increased working mem-
mainstay of activation imaging research. ory demands of complex syntactic structures, with a recent study
An important disadvantage of PET is the need to expose sub- indicating syntax speciicity in this region (Santi and Grodzinsky,
jects to radioactivity; a second disadvantage is that subjects must 2007). In general, language research from fMRI indicates a level
perform the same task for several minutes continuously to obtain of organization that is far more complex, detailed, and speciic
a single brain image. Also, PET is a relatively noisy methodol- than envisioned on the basis of lesion-deicit studies. Color Plate
ogy, and the signal-to-noise ratio is low enough that scans must 9 shows an fMRI exam during a series of language tasks. Even
be averaged over a group of subjects. Finally, the spatial resolu- within a single subject, clear evidence for at least nine diferent
tion of PET is low, usually about 6 mm, so that individual brain brain regions contributing to language can be observed.
structures cannot be resolved and the areas of signiicant acti-
vation are not easily localized to a speciic brain structure. Most EEG and MEG. While fMRI ofers a tremendous advantage in
investigators solve this problem by performing an MRI scan for both spatial and temporal resolution over PET, the fMRI response
each subject, mathematically moving or registering the brain is extremely sluggish in comparison to neural activity measured
images so they are in the same space, then overlaying the PET directly. Two other technologies ofer vastly improved temporal
activation regions onto the corresponding MRI scans to localize resolution. Electroencephalography measures the combined
the regions of activity. electrical activity of a wide area of the brain. Because electrical
activity directly measures neural iring, EEG runs very close to
FUNCTIONAL MRI. In the early 1990s, two groups independently real-time activity of neurons. It also has several major disadvan-
discovered that blood low increases during neural activity tages. he spatial resolution is poor: Electrical signals represent
could be measured directly with MRI (Kwong et al. 1992; Ogawa an average over many centimeters of activity, and results are gen-
et al. 1992). his is due to the accident that oxygenated blood and erally conined to entire lobes. Further, the signals come mostly
deoxygenated blood have slightly diferent magnetic properties. from surface brain structures. Finally, data must be averaged over
During increased brain activity, an increase in blood low is not many trials to yield an averaged electrical response to a class of
matched by an increase in oxygen consumption; consequently, stimuli. Nonetheless, EEG is the method of choice for high tem-
more oxygenated blood spills over to the venous side of the cap- poral resolution work, and for young children or infants.
illary bed. Scans of the brain taken during this state of increased Years of EEG research reveal expected patterns of electri-
oxyhemoglobin concentration have slightly higher MRI signals cal responses to certain classes of stimuli. Averaged electrical
than those taken in the resting state. hus, fMRI measures this responses to a class of stimuli are referred to as event- related
blood-oxygen-level dependent, or BOLD, signal when compar- potentials or ERPs. An example is the N400 response, meaning
ing scans taken in diferent cognitive states. hanks to the discov- a negatively directed signal occurring 400 milliseconds after
ery of ultrafast MRI scanning, typically using an approach called the stimulus. he N400 is found when the subject experiences
echo-planar imaging or EPI, fMRI takes a complete picture of the an anomalous or unexpected event. For instance, A. Hahne,
brain as quickly as once per second. By taking complete brain K. Eckstein, and A. Friederici (2004) examined ERPs in response
volumes every few seconds over a period of several minutes or to semantic and syntactic violations embedded within sen-
more, fMRI can track those magnetic changes that correlate with tences. When there was a syntactic violation, the authors found a
blood low during experimental and control tasks that can be component in anterior brain regions, termed the ELAN (early left
varied with tremendous experimental complexity. Because fMRI anterior negativity). his was followed by a late positive response
has signiicantly greater spatial and temporal resolution than (P600). Semantic violations produced the N400 response. In a
PET, it is often possible to see signiicant language-related brain second experiment, instructions to ignore the syntactic viola-
activity within a single individual in a matter of minutes. tions and focus on the semantic task could not override the brain
he study of language organization in the brain has been response to syntactic violations. hese data indicate that syn-
revolutionized by fMRI. Because these studies are relatively easy tactic processing is mandatory and not under efortful control.
and inexpensive to conduct, it is possible to examine language While EEG cannot locate the brain structures that generate these
561
Neuroimaging Number
signals, the high temporal resolution of EEG makes it possible imaging of human brain activity during primary sensory stimulation.
to analyze on-line processing of language, addressing questions Proceedings of the National Academy of Science 89.12: 56759.
that difer fundamentally from those tapped by fMRI. Lee, A., V. Kannan, and A. E. Hillis. 2006. he contribution of neuroim-
Magnetoencephalography bears many similarities to EEG aging to the study of language and aphasia. Neuropsychology Reviews
16.4: 17183.
while ofering improved spatial resolution. MEG measures mag-
Lu, L., C. Leonard, P. hompson, E. Kan, J. Jolley, S. Welcome, A. Toga,
netic moments created by electrical activity in the brain that are
and E. Sowell. 2007. Normal developmental changes in inferior fron-
picked up by a large magneto-detector sensitive to very small tal gray matter are associated with improvement in phonological pro-
changes in magnetic ields. hese ields are closely related to neu- cessing: A longitudinal MRI analysis. Cerebral Cortex 17.5: 10929
ral activity and can be tracked over temporal intervals efectively in Ogawa, S., D. W. Tank, R. Menon, J. M. Ellermann, S. G. Kim, H. Merkle, and
real time. Localizing the source of MEG signals remains a signii- K. Ugurbil. 1992. Intrinsic signal changes accompanying sensory stimu-
cant diiculty with this technique, and sources generated close to lation: Functional brain mapping with magnetic resonance imaging.
the brain surface are easier to detect than those from deep struc- Proceedings of the National Academy of Science 89.13: 59515.
tures. Nonetheless, source localization appears to be far more Petersen, S. E., P. T. Fox, M. Mintun, and M. E. Raichle. 1988. Positron
precise than with EEG, while temporal resolution is equivalent. emission tomographic studies of the cortical anatomy of single-word
MEG is more technically challenging than EEG and fMRI and is processing. Nature 331.6157: 5859.
Pugh, K. R., B. A. Shaywitz, S. E. Shaywitz, R. T. Constable, P. Skudlarski,
also less widely available. Nonetheless, signiicant contributions
R. K. Fulbright, R. A. Bronen, D. P. Shankweiler, L. Katz, J. M. Fletcher,
to the ield of language continue to emerge from MEG studies.
J. C. Gore. 1996. Cerebral organization of component processes in
For example, MEG studies have indicated that letter-string rec- reading. Brain. 119.4: 122138.
ognition occurs 150 msec after presentation of a word, whereas Salmelin, R., P. Helenius, and E. Service. 2000. Neurophysiology of lu-
children with dyslexia show a weaker response (Tarkiainen et al. ent and impaired reading: A magnetoencephalographic approach.
1999). A corresponding delayed response in superior temporal Journal of Clinical Neurophysiology 17: 16374.
regions may relect an earlier-level dysfunction in letter-string Santi, A., and Y. Grodzinsky. 2007. Working memory and syntax interact
recognition areas (Salmelin, Helenius, and Service 2000). in Brocas area. Neuroimage 37.1: 817.
Both EEG and MEG are limited to event-related designs that Tarkiainen, A., P. Helenius, P. L. Hansen, P. L. Cornelissen, and
measure the temporal dynamics of evoked responses to a brief R. Salmelin. 1999. Dynamics of letter string perception in the human
occipitotemporal cortex. Brain 122:211932.
duration stimulus. In some cases, however, temporal resolution
Wise, R. J. 2003. Language systems in normal and aphasic human sub-
isnt necessary. For instance, studying mood or drug states or
jects: Functional imaging studies and inferences from animal studies.
observing a cognitive strategy that evolves over time may require
British Medical Bulletin 65: 95119.
longer time intervals to measure, favoring fMRI or PET.
Susan Bookheimer
NUMBER
WORKS CITED AND SUGGESTIONS FOR FURTHER READING Number is a grammatical feature that quantiies the denotation
Bates E., S. M. Wilson, A. P. Saygin, F. Dick, M. I. Sereno, R. T. Knight, of a linguistic element. It can refer to entities or events, and in
and N. F. Dronkers. 2003. Voxel-based symptom mapping. Nature language we ind both nominal number (very common, dis-
Neuroscience 6: 44850. cussed in the following) and verbal number (less common, real-
Bookheimer, S. Y. 2002. Functional MRI of language: New approaches ized on the verb to indicate the number of events or the number
to understanding the cortical organization of semantic processing. of participants; also called pluractionality).
Annual Review of Neuroscience 25: 15188. Languages vary with regard to the part of their nominal inven-
Bookheimer, S. Y., T. A. Zeiro, T. Blaxton, W. D. Gaillard, B. Malow, and
tory that is involved in the number system. In diferent languages,
W. H. heodor e. 1998. Regional cerebral blood low during auditory
the split into nominals that do and do not express number may
responsive naming: Evidence for cross- modality neural activation.
Neuroreport, 9.10: 240913.
occur at diferent points of the animacy hierarchy: speaker (irst
Bookheimer, S. Y., T. A. Zeiro, T. Blaxton, W. D. Gaillard, and W. H. person pronouns) > addressee (second person pronouns) > third
heodore. 1995. Regional cerebral blood low during object naming person pronouns > kin > rational > human > animate > inanimate.
and word reading. Human Brain Mapping 3.2: 93106. Furthermore, not all nouns are number diferentiable. Two types of
Buxton, R. B., K. Uludag, D. J. Dubowitz, and T. T. Liu. 2004. Modeling noun are traditionally distinguished: count nouns and mass nouns,
the hemodynamic response to brain activation. Neuroimage. 23 the latter regarded as lacking the number distinction. At the level of
(Supplement 1): S2203. semantics, the countmass distinction can be captured with two
Cohen, M. S., and S. Y. Bookheimer. 1994. Functional magnetic reso- semantic features boundedness and internal structure (Jackendof
nance imaging. Trends in Neurosciences 17.7: 26877. 1991), which corresponds to the distinction between temporally
Dronkers, N. F., O. Plaisant, M. T. Iba-Zizen, and E. A. Cabanis. 2007.
bounded and unbounded events in verbal semantics. But count-
Paul Brocas historic cases: High resolution MR imaging of the brains
ability is really a characteristic of nominal phrases (Allan 1980),
of Leborgne and Lelong. Brain 130.5: 143241.
Hahne, A., K. Eckstein, and A. Friederici. 2004. Brain signatures of syn-
since many nouns can appear in both count and mass syntactic
tactic and semantic processes during childrens language develop- contexts, for example, Would you like a cake/some cake? We need a
ment. J Cogn Neurosci 16.7: 130218. bigger table/ here is not enough table for everyone to sit at.
Kwong, K. K., J. W. Belliveau, D. A. Chesler, I. E. Goldberg, R. M. Weisskof, When nominal number is found expressed on the noun or the
B. P. Poncelet, D. N. Kennedy, B. E. Hoppel, M. S. Cohen, R. Turner, H. noun phrase as such, it is considered inherent. When found on other
Cheng, T. Brady, and B. Rosen. 1992. Dynamic magnetic resonance elements of the noun phrase or on the verb, it is contextual. he
562
Occipital Lobe
expressions of nominal number can involve special number words was only indirect in the sense that its role was limited to provid-
(of diferent syntactic status); syntactic means (i.e., agreement, ing visual input processed later on by language centers in other
found most commonly on demonstratives and verbs but also on brain regions that contribute to certain aspects of language (such
articles, adjectives, pronouns, nouns in possessive constructions, as reading).
adverbs, adpositions, and complementizers); a variety of mor- New data are now challenging this view of the occipital lobe
phological means (inlections, stem changes, zero expressions, as solely processing visual information. First, several key studies
clitics); and lexical means (e.g., suppletion). Number is often on the blind have shown the involvement of the occipital lobe
marked in more than one way within one language. in processing other sensory modalities. In particular, there is
All nominal number systems are built on the primary opposi- a clear link between occipital lobe processing in the blind and
tion between singular (expressing the quantity one) and plural language and verbal memory functions, where this pattern of
(more than one). Other attested number values are dual (two), activation is attributed to massive reorganization of the occipi-
trial (three), and paucal (a few). here may be further divisions tal lobe in cases of blindness. Further, the involvement of the
into paucal and greater paucal, plural and greater plural (the last occipital lobe in nonvisual processing has clearly been demon-
value may imply an excessive number, or all possible instances of strated in the sighted (i.e., under normal development of the
the referent). No genuine quadrals (four) have been found. he occipital lobe), and much of todays research is exploring the
largest number systems involve ive values. In many languages, extent to which the occipital lobe is involved in language pro-
the absence of plural marking does not necessarily imply the sin- cessing under normal development. hese topics are covered in
gular, but the form may be outside the number opposition and the following sections.
express general number, that is, the meaning of the noun without
reference to number.
The Occipital Lobe and Vision
Associatives, distributives, and collectives all sometimes
We live in a culture that relies heavily on vision and, accord-
listed as additional values of number are better analyzed as
ingly, vision research has motivated and dominated neurosci-
independent features. Associativity expresses the meaning X
ence research. he discovery and analysis of cortical visual areas
and the group associated with X; distributives indicate that
using electrophysiological and anatomical techniques was one
entities (whether count or mass ones), events, qualities, or loca-
of the major milestones in visual neuroscience (e.g., Hubel and
tions are to be construed as distinct in space, sort, or time; and
Wiesel 1963, 1965; Zeki 1978). On the basis of the vast amount
collectives indicate that the members of a group are to be con-
of anatomical studies in the primate, we now have a picture of
strued together as a unit. Many languages have markers for these
a highly diverse and hierarchically structured system (Felleman
categories in addition to various number markers.
and Van Essen 1991). his hierarchical organization originates
Anna Kibort in the geniculostriate pathway from visual area 1 (V1, primary
visual cortex) and beyond to an array of visual areas. Along this
WORKS CITED AND SUGGESTIONS FOR FURTHER READING hierarchical organization there is an increase both in the recep-
Allan, Keith. 1980. Nouns and countability. Language 56: 54167.
tive ield size and in the complexity of the optimal stimulus neu-
Corbett, Greville G. 2000. Number. Cambridge: Cambridge University rons in each area. Converging evidence suggests that the visual
Press. cortex is structured according to several principles of organi-
Jackendof, Ray. 1991. Parts and boundaries. Cognition 41: 945. zation and functional neuroanatomical schemes. Following
are descriptions of some of the most important organizing
principles:
O 1. Topographical Organization. Topographic mapping can

chart an orderly and gradual change in some functional prop-
erty of cortical neurons laid along the cortical surface. he most
fundamental transformation in vision is retinotopy. his involves
OCCIPITAL LOBE the transformation from a Euclidean coordinate system in the
Alcmaeon of Croton, probably the irst person to suggest that the retina to polar coordinates in the visual cortex. In this transfor-
mind is located in the brain and not the heart, also suggested that mation, each of the early visual areas maps the visual ield along
the optic nerves are light-bearing paths to the brain. His revo- two orthogonal axes: the polar angle (points that lie on a spe-
lutionary ideas, formulated about 2,500 years ago, were ignored ciic radius whose origin is at the fovea have an identical polar
by Egyptian and Greek scholars alike (most notably by Aristotle). angle) and eccentricity (the eccentricity distance from the fovea,
We now know that visual information is delivered to the occipi- the center of the visual ield). Areas in the left central vision area
tal lobe mainly via the thalamus. he occipital lobe is the most are projected to the back of the right occipital lobe, whereas the
posterior of the four lobes of the brain (named after the four skull more peripheral areas are projected more anteriorly. he retinal
bones beneath which they lie). points on such radii are mapped onto parallel bands across cor-
Even until very recently, it was believed that the occipital lobe tical areas. he sequential layout of these bands reverses when
was involved only in the processing of visual information per crossing from one visual area to another, providing a way for an
se. Language was considered to take place in dedicated areas, accurate delineation of the borders of these retinotopic areas
mainly in the frontal and temporal cortex. hus the relevance of (Sereno et al. 1995). here is evidence that higher-order object-
the occipital lobe to language processing and language research related areas are also topographically organized, but the basis for
563
Occipital Lobe
this organization is still subject to debate (for a review, see Grill- information processing in the primary and secondary sensory
Spector and Malach 2004 and the following). areas is strictly modality-speciic. According to this view, the
2. Visual Pathways. Lesion studies in primates and human occipital cortex processes vision, and integration of the difer-
fMRI studies both suggest that there are two processing streams ent senses occurs only in higher-level areas. Recent evidence
between early, retinotopic visual areas in the occipital cortex suggests that the occipital cortex does in fact process nonvi-
and higher-order processing centers in the occipito-temporal sual functions. We focus later on object recognition and object
and occipito-parietal lobes. hese two streams are referred to naming, although similar results were obtained in the dorsal
as the dorsal and ventral streams, respectively (Ungerleider stream (e.g., for visuo-tactile orientation, see Zangaladze et al.
and Haxby 1994). Because the dorsal stream is also involved 1999).
in visuo-motor transformations, a diferentiation is made Recognition of an object can involve a wide range of cues,
between vision for action (dorsal stream) and vision for perfor example, a characteristic color, a unique texture, or a typical
ception (ventral stream) (Goodale and Milner 1992). he ven- sound. However, shape is a particularly fundamental feature for
tral stream contains structures devolved to the ine analysis of recognizing and naming objects. Surprisingly, recent neuroim-
a visual scene such as form and color. hus, it is also known aging studies have found that visual and tactile object-related
as the what pathway. It consists of areas V1V4 in the occipital information (both contribute to shape information) converges
lobe and several regions that belong to the lateral and ventral in a lateral occipito-temporal ventral visual stream area (LOtv;
temporal lobe. Amedi et al. 2001). A later study found that shape and not sen-
3. Functional Specialization. his principle of division of sory modality is indeed the crucial factor in activating these
labor, which leads to a specialization of function in the vari- regions. he study used visual-to-auditory sensory substitution
ous cortical areas, was originally suggested for the visual sys- devices (Bach-y-Rita and Kercel 2003) in which visual images
tem (Zeki 1978). Electrophysiological studies in nonhuman are captured by a camera and then transformed by a predeter-
primates have identiied organizing principles in addition mined algorithm into soundscapes that preserve shape infor-
to retinotopy, such as selectivity for simple features like spa- mation. he study found that recognizing objects by their typical
tial orientation in V1 and selectivity for categories of complex sounds or learning to associate speciic soundscapes with spe-
stimuli like faces for spatial layouts in inferior temporal (IT) ciic objects do not activate this region. Critically, soundscapes
cortex. In the last decade or so, the use of noninvasive func- synthesized to preserve shape information did activate LOtv
tional imaging, particularly fMRI, has dramatically increased robustly. his suggests that LOtv is driven by the presence of
our knowledge of the functional organization of the human shape information, rather than by the sensory modality that
visual cortex and its relation to vision, due to its ability to pro- provides this information (Amedi et al. 2007). It is interesting to
vide a large-scale neuro-anatomical perspective (e.g. Martin note that a similar phenomenon of a modal representation in
and Chao 2001). An active debate is ongoing about the actual occipito-temporal cortex was found for word recognition. he
organization of the ventral stream (Grill-Spector and Malach study showed that the left basal occipito-temporal area shows
2004). One area in the lateral occipital cortex, the lateral occipi- speciicity to word processing, regardless of the sensory modal-
tal complex (LOC; Malach et al. 1995) responds strongly to ity used. his area showed selective activation to words versus
pictures of intact objects by contrast to scrambled objects or non-word letter strings when subjects read using vision or when
nonobject textures. In the ventral occipito-temporal cortex, blind individuals read Braille using touch (Buchel, Price, and
specialized areas for faces (fusiform face area, FFA; Kanwisher, Friston 1998).
McDermott, and Chun 1997), scenes (Epstein and Kanwisher Another example of cross-modal interactions is the case of
1998), and human body parts (Downing et al. 2001) have been integration of heard and seen speech (letter-sound associa-
described, as well as for visual word forms (visual word form tion), which is another crucial function for normal development
area, VWFA; McCandliss, Cohen, and Dehaene 2003) which of language abilities. his function, however, was shown to
has direct implications for the discussion here. Developing a be mediated primarily by the temporal lobe (especially in the
theoretical framework that captures these specialized regions superior temporal sulcus and gyrus, STS and STG, respectively).
continues to be problematic, although the notion of widely For instance, a recent fMRI study (van Atteveldt et al. 2004)
distributed and overlapping cortical object representations showed that bilateral STS/STG responded more strongly to
remains a likely principle of organization (Haxby et al. 2001). bimodal matching of letter-sound pairs than to their respective
he efects of perceptual expertise for certain object categories unimodal components. Note that correspondences between
(Gauthier et al. 2000) and, more recently, diferent category- speech sounds and mouth movements are learned implicitly
related resolution needs (Grill-Spector and Malach 2004) have and early in development by exposure to heard speech together
also been put forward as candidate organizational principles of with the sight of the speaker. In contrast, the visual representa-
the human ventral stream. tion of spoken language by written language is a cultural arti-
fact. herefore, associations between letters and speech sounds
The Occipital Lobe, Multisensory Integration, are not learned automatically but require explicit instruction.
and Nonvisual Processing It is interesting that the learning of new letter-sound mappings
he perception of objects and the perception of space are cogni- involves the occipital cortex, whereas the auditory association
tive functions of prime importance. In everyday life, these func- cortex is active during the processing of previously acquired
tions beneit from the coordinated interplay of vision, audition, matching letter-sound combinations (Hashimoto and Sakai
and touch. A central theme in sensory neurophysiology is that 2004).
564
Occipital Lobe
Language and Verbal Processing in the Occipital Lobe tasks and during speech processing (See Burton, Diamond, and
READING AND DYSLEXIA; NAMING AND ALEXIA. Reading words McDermott 2003; Amedi et al. 2003; Pascual-Leone et al. 2005;
and naming visual objects involves the association of visual stim- Roder et al. 2002).
uli with phonological and semantic knowledge. Damage to For instance, robust plasticity in the left occipital areas of
the left occipital lobe can result in pure alexia: the inability to read the blind is evident during verbalmemory tasks requiring the
without losing the ability to write or the loss of any other major retrieval of abstract words from long-term memory in early
language-related function (Damasio and Damasio 1983). More blind individuals (Amedi et al. 2003). In this case, the observed
recent neuroimaging studies support this view by showing cor- occipital activation occurred without introducing any tactile or
relations between the left occipito-temporal cortex with linguis- auditory sensory input. Notably, blind subjects showed superior
tic aspects of reading and object naming based on visual input verbal memory capabilities, compared not only to age-matched,
in normal subjects. No less informative is the study of abnormal sighted controls but also with reported population averages.
patterns of activation in this part of the brain in subjects with Furthermore, in the blind group only, a strong positive correla-
developmental dyslexia (Schlaggar and McCandliss 2007). tion was found between the magnitude of V1 activation and the
verbal memory capabilities of individual subjects.
VISUAL WORD FORM IN THE SIGHTED AND TACTILE BRAILLE IN THE More directly related to language processing, several stud-
BLIND. An interesting example of such language-related pro- ies have used a verb-generation task, in which both blind and
cessing in the occipital lobe is the case of visual word form in the sighted subjects were instructed to generate a verb in response
sighted. Related to this is the case of occipital activation during to a noun cue. he sighted group showed activation in typical
Braille reading in the blind. In both cases, activation was found language-related areas (e.g., brocas area in the prefrontal
for both words and letter strings. he speciics and signiicance of cortex, which was activated as well in the blind), but no occipi-
these two examples are discussed in this section. tal activation. he blind group, however, showed additional
One of the most hotly debated topics in the context of modu- robust activation in the occipital cortex (Burton, Diamond, and
lar versus general architecture organization in the occipital lobe McDermott 2003; Amedi et al. 2003). Furthermore, as research
is the existence of the human visual word form area, which is in bilingual subjects has demonstrated, the convergence of
dedicated to the construction of visual words and thus is a key two languages in prefrontal cortex during semantic tasks (e.g.
player in our ability to read. Like other language-related areas Crinion et al. 2006), performed by bilingual blind subjects,
in the prefrontal and parietal cortex, this area also has a strong shows additional convergence of the two languages in the poste-
hemispheric dominance located in the left occipito-temporal rior occipital cortex, including in the primary visual cortex (Ofan
sulcus bordering the fusiform gyrus (McCandliss, Cohen, and and Zohary 2006). In addition, fMRI studies have shown that
Dehaene 2003). Other methodologies, such as recording ield efective connectivity between the prefrontal and occipital cor-
potentials in awake humans, showed selectivity to words and tex is increased in blind individuals during semantic processing.
letters strings in similar parts of the left occipital cortex (Nobre, Both early blind and sighted subjects activate a left-lateralized
Allison, and McCarthy 1994). Recently, a causal link between fronto-temporal core semantic retrieval system. However, blind
lesions in VWFA and acquired alexia without agraphia was dem- subjects activate additional extra-striate regions, which are cou-
onstrated in a patient who had a patch of his cortex removed in pled with frontal and temporal semantic regions (Noppeney,
surgery, causing activation in VWFA to disappear (Gaillard et al. Friston, and Price 2003; Liu et al. 2007).
2006; Martin 2006). Finally, it should be pointed out that neuroimaging, at best,
As in the case of the fusiform face area, some investigators establishes an association between brain activity and task per-
have suggested that there is no reason to label VWFA as a sepa- formance. A causal link between occipital areas and semantic
rate modular brain area purely because of the possible lack of processing was reported in a recent transcranial magnetic stimu-
speciicity of a ventral occipito-temporal lesion to cause pure lation study. TMS targeted over the left V1 or left occipito-tempo-
alexia, and since the reading disorder might not be limited to ral cortex led to a disruption and an increase in the error rate in a
words or could be a manifestation of a more general visual pro- similar verb-generation task in blind but not in sighted subjects
cessing deicit (Price and Devlin 2003). (Amedi et al. 2004). An analysis of error types revealed that the
What happens when early blind subjects read using a difer- most common error produced by the TMS was semantic (e.g.,
ent sensory modality? Recent neuroimaging studies in the blind apple would lead to the verb jump), whereas phonological errors
have demonstrated robust occipital cortex activation during and interference with motor execution or articulation were rare.
Braille reading (see blindness and language). In this case, hese results suggest that processing language and verbal
activation is not limited to occipito-temporal areas but stretches memory in the blind incorporates a widespread network that
all the way to the primary visual cortex (Sadato et al. 1996), and encompasses occipital visual brain areas, and that this type of
interference in processing in the occipital cortex using tran- reorganization of language and memory is relevant to behavior.
scranial magnetic stimulation (TMS) increases the error rate in It is clear, for example, that the functional and structural iden-
Braille reading (Cohen et al. 1997). tity of the occipital cortex may switch from processing visual
information to processing information related to another sen-
OCCIPITAL LOBE AND PLASTICITY IN LANGUAGE AND VERBAL sory modality or even diferent language functions. However,
MEMORY FUNCTIONS. Recent neuroimaging studies in the blind is this a unique consequence of early blindness? As shown here
have demonstrated robust occipital cortex activation during in some examples, the occipital cortex may inherently possess
a wide variety of linguistic and speciically semantic judgment the computational machinery needed for nonvisual information
565
Occipital Lobe Optimality Theory
processing. Under speciic conditions, this potential may be Hubel, D. H., and T. N. Wiesel. 1963. R Shape and arrangement of col-
materialized. If so, visual deprivation may simply allow for the umns in cats striate cortex. Journal of Neurophysiology 165: 55968.
emergence of the true potential of certain brain regions. his . 1965. Receptive ields and functional architecture in two non-
hypothesis also suggests that careful task choice and experimen- striate visual areas (18 and 19) of the cat. Journal of Neurophysiology
28: 22989.
tal design (e.g., blindfolding sighted subjects for several days)
Kanwisher, N., J. McDermott, and M. M. Chun. 1997. he fusiform face
may reveal additional nonvisual, linguistic roles in the occipital
area: A module in human extrastriate cortex specialized for face per-
cortex in the sighted (Pascual-Leone et al. 2005). ception. Journal Neuroscience 17: 430211.
Amir Amedi Liu, Y., et al. 2007. Whole brain functional connectivity in the early
blind. Brain 130: 208596.
Malach, R., et al. 1995. Object-related activity revealed by functional
magnetic resonance imaging in human occipital cortex. Proceedings
Amedi, A., et al. 2007. Shape conveyed by visual-to-auditory sensory sub- of the National Academy of Science 92: 81359.
stitution activates the lateral occipital complex. Nature Neuroscience Martin, A. 2006. Shades of Djerine forging a causal link between the
10: 6879. visual word form area and reading. Neuron 50: 1735.
Amedi, A., R. Malach, T. Hendler, S. Peled, and E. Zohary. 2001. Visuo- Martin, A., and L. L. Chao. 2001. Semantic memory and the
haptic object-related activation in the ventral visual pathway. Nature brain: Structure and processes. Current Opinion in Neurobiology
Neuroscience 4: 32430. 11: 194201.
Amedi, A, N. Raz, P. Pianka, R. Malach, and E. Zohary. 2003. Early McCandliss, B. D., L. Cohen, and S. Dehaene. 2003. he visual word form
Visual Cortex Activation Correlates with Superior Verbal Memory area: Expertise for reading in the fusiform gyrus. Trends in Cognitive
Performance in the Blind. Nature Neuroscience 6: 75866. Science 7: 2939.
Amedi, A, A. Floel, S. Knecht, E. Zohary, and L. G. Cohen. 2004. Nobre, A. C., T. Allison, and G. McCarthy. 1994. Word recognition in the
Transcranial Magnetic Stimulation of the Occipital Pole Interferes human inferior temporal lobe. Nature 372: 2603.
with Verbal Processing in Blind Subjects. Nature Neuroscience Noppeney, U., K. J. Friston, and C. J. Price. 2003. Efects of visual depriva-
7: 126670. tion on the organization of the semantic system. Brain 126: 16207.
Bach-y-Rita, P., and S. W. Kercel. 2003. Sensory substitution and the Ofan, R. H., and E. Zohary. 2006. Visual cortex activation in bilingual
human-machine interface. Trends in Cognitive Neuroscience 7: 5416. blind individuals during use of native and second language. Cerebral
Buchel, C., C. Price, and K. Friston. 1998. A multimodal language region Cortex 17: 124959.
in the ventral visual pathway. Nature 394: 2747. Pascual-Leone, A., A. Amedi, F. Fregni, and L. B. Merabet. 2005. he plas-
Burton, H., J. B. Diamond, and K. B. McDermott. 2003. Dissociating cor- tic human brain cortex. Annual Review of Neuroscience 28: 377401.
tical regions activated by semantic and phonological tasks to heard Price, C. J., and J. T. Devlin. 2003. he myth of the visual word form area.
words: A fMRI study in blind and sighted individuals. Journal of Neuroimage 19: 47381.
Neurophysiology 90: 196582. Rder, B., O. Stock, S. Bien, H. Neville, and F. Rosler. 2002. Speech pro-
Crinion, J., et al. 2006. Language control in the bilingual brain. Science cessing activates visual cortex in congenitally blind humans. European
312: 153740. Journal of Neuroscience 16: 9306.
Cohen, L. G., P. Celnik, A. Pascual-Leone, B. Corwell, L. Falz, et al. 1997. Sadato N., A. Pascual-Leone, J. Grafman, V. Ibanez, M. P. Deiber, et al.
Functional relevance of cross-modal plasticity in blind humans. 1996. Activation of the primary visual cortex by Braille reading in
Nature 389: 1803. blind subjects. Nature 380: 5268.
Damasio, A. R., and H. Damasio. 1983. he anatomic basis of pure Schlaggar, B. L., and B. D. McCandliss. 2007. Development of neural sys-
alexia. Neurology 33: 157383. tems for reading. Annual Reviews in Neuroscience 30: 475503.
Downing, P. E., Y. Jiang, M. Shuman, and N. Kanwisher. 2001. A cor- Serano, S.C., et al. 1995. Borders of multiple visual areas in humans
tical area selective for visual processing of the human body. Science revealed by functional magnetic resonance imaging. Science 268:
293: 24703. 88993.
Epstein, R., and N. Kanwisher. 1998. A cortical representation of the Ungerleider, L. G., and J. V. Haxby. 1994. What and where in the
local visual environment. Nature 392: 598601. human brain. Current Opinion in Neurobiology 4: 15765.
Felleman, D. J., and D. C. Van Essen. 1991. Distributed hierarchical pro- van Atteveldt, N., E. Formisano, R. Goebel, and L. Blomert. 2004.
cessing in the primate cerebral cortex. Cerebral Cortex 1: 147. Integration of letters and speech sounds in the human brain. Neuron
Gaillard, R., L. Naccache, P. Pinel, S. Clemenceau, E. Volle, et al. 2006. 43: 27182.
Direct intracranial, fMRI, and lesion evidence for the causal role of left Zangaladze, A., C. M. Epstein, S. T. Grafton, and K. Sathian. 1999.
inferotemporal cortex in reading. Neuron 50: 191204. Involvement of visual cortex in tactile discrimination of orientation.
Gauthier, I., P. Skudlarski, J. C. Gore, and A. W. Anderson. 2000. Expertise Nature 401: 58790.
for cars and birds recruits brain areas involved in face recognition. Zeki, S. M. 1978. Functional specialization in the visual cortex of the rhe-
Nature Neuroscience 3: 1917. sus monkey. Nature 274: 4238.
Goodale, M. A., and A. D. Milner. 1992. Separate visual pathways for per-
ception and action. Trends in Neurosciences 15: 205.
Grill-Spector, K., and R. Malach. 2004. he human visual cortex. Annual OPTIMALITY THEORY
Review of Neuroscience 27: 64977.
Optimality heory (OT; Prince and Smolensky [1993] 2004) is a
Hashimoto, R., and K. L. Sakai. 2004. Learning letters in adult-
hood: Direct visualization of cortical plasticity for forming a new link
formal theory of constraint interaction in grammar that seeks
between orthography and phonology. Neuron 42: 31122. to explain how and to what extent natural languages may vary.
Haxby, J. V., et al. 2001. Distributed and overlapping representa- In addition to this central question of generative grammar,
tions of faces and objects in ventral temporal cortex. Science research in OT addresses questions of the grammars use in per-
293: 242530. formance, its acquisition, and its neural realization.
566
Optimality Theory
OT in Theoretical Linguistics competitors only the most harmonic of which are grammatical.
Characterizing the set of possible natural languages minimally he competition (via H-Eval) evaluates each pair of candidates
requires specifying i) the mental representations that are char- against the universal constraint set Con, which is ordered into a
acteristic of language, ii) the constraints that distinguish possible language-particular domination hierarchy or ranking; C1 >> C2
from impossible linguistic systems, and iii) the formal mode of means that constraint C1 dominates constraint C2. In OT, con-
interaction among these constraints. Concerning mental rep- straint domination is strict: One violation of any constraint C is
resentations, OT imposes no restrictions beyond requiring that always worse than violating constraints ranked lower than C
speciications of phonological, syntactic, or semantic structure regardless of how many lower-ranked constraints are violated
be explicit in the sense of generative grammar. his makes OT and regardless of how severe the violations are of those lower-
compatible with alternative substantive theories of particu- ranked constraints. Given two candidate structural descriptions
lar grammar components; in syntax, for example, OT versions p and q for an input I, p has higher harmony (p q) if p is pre-
of government and binding (Grimshaw 1997; Legendre, ferred by the highest-ranked constraint that does not evaluate p
Smolensky, and Wilson 1998), lexical-functional gram- and q as equal. A candidate p is optimal if there is no other candi-
mar (Bresnan 2000), and the minimalist program (Mller date q with higher harmony. If p is optimal, then p cannot violate
1997) are lourishing. OTs main contribution concerns con- any constraint C unless every competing candidate that is pre-
straint interaction (iii) and, as a consequence, the proper formal ferred to p by C is dispreferred to p by a constraint higher ranked
characterization of the constraints themselves (ii). Hence, OT than C. In this sense, violations incurred by an optimal candidate
is best characterized as a meta-theory of grammatical structure structure are minimal. In sum, for every input I, harmony opti-
compatible with any explicit theory of linguistic representation. mization over the candidates in Gen(I) determines (at least) one
It is therefore applicable to all linguistic levels, and has been optimal, though not necessarily perfect, structural description of
applied to phonology (McCarthy and Prince 1993; Prince and I, which is ipso facto declared to be a grammatical output for that
Smolensky [1993] 2004), syntax (Legendre, Grimshaw and input.
Vikner 2001), semantics (Hendriks and de Hoop 2001), and he only cross-linguistically varying property of the grammar
pragmatics (Blutner and Zeevat 2004; Blutner, De Hoop, and is the relative ranking of the universal constraints in Con: the
Hendriks 2006). set of all possible grammars then is exactly the set of all rank-
According to OT, all grammatical constraints are universal ings of this ixed set of constraints. Typically, any given empirical
and violable (or soft) a claim that represents a major depar- pattern that is predicted to be part of the universal typology is
ture from previous approaches to the characterization of phono- generated by many diferent (but typologically equivalent) rank-
logical and syntactic knowledge via language-particular rewrite ings, and the number of predicted possible typological patterns
rules written in a universal notation, or as universal, inviolable is vastly smaller than the number of all possible rankings (e.g.,
constraints supplemented by additional principles subject to 13 patterns vs. 40,320 rankings in Smolensky and Legendre 2006,
language-particular parameterization. Chapter 15).
Employing violable, conlicting constraints often means that
ARCHITECTURE. An OT grammar maps an input speciication the universal constraints can be more simply stated: Complexity
onto an output structure. In phonology, the input is typically emerges primarily from the interaction of simple constraints;
an underlying form and the output the corresponding surface this reduces the need for hedges or disjunctive principles arising
form. In syntax the input is a proposition and the output is the when universal constraints are construed as inviolable (Speas
grammatical form that expresses that meaning (excepting inef- 1997). Constraint ranking naturally captures the common situ-
fability: see the section on Faithfulness). Gen (generator) is a ation in which some phenomenon widely observed in other
mechanism for producing candidate outputs for any input and languages is seen in one context only in language L (e.g., null
freely generates all of the types of structures that are present subjects in main clauses only as in Old French). Stipulating the
in any of the worlds languages (McCarthy and Prince 1993). limited distribution is unnecessary: In L, a lower-ranked con-
Pruning this enormous set down to the grammatical forms is the straint C (e.g., against null elements) often violated in gram-
job of H-Eval (harmony evaluator), a procedure for evaluating matical forms of L makes itself felt in those special contexts
the relative well-formedness (or harmony) of candidate struc- where dominating constraints do not contravene. In other lan-
tural descriptions. H-Eval depends on a set of universal well- guages, where C is more highly ranked, the phenomenon is seen
formedness constraints Con. widely.
COMPETITION AND CONFLICT. he fact that well-formedness con- FAITHFULNESS. he most familiar grammatical constraints are
straints are violable in OT means that two constraints often con- markedness constraints, which demand that the output struc-
lict: Satisfying one requires violating the other. Grammaticality ture meet some well-formedness condition (e.g., a subject must
is therefore not equated with satisfaction of all grammatical be an agent; Smolensky and Legendre 2006, Chapter 15). In an
constraints. Grammatical structures are simply structures that optimizing grammar, unless there is pressure for the output to
sufer less severe constraint violations than their ungrammatical contain all and only the elements contained in the input, the
counterparts. his means that the evaluation of grammaticality is optimizing system would always simply return the best of all
inherently comparative. At any level of description (phonological, structures (under the given ranking). Grammars must therefore
syntactic, etc.) the universal set of possible structural descrip- include (inputoutput) faithfulness constraints, which require
tions of an input I, Gen(I), forms a candidate set, a collection of minimal structural distance between the output and the input.
567
Optimality Theory
Faithfulness constraints, unique to OT, have been shown to probability of a candidate is proportional to the exponential of its
operate at all levels of linguistic description. In syntax, faithful- harmony. Gaja Jarosz 2006 deines a version of OT phonology,
ness constraints play a crucial role in accounting for language- maximum likelihood learning of lexicons and grammars (MLG),
particular inefability, that is, syntactic structures that are simply in which underlying forms as well as rankings have probability
impossible in certain languages, for example, multiple wh-ques- distributions, deining a lexicon + grammar. he relative prob-
tions in Irish (Legendre, Smolensky, and Wilson 1998; Legendre ability of a form has been used to model its gradient acceptability
2009). In most languages, faithfulness to question operators in a (Boersma and Hayes 2001; Hayes and Wilson 2008).
semantic input force the optimal syntactic output to contain mul-
tiple wh-phrases. In languages like Irish, however, such faithful- OT and Grammar Use
ness is ranked below the syntactic constraints violated by clauses OT is well suited for theories of performance. With no additional
containing multiple wh-phrases, and thus no optimal syntactic machinery, standard OT grammars assign a structural descrip-
structure contains multiple wh-phrases. Multiple wh-questions tion to all inputs, including loanword inputs violating the pho-
are therefore inexpressible (in a single clause). notactics of the borrowing language (Yip 1993; Davidson, Jusczyk,
and Smolensky 2006) or the initial fragment of a sentence being
VARIATION. OT is naturally extensible to unstable states of lan- processed word by word (Gibson and Broihier 1998; Stevenson
guage, such as free variation (Anttila 1998), dialectal variation, and Smolensky 2006). In the latter case, processing diiculty is
and diachronic change (Nagy and Reynolds 1997). Relaxing the predicted to occur if the optimal parse of the initial portion of
requirement of a complete ranking of the universal constraints, a a sentence changes substantially when a new word arrives. On
language L may be characterized by a single partial ranking P; L a formal level, the computational complexity of the problem
is generated by the set of grammars S full rankings consistent of computing optimal outputs is well studied (e.g., Tesar 1996;
with P. hese generate a set of outputs for each input. According J. Eisner 1997; Frank and Satta 1998; Idsardi 2006)
to Joan Bresnan, Ashwini Deo, and Devyani Sharma (2007),
intraspeaker variation in the British paradigm for be arises from OT and Grammar Acquisition
a partial ranking P among faithfulness constraints, requiring an OTs account of variation and change provides a natural extension
output to express input agreement features and markedness to the analysis of acquisition of phonology and syntax (Legendre
constraints penalizing all features in the output. A total ranking et al. 2002; Kager, Pater, and Zonneveld 2004; Legendre et al.
consistent with P having more highly ranked faithfulness yields a 2004). In early child syntax, for example, a childs competence
more highly inlected paradigm. may be characterized by a partial ranking P between faithful-
ness constraints and constraints penalizing syntactic structure
FURTHER DEVELOPMENTS. OT is an evolving theory; major devel- (Legendre et al. 2002). In some of the full rankings consistent
opments include the following. Faithfulness: Output-output with P, higher-ranked faithfulness constraints lead to optimal
faithfulness requires identity of a morphemes exponent across clauses with functional projections; in other rankings consistent
its paradigm (Burzio 1994; Benua 1995); output-output anti-faith- with P, lower ranking of these faithfulness constraints entail that
fulness constraints achieve morphophonological alternations by optimal outputs lack some or all functional projections. he vari-
demanding nonidentical surface forms for distinct underlying ation in child production of tense and agreement marking is thus
forms (Alderete 2001); sympathy theory demands faithfulness given a principled grammatical account.
to suboptimal candidates (McCarthy 1999b). Harmonic evalu- Formal and computational studies of the problem of learning
ation: Comparative markedness evaluation distinguishes con- OT grammars have been extensive, including constraint learn-
straint violations that are shared with the most faithful output ing (Hayes and Wilson 2008), ranking learning (Tesar 1998; Tesar
from those that are not (McCarthy 2003); targeted constraints and Smolensky 1998; Jason Eisner 2000; Boersma and Hayes
only compare candidates that difer only in a speciied way 2001; Hayes 2004; Prince and Tesar 2004), simultaneous learning
(Wilson 2001b). Architecture: Stratal OT assumes difering rank- of a probabilistic phonological lexicon and a ranking by maxi-
ings in a series of lexical levels (Kiparsky 2006); harmonic serial- mum likelihood estimation (Jarosz 2006), and the mathematical
ism derives the surface form from a series of small alterations, logic of OT learning (Prince 2002, 2006).
each optimal at its point in the derivation (Prince and Smolensky
[1993] 2004; McCarthy 1999a); candidate chain theory evalu- OTs Neural Realization
ates entire derivations (McCarthy 2007); bidirectional optimi- OT has historical roots in debates (Pinker and Prince
zation adds competition of interpretations/underlying forms to 1988; Smolensky 1988) concerning neural network (or
the competition of expressions/surface forms that is standard in connectionist) cognitive models (Rumelhart, McClelland,
OT (Smolensky 1996; Blutner 2000; Wilson 2001a). Probabilistic and the PDP Research Group 1986). In these models, networks
formulations: In stochastic OT (Boersma 1998), each optimiza- of abstract model neurons excite and inhibit one another as acti-
tion ranks the constraints according to relative numerical val- vation spreads from input neurons to output neurons. Formal
ues randomly selected for that optimization from a probability analysis of a class of networks reveals that they perform optimi-
distribution for each constraint, with a mean value determined zation: hey compute mental representations (activation pat-
by that constraints strength in the grammar. In the maximum terns) that maximize a numerical measure of self-consistency
entropy formulation of OT (Hayes and Wilson 2008), harmony is or well-formedness: harmony (Smolensky 1986). Mathematical
numerical: Each constraint has a numerical strength determin- analysis makes precise the following general picture: At a
ing the size of the penalties it assesses to violating candidates; the lower level of description, spreading activation among abstract
568
Optimality Theory
neurons maximizes numerical harmony; at a higher level of Kager, Joe Pater, and Wim Zonneveld. Cambridge: Cambridge
description, the same system computes the symbolic structural University Press.
description of the input that optimizes the harmony of an OT Hayes, Bruce, and Colin Wilson. 2008. A maximum entropy model of pho-
grammar. Construed in these terms, the study of grammar is notactics and phonotactic learning. Linguistic Inquiry 39: 379440.
Hendriks, Petra, and Helen de Hoop. 2001. Optimality theoretic seman-
fully integrated into the contemporary science of the mind/brain
tics. Linguistics and Philosophy 24: 132.
(Smolensky and Legendre 2006).
Idsardi, William J. 2006. A simple proof that optimality theory is compu-
Geraldine Legendre and Paul Smolensky tationally intractable. Linguistic Inquiry 37: 2715.
Jarosz, Gaja. 2006. Rich lexicons and restrictive grammars maximum
WORKS CITED AND SUGGESTIONS FOR FURTHER READING likelihood learning in optimality theory. Ph.D. thesis, Johns Hopkins
University.
Alderete, John. 2001. Dominance efects as transderivational anti- Kager, Ren. 1999. Optimality heory. Cambridge: Cambridge University
faithfulness. Phonology 18: 20153. Press.
Anttila, Arto. 1998. Deriving variation from grammar. In Variation, Kager, Ren, Joe Pater, and Wim Zonneveld, eds. 2004. Constraints in
Change, and Phonological heory, ed. F. Hinskens, R. van Hout, and Phonological Acquisition. Cambridge: Cambridge University Press.
W. L. Wetzel, 3568. Amsterdam: Benjamins. Kiparsky, Paul. 2006. Paradigms and Opacity. Stanford, CA: CSLI
Benua, Laura. 1995. Output-output faithfulness. In University of Publications.
Massachusetts Occasional Papers in Linguistics 18: Papers in Optimality Legendre, Graldine. 2009. Inefability in syntax. In Modeling
heory, ed. Jill Beckman, Laura Walsh Dickey, and Suzanne Urbanczyk, Ungrammaticality in Optimality heory, ed. Curt Rice, 23766.
77136. Amherst: University of Massachusetts at Amherst, GLSA. London: Equinox.
Blutner, Reinhard. 2000. Some aspects of optimality in natural language Legendre, Graldine, Jane Grimshaw, and Sten Vikner eds. 2001.
interpretation. Journal of Semantics 17: 189216. Optimality-heoretic Syntax. Cambridge, MA: MIT Press.
Blutner, Reinhard, Helen De Hoop, and Petra Hendriks. 2006. Optimal Legendre, Graldine, Paul Hagstrom, Joan Chen-Main, Liang Tao, and
Communication. Stanford, CA: CSLI Publications. Paul Smolensky. 2004. Deriving output probabilities in child manda-
Blutner, Reinhard, and Henk Zeevat, eds. 2004. Pragmatics in Optimality rin from a dual-optimization grammar. Lingua 114: 114785.
heory. London: Palgrave Macmillan. Legendre, Graldine, Paul Hagstrom, Anne Vainikka, and Marina
Boersma, Paul. 1998. Functional Phonology: Formalizing the Interactions Todorova. 2002. Partial constraint ordering in child French syntax.
between Articulatory and Perceptual Drives. he Hague: Holland Language Acquisition 10: 189227.
Academic Graphics. Legendre, Graldine, Paul Smolensky, and Colin Wilson. 1998. When
Boersma, Paul, and Bruce Hayes. 2001. Empirical tests of the gradual is less more? Faithfulness and minimal links in wh-chains. In Is the
learning algorithm. Linguistic Inquiry 32: 4586. Best Good Enough? Optimality and Competition in Syntax, ed. Pilar
Bresnan, Joan. 2000. Optimal syntax. In Optimality heory: Phonology, Barbosa, Danny Fox, Paul Hagstrom, Martha McGinnis, and David
Syntax and Acquisition, ed. Joost Dekkers, Frank van der Leeuw, and Pesetsky, 24989. MIT Working Papers in Linguistics. Cambridge,
Jeroen van de Weijer, 33485. Oxford: Oxford University Press. MA: MIT Press.
Bresnan, Joan, Ashwini Deo, and Devyani Sharma. 2007. Typology in McCarthy, John J. 1999a. Harmonic serialism and parallelism. NELS 30:
variation: A probabilistic approach to be and nt in the survey of 50124.
English dialects. English Language and Linguistics 11: 30146. . 1999b. Sympathy and phonological opacity. Phonology
Burzio, Luigi. 1994. Principles of English Stress. Cambridge: Cambridge 16: 33199.
University Press. . 2002. A hematic Guide to Optimality heory. Cambridge:
Davidson, Lisa, Peter W. Jusczyk, and Paul Smolensky. 2006. Optimality Cambridge University Press.
in language acquisition I: he initial and inal states of the phonolog- . 2003. Comparative markedness. heoretical Linguistics 29:
ical grammar. In he Harmonic Mind: From Neural Computation 151.
to Optimality-heoretic Grammar. Vol. 2. Ed. Paul Smolensky and . 2007. Hidden Generalizations: Phonological Opacity in Optimality
Graldine Legendre, 793839. Cambridge, MA: MIT Press. heory. London: Equinox.
Eisner, J. 1997. Eicient generation in primitive optimality theory. Annual McCarthy, John J., and Alan Prince. 1993. Prosodic morphology
Meeting of the Association for Computational Linguistics 35: 31320. I: Constraint interaction and satisfaction. Technical Report RuCCS-
Eisner, Jason. 2000. Easy and hard constraint ranking in optimality the- TR-3. Rutgers Center for Cognitive Science, Rutgers University, and
ory: Algorithms and complexity. In Finite-State Phonology: Proceedings University of Massachusetts at Amherst.
of the Fifth Workshop of the ACL Special Interest Group in Computational Mller, Gereon. 1997. Partial wh-movement and optimality theory.
Phonology (Sigphon), ed. Jason Eisner, Lauri Karttunen and A. hriault, Linguistic Review 14: 249306.
2233. Morristown, NJ: Association for Computational Linguistics. Nagy, Naomi, and William Reynolds. 1997. Optimality theory and vari-
Frank, Robert, and Giorgio Satta. 1998. Optimality theory and the gen- able word-inal deletion in Ftar. Language Variation and Change
erative complexity of constraint violability. Computational Linguistics 9: 3755.
24: 30715. Pinker, Steven, and Alan Prince. 1988. On language and connection-
Gibson, Edward, and Kevin Broihier. 1998. Optimality theory and human ism: Analysis of a parallel distributed processing model of language
sentence processing. In Is the Best Good Enough? Optimality and acquisition. Cognition 28: 73193.
Competition in Syntax, ed. Pilar Barbosa, Danny Fox, Paul Hagstrom, Prince, Alan. 2002. Entailed ranking arguments. Manuscript, Rutgers
Martha McGinnis, and David Pesetsky, 15791. MIT Working Papers University, New Brunswick, NJ.
in Linguistics. Cambridge, MA: MIT Press. . 2006. Implication and impossibility in grammatical sys-
Grimshaw, Jane. 1997. Projection, heads, and optimality. Linguistic tems: What it is and how to ind it. Manuscript, Rutgers University,
Inquiry 28: 373422. New Brunswick, NJ.
Hayes, Bruce. 2004. Phonological acquisition in optimality theory: he Prince, Alan, and Paul Smolensky. [1993] 2004. Optimality
early stages. In Constraints in Phonological Acquisition, ed. Ren heory: Constraint Interaction in Generative Grammar. Malden,
569
Oral Composition
MA: Blackwell. Original version was a technical report, Rutgers can come into being without writing a puzzle for those steeped
University and University of Colorado at Boulder.. in literate traditions that assume the centrality of the written
Prince, Alan, and Bruce B. Tesar. 2004. Learning phonotactic distribu- word; and second, how this relates to performance (for perfor-
tions. In Constraints in Phonological Acquisition, ed. Ren Kager, Joe mance is arguably how an oral creation exists).
Pater, and Wim Zonneveld. Cambridge: Cambridge University Press.
Earlier approaches focused largely on nonliterate settings,
Rumelhart, David E., James L. McClelland, and the PDP Research Group.
especially those characterized as primitive or traditional. One
1986. Parallel Distributed Processing: Explorations in the Microstructure
of Cognition. 2 vols. Cambridge, MA: MIT Press.
model was of spontaneous improvization by the unself-con-
Smolensky, Paul. 1986. Information processing in dynamical sys- scious child of nature, unfettered (and unhelped) by recog-
tems: Foundations of harmony theory. In David E. Rumelhart, James nized artistic conventions. Another was of unchanging tradition
L. McClelland, and the PDP Research Group, Parallel Distributed from the far-distant past, not composed by living creators but
Processing: Explorations in the Microstructure of Cognition, I: 194281. stored in the communal tribal memory.
Cambridge, MA: MIT Press. hese models were largely superseded by the inluential oral-
. 1988. On the proper treatment of connectionism. Behavioral formulaic approach (also known as the oral theory) which came
and Brain Sciences 11: 174. to the fore in the mid-twentieth century. he concept of oral
. 1996. On the comprehension/production dilemma in child lan- composition acquired a speciic meaning and became a key term
guage. Linguistic Inquiry 27: 72031.
of analysis and explanation. Its classic statement in Albert Lords
Smolensky, Paul, and Graldine Legendre. 2006. he Harmonic
seminal he Singer of Tales (1960) used ieldwork in the 1930s in
Mind: From Neural Computation to Optimality-heoretic Grammar.
Vol. 1. Cognitive Architecture. Vol. 2. Linguistic and Philosophical
Yugoslavia to demonstrate how lengthy oral poems were com-
Implications. Cambridge, MA: MIT Press. posed during performance: he singers drew on a traditional
Speas, Margaret. 1997. Optimality theory and syntax: Null pronouns and store of formulaic phrases and themes, which enabled them,
control. In Optimality heory: An Overview, ed. Diana Archangeli and without writing or verbatim memorization, to pour forth long
D.Terrence Langendoen, 17199. Malden, MA: Blackwell. epic songs in uninterrupted low. Variations around such formu-
Stevenson, Suzanne, and Paul Smolensky. 2006. Optimality in sen- laic phrases as, for example, By Allah, he said, and mounted his
tence processing. In he Harmonic Mind: From Neural Computation white horse recur throughout the poems, providing a parallel to
to Optimality-heoretic Grammar. Vol. 2. Ed. Paul Smolensky and the Homeric epithets like leet-footed Achilles found in early,
Graldine Legendre, 30738. Cambridge, MA: MIT Press.
putatively oral, Greek epics. his special technique of composi-
Tesar, Bruce B. 1996. Computing optimal descriptions for optimality
tion (Lord 1960, 17) relied not on preplanned, memorized texts
theory grammars with context-free position structures. Proceedings of
but on composition-in-performance. Contrary to literate expec-
the hirty-Fourth Annual Meeting of the Association for Computational
Linguistics, 1017. Morristown, NJ: Association for Computational tations, there was no ixed correct version: each performance was
Linguistics. authentic in its own right, a unique product composed and per-
. 1998. Error-driven learning in optimality theory via the ei- formed on one occasion. Oral-formulaic composing was linked
cient computation of optimal forms. In Is the Best Good Enough? to a traditional, oral mindset incompatible with literacy and the
Optimality and Competition in Syntax, ed. Pilar Barbosa, Danny Fox, literate mind, and once singers became literate, it was posited,
Paul Hagstrom, Martha McGinnis, and David Pesetsky, 42135. MIT they lost the power to compose orally.
Working Papers in Linguistics. Cambridge, MA: MIT Press. he oral-formulaic theory was enormously inluential
Tesar, Bruce B., and Paul Smolensky. 1998. Learnability in optimality throughout much of the later twentieth century and across a
theory. Linguistic Inquiry 29: 22968.
wide span of disciplines, providing, as it apparently did, an
Wilson, Colin. 2001a. Bidirectional Optimization and the heory of
answer to the puzzle of verbal composition without writing.
Anaphora. In Optimality-heoretic Syntax, eds. Geraldine Legendre,
Examples of comparable formulaic expression and hence, it
Sten Vikner and Jane Grimshaw. Cambridge, MA: MIT Press.
. 2001b. Consonant cluster neutralisation and targeted con- seemed, of oral composition were identiied throughout the
straints. Phonology 18: 14797. globe, from early Greek epic, Old English texts, or the Bible to
Yip, Moira. 1993. Cantonese loanword phonology and optimality the- living examples recorded from the ield, soon also extending to
ory. Journal of East Asian Linguistics 2: 26191. the full range of poetic genres and to prose-like forms, such as
sermons or storytelling.
hough still regarded as a classic approach, oral-formulaic
ORAL COMPOSITION
theory has been both modiied and challenged, especially dur-
Oral composition broadly refers to the creation of organized ver- ing the last two decades. First, it has become apparent that not
bal formulations without reliance on writing. hough in essence all genres of unwritten verbal art follow the oral-formulaic com-
a familiar process in everyday speech, it has become a quasi- position-in-performance mode, nor, as implied by the classic
technical and debated term, applied especially to bringing into oral-formulaic analysts, is oral composition a single identiiable
being relatively sustained examples of entextualized verbal art, process. heir often somewhat generalized conclusions have not
both ancient and recent. It has thus been of interest to linguists, been fully supported by the empirical evidence, for oral forms
anthropologists, folklorists, psychologists, historians, and spe- turn out to be created in diverse ways. Some are composed before
cialists in speciic languages and cultures, also linking to work and separated from performance. Some do, after all, involve
on oral culture, performance, story, literacy, and memorization. One much-quoted case is of the Somali poets
memory. who spend hours, sometimes days, composing elaborate genres
he central issues have been, irst and most directly, how of oral poetry, later delivered word for word either by themselves
lengthy oral poems, narratives, and other sustained verbal forms or by reciters who are able to memorize poems and, without
570
Oral Composition
writing, store large and exactly reproducible repertoires in their better, as a multifaceted spectrum of overlaps, interpenetrations,
memory over many years. Elsewhere, too, prior composition is and diversities. he now-inluential concepts of entextualization
sometimes a long-drawn-out and carefully considered proce- and of dialogism, here applied in particular by linguistic and
dure, in some cases involving multiple authors and/or rehears- literary anthropologists, have also bridged the once-accepted
als before being performed. Certain womens personal songs in chasm between oral and literate and illumined the multiple ways
mid-twentieth-century Zambia, for example, were thought out in which people construct, assemble, and interact with texts
by one woman, elaborated with her friends, worked over for days (Barber 2007; Silverstein and Urban 1996).
by an expert composer, then rehearsed and memorized before he meaning of oral itself has also been enlarged and prob-
inal performance. In other cases, a composer may speak aloud lematized. Most oral compositions, it is now increasingly noted,
words of rapid inspiration designed for later performance, to are realized not just through words but through a constellation
be captured by listeners on the spot through memorizing, tape of multimodal resources. he act of performance may include,
recording, or writing (further details and discussion in Finnegan for example, movement, bodily enactment, visual devices, and
1992, 5287; 2007, 96113, 179200). Contrary to the classic oral- the variegated arts of the voice (volume, intonation, speed,
formulaic model, oral composing varies in diferent cultures, silence, timbre, atmosphere, and much else): A musical element
genres, and circumstances. is essential in certain genres, an aspect often neglected in Western
Second, the assumption that literacy and orality are mutually scholars propensity to privilege the verbal component. Although
incompatible has been extensively challenged. By now, many music and words are in some cultures and genres taken as dis-
empirical examples of their interaction in both historical and tinct, composed by diferent people, this is not always so, and
more recent times have been noted and investigated. At a more some scholars argue that language and music form a continuum
theoretical level, there are also the current transdisciplinary cri- rather than a dichotomy (see Banti and Giannatasio 2004). he
tiques of the West-centered binary dichotomizing between primi- substantial recent work on gesture (McNeill 2000; Kendon
tive/civilized, non-Western/Western, traditional/modern, and, 2004) has also elucidated the integral relation between gesturing
alongside these, oral/literate, together with parallel challenges and speaking. Even if below our explicit consciousness, gesture,
to the arguably ethnocentric and ideological presuppositions of it seems, is a planned and patterned activity, a dimension there-
a simple and necessary link between literacy and modernity. In fore that, like music, must arguably enter into a full understand-
practice, it appears, there are multiple forms of literacy, interacting of oral composition and performance.
ing, therefore, in multiple ways with oral modes. Recent approaches to memory are also relevant. Historians,
Despite challenges to some of its central presuppositions, the anthropologists, and psychologists have drawn attention to the
legacy of the oral-formulaic school lives on. It rightly unsettled frames within which remembering is actively recreated and
the (literate) concept of ixed correct text, highlighted the signif- to the diverse social mechanisms for organizing and manipu-
icance of performance and audience, and, if in the (arguably) lating memory. Some cultures or genres prioritize word-for-
somewhat elusive terminology of formulae, pointed up the word memorization and organize formal or informal training
importance for composition of conventionalized verbal formu- in this skill; in others, diferent arts are emphasized, including
lations in generic settings. Scholars identifying themselves with improvization. Generic conventions themselves provide sche-
that tradition have continued their (largely textual) examina- mas for organizing and activating memory, ofering constraints
tions of oral and oral-derived texts while also reconiguring and opportunities for the creative low of language, not only
their approaches by attention to the speciicities of aesthetic and through larger frames such as narrative, praising, or lamenting
cultural traditions, interacting fruitfully with trends elsewhere to but also by memory-enhancing devices like imageries, rhythm,
produce sophisticated analyses of the complex interrelations of and audience (and chorus) participation and by sound-pattern
oral with written composition (Amodio 2005; Foley 2002). repetitions and sequences, such as rhyme, alliteration,
Although there is currently no one dominant approach to parallelism, or melody. In some contexts, memory is seen as
complement the earlier oral theory, the topic of composition itself an aspect of creativity, eroding its apparent opposition
without writing (or anyway, without central reliance on writing) to composition (see further Rubin 1995; Carruthers 1990).
has continued to attract interdisciplinary interest. he focus is he upshot is that oral composition has somewhat dissolved
now less on attempting to delineate oral composition as a single as a distinctive topic for analysis. It no longer stands out as some-
process, or as pertaining to some special kind of culture or men- thing self-evidently special or puzzling but as an aspect of pro-
tality, and more on complexity and plurality. cesses being studied from other viewpoints and as taking place
Oral composition is thus no longer conceptualized as primar- in many diferent forms, settings, and modalities from lengthy
ily conined to traditional, historic, or non-Western settings art genres to the creativity of everyday conversation; from long
but as also including such examples as contemporary popular preplanned and rehearsed performances to extemporized
songs or the spoken oratory of modern statesmen and publi- speeches; from live delivery to multimedia enactments. It is now
cists. It has also been noted how readily some long-established tied less to theories of the primitive, traditional, or, indeed, the
oral genres are exploited in new settings, like the South African oral as such as to ongoing issues related to language or creativ-
praise poems now composed for Nelson Mandela, the national ity more generally, analyzed both comparatively and in cultural
football team, or university graduation ceremonies, and circu- speciicities.
lated not only in live performance but in writing and on radio, While in one way this has undermined the idea of oral com-
CD-ROMs, and the Web. he relation between oral and literate is position as a subject for direct scrutiny in its own right, in another
now more often envisaged as continuum than as opposition or, way this broader cross-cultural approach and the empirical
571
Oral Culture
investigations it has stimulated have enabled a irmer grasp on culture to certain degrees, but without ever eliminating it or
the complexity of the processes by which, without much or any totally superseding it. In the world today, an estimated one bil-
direct recourse to writing, people can and do produce verbal lion people do not know how to read or write any language, and
formulations both lengthy and short, aesthetically marked and so they live in a residual form of primary oral culture. In addi-
everyday. Further, all of this has helped to challenge traditional tion, certain cultures in the world today remain highly oral, just
models of language as realized preeminently either, on the one as Western culture did for centuries before print culture helped
hand, in stable written texts or, on the other, in relatively uncon- usher in what is commonly referred to as modern culture and
strained and perhaps trivial everyday speech. A consideration of modernity modern science, modern capitalism, modern
oral composing highlights the sustained and creative marshal- democracy, the Industrial Revolution, and the Romantic move-
ing of language in situations where writing does not necessarily ment. he common distinction between modern culture, promi-
lie at the core: verbal genres that are by no means outdated or nent in the West, and premodern cultures in many other areas
peculiar but have had a wide spread in the world, both yesterday of the world (e.g., Turner 1969) can be understood in terms of
and today. Ongs account of Western cultural history. Premodern cultures
are examples of what Ong has referred to as primary oral cultures
Ruth Finnegan
and as residual forms of primary oral cultures.
When alphabetic writing was introduced, Ong claims (and so
do McLuhan and Havelock), it did not change everything over-
Amodio, Mark C., ed. 2005. New Directions in Oral heory. Tempe: Arizona night. As a result, early writing such as most of the Bible (except
Center for Medieval and Renaissance Studies. for the prologue to the Gospel of John) and the Homeric epics
Banti, G., and F. Giannatasio. 2004. Poetry. In A Companion to Linguistic can be seen as providing transcripts of primary oral thought
Anthropology, ed. Alessandro Duranti, 290320. Oxford: Blackwell.
and expression. But distinctively literate forms of thought and
Barber, Karin. 2007. Texts, Persons and Publics in Africa and Beyond.
expression emerged in the pre-Socratics and Plato, as Havelock
Carruthers, Mary. 1990. he Book of Memory. Cambridge: Cambridge
explains in detail (1963, 1978, 1982). Perhaps more than anything
University Press. else, Ong sees the formal study of logic initiated by Aristotle as
Finnegan, Ruth. 1992. Oral Poetry: Its Nature, Signiicance and Social involving distinctively literate thought; Ong has traced the his-
Context. 2d ed. Bloomington: Indiana University Press. tory of the formal study of logic in his 1958 masterwork Ramus,
. 2007. he Oral and Beyond: Doing hings with Words in Africa. Method, and the Decay of Dialogue: From the Art of Discourse to
Oxford: James Currey. Chicago: Chicago University Press. the Art of Reason (3d ed. 2004). Within the Aristotelian tradition
Foley, John Miles. 1988. he heory of Oral Composition: History and of medieval logic, Ong notes, new developments emerged that he
Methodology. Bloomington: Indiana University Press. styles the quantiication of thought (see esp. [1958] 2004, 5391).
. 2002. How to Read an Oral Poem. Urbana: University of Illinois In a subsequent essay, he points out how these new develop-
Press.
ments contributed to the emergence of a new state of mind as
Kendon, Adam. 2004 Gesture: Visible Action as Utterance.
found in modern science (Ong 1962, 72). Neither this new state
Lord, Albert B. 1960. he Singer of Tales. Cambridge: Harvard University
of mind nor modern science emerged in oral culture, just as the
Press. formal study of logic developed by Aristotle has no counterpart
McNeill, David, ed. 2000. Language and Gesture. Cambridge: Cambridge in oral culture.
University Press. In Ramus, Method, and the Decay of Dialogue, Ong also calls
Rubin, David C. 1995. Memory in Oral Traditions: he Cognitive attention to the visualist tendencies of Western philosophic
Psychology of Epic, Ballads, and Counting-Out Rhymes. Oxford: Oxford thought, which were advanced further by the development of
University Press. printed books. he visualist tendencies of ancient Greek philo-
Silverstein, Michael, and Greg Urban, eds. 1996. Natural Histories of sophic thought have been further ampliied recently by Andrea
Discourse. Chicago: Chicago University Press. Nightingale (2004). But such visualist tendencies do not charac-
terize oral culture. Ong ([1969] 1995) also describes oral culture
as based on an oral-aural sense of the world as event, which he
ORAL CULTURE
contrasts with the visual sense of the world as something seen
Oral culture is a conceptual construct associated primarily with (as in the expression worldview). In World as Event: Aspects of
the work of Walter J. Ong, S.J., Marshall McLuhan, and Eric A. Chipewyan Ontology (1997), anthropologist David M. Smith
Havelock, whereas the term oral tradition, which they also use, borrows Ongs expression world as event to help elucidate cer-
is more often associated with the work of Milman Parry, Albert B. tain aspects of Chipewyan thought.
Lord, and their many followers (see Foley 1985). Ong, McLuhan, Ong associates oral culture with the cyclic forms of thought
and Havelock use the term oral culture to refer primarily to pre- that Mircea Eliade describes in he Myth of the Eternal Return
literate cultures but also to characterize the thought and expres- ([1949] 2005). Lynne Ballew describes further examples of cyclic
sion that carry over into manuscript culture and even into print thought in Straight and Circular: A Study of Imagery in Greek
culture. Moreover, oral culture, which Ong also refers to as Philosophy (1979). Ong sees the recycling of souls in the story
primary oral culture, endures in the sense that people continue of Er recounted by Socrates in Platos Republic as an instance
to talk with one another. he later subsequent cultural devel- of cyclic thought in Greek philosophy, which is to say a residual
opments in manuscript culture and print culture may be seen form of oral thought in Greek philosophic thought. Conversely,
as cultural overlays that inluence and transform the base oral Ong (see, for example, 1967a, 6182, 8398, 99126) associates
572
Oral Culture
the linear accounts of time in the Bible with literacy; he likes Brakke, David. 2006. Demons and the Making of the Monk: Spiritual
to style linear conceptions of history as evolutionary thought, Combat in Early Christianity. Cambridge: Harvard University Press.
thereby rooting later forms of evolutionary thought in Darwin Cary, Phillip. 2000. Augustines Invention of the Inner Self: he Legacy of a
and others within the biblical cultural tradition in Western tra- Christian Platonist. New York: Oxford University Press.
Connor, James L. 2006. The Dynamism of Desire: Bernard J. F.
dition. Independently of Ong, Donald L. Fixico, who is himself
Lonergan, S.J., on The Spiritual Exercises of Saint Ignatius of Loyola.
of American Indian descent, works comfortably with these con-
St. Louis, MO: Institute of Jesuit Sources.
trasts in he American Indian Mind in a Linear World: American Daniell, Beth. 1986. Against the great leap theory of literacy. Pre/Text
Indian Studies and Traditional Knowledge (2003; also see Lee 7.3/4: 18193. Also see Ong 1987.
1987, 10520). Draper, Jonathan A. 2004. Orality, Literacy, and Colonialism in Antiquity.
In Manliness, Harvey C. Mansield does not happen to refer Leiden: Brill.
explicitly to oral culture, but he refers to Achilles frequently to Draper, Jonathan A., ed. 2003. Orality, Literacy, and Colonialism in
illustrate certain points regarding manliness (2006, 558, 601), Southern Africa. Leiden: Brill.
an ambivalent quality that he sees as needing to be disciplined Eliade, Mircea. [1949] 2005. he Myth of the Eternal Return. 2d ed. Trans.
toward socially constructive ends. Male puberty rites, for exam- Willard R. Trask, new introduction by Jonathan Z. Smith. Princeton,
ple, have long been used in oral cultures to help discipline and NJ: Princeton University Press.
Farrell, homas J. 2000. Walter Ongs Contributions to Cultural
orient young men in socially constructive ways (see van Gennep
Studies: he Phenomenology of the Word and I-hou Communication.
1960; Ong 1971, 11341). he kind of socially constructive war-
Cresskill, NJ: Hampton.
rior manliness that Achilles and Agamemnon and Hector and Fixico, Donald L. 2003. he American Indian Mind in a Linear
Odysseus represent is a necessity in oral cultures: he entire World: American Indian Studies and Traditional Knowledge. New
enterprise of modernity, however, could be understood as a proj- York: Routledge.
ect to keep manliness unemployed (Mansield 2006, 230). In Foley, John Miles. 1999. Homers Traditional Art. University
David Riesmans terminology, oral culture is tradition directed, Park: Pennsylvania State University Press.
whereas modernity is dominated by inner-directedness (1950). Foley, John Miles, ed. 1985. Oral-Formulaic heory and Research: An
(For further recent studies of the historical development of inner- Introduction and Annotated Bibliography. New York: Garland.
directedness, see Williams 1993; Brakke 2006; Cary 2000; van t Havelock, Eric A. 1963. Preface to Plato. Cambridge: Belknap Press/
Harvard University Press.
Spijker 2004; Renevey 2001; Low 2003; Connor 2006; Bloom 1998;
. 1978. he Greek Concept of Justice: From Its Shadow in Homer to Its
Kahler 1973). In Honor and the Epic Hero, Maurice B. McNamee
Substance in Plato. Cambridge: Harvard University Press.
(1960) shows that concepts about heroic and great-spirited per-
. 1982. he Literate Revolution in Greece and Its Cultural
sons have shifted from time to time. Even though the concepts of Consequences. Princeton, NJ: Princeton University Press.
a heroic and magnanimous person in oral culture no longer work Horsley, Richard A., Jonathan A. Draper, and John Miles Foley, eds.
for modernity, we do need to formulate some concepts of heroic 2006. Performing the Gospel: Orality, Memory, and Mark. Minneapolis,
and magnanimous persons that will work for modernity. MN: Fortress.
Before concluding, we should note the critique that some Jousse, Marcel. 1990. he Oral Style. Trans. Edgard Sienaert and Richard
authors have made of Ongs work and related work regarding Whitaker. New York: Garland.
oral culture. he critique alleges that Ong has set forth a great Kahler, Erich. 1973. he Inward Turn of Narrative. Trans. Richard
divide theory in which there is a great divide with oral culture Winston and Clara Winston, foreword by Joseph Frank. Princeton,
NJ: Princeton University Press.
when literacy emerges (see, for example, Daniell 1986; but also
Kelber, Werner H. 1997. he Oral and the Written Gospel: he Hermeneutics
see Ongs 1987 letter about her article). Beth Daniell and others
of Speaking and Writing in the Synoptic Tradition: Mark, Paul, and Q.
who advance this critique do not accurately summarize what
2d ed. Foreword by Walter J. Ong, S.J., new introduction by Werner H.
Ong has said, and so their supposed critique amounts to little Kelber. Bloomington: Indiana University Press.
more than knocking down a straw man named by them Ong. Lee, Dorothy. 1987. Freedom and Culture. Prospect Heights,
(For a more detailed response to this alleged line of critique, see IL: Waveland.
Farrell 2000, 1626, 2004). Lord, Albert B. 1960. he Singer of Tales. Cambridge: Harvard University
In conclusion, in oral culture, people are culturally condi- Press.
tioned so that they tend to favor cyclic patterns of thought and Low, Anthony. 2003. Aspects of Subjectivity: Society and Individuality from
expression, to have a world-as-event sense of life, to put manli- the Middle Ages to Shakespeare and Milton. Pittsburgh, PA: Duquesne
ness to work in socially constructive ways, to use oral stories of University Press.
Mansield, Harvey C. 2006. Manliness. New Haven, CT: Yale University
heroes as ways to help orient and put manliness to work, and to
Press.
use ritual process very efectively to promote and support socially
McLuhan, Marshall. 1962. he Gutenberg Galaxy: he Making of
constructive behavior.
Typographic Man. Toronto: University of Toronto Press.
homas J. Farrell McNamee, Maurice B. 1960. Honor and the Epic Hero: A Study of the
Shifting Concept of Magnanimity in Philosophy and Epic Poetry.
New York: Holt, Rinehart and Winston.
Morris, Jan, and Barry Powell, eds. 1997. A New Companion to Homer.
Ballew, Lynne. 1979. Straight and Circular: A Study of Imagery in Greek Leiden: Brill.
Philosophy. Assen, the Netherlands: Van Gorcum. Nightingale, Andrea. 2004. Spectacles of Truth in Classical Greek
Bloom, Harold. 1998. Shakespeare: he Invention of the Human. New Philosophy: heoria in Its Cultural Context. Cambridge: Cambridge
York: Riverhead Books. University Press.
573
Ordinary Language Philosophy
Ong, Walter J. [1958] 2004. Ramus, Method, and the Decay of necessary for the elimination of the confusing ambiguities of
Dialogue: From the Art of Discourse to the Art of Reason. 3d ed. New ordinary language. Advocates of the ordinary language approach
foreword by Adrian Johns. Chicago: University of Chicago Press. may recognize the power, the frequent utility, and the intellec-
.1962. he Barbarian Within: And Other Fugitive Essays and Studies. tually admirable parsimony of the formalist, but they also insist
New York: Macmillan.
that ordinary linguistic practice as it stands is generally appro-
. 1967a. In the Human Grain: Further Explorations of Contemporary
priate for our use without any preemptory need for a compre-
Culture. New York: Macmillan.
. 1967b. he Presence of the Word: Some Prolegomena for Cultural
hensive and indispensable reform, and, moreover, that ordinary
and Religious History. New Haven, CT: Yale University Press. usage contains helpful distinctions and nuances that would be
. [1969] 1995. World as view and world as event. In Faith and hurtfully eliminated if the rigors of a formalist system were to be
Contexts, III: 6990. Atlanta: Scholars Press.Originally printed in imposed as the ultimate standard. Peter Strawson, J. L. Austin,
American Anthropologist 71 (August): 63447. and John Searle have been, in varied ways, advocates of an ordi-
. 1971. Rhetoric, Romance, and Technology: Studies in the Interaction nary language approach, and undoubtedly Ludwig Wittgensteins
of Expression and Culture. Ithaca, NY: Cornell University Press. move from the rigors of the Tractatus to the complexities of the
. 1977. Interfaces of the Word: Studies in the Evolution of Philosophical Investigations is seminal to the entire movement.
Consciousness and Culture. Ithaca, NY: Cornell University Press. A short entry cannot survey all the philosophers and topics
. 1981. Fighting for Life: Contest, Sexuality, and Consciousness.
of importance in this ordinary language approach, but as prime
Ithaca, NY: Cornell University Press.
examples this entry will focus on one philosopher, Peter Strawson,
. 1986. Hopkins, the Self, and God. Toronto: University of Toronto
Press.
and on one topic, reference. We will consider the way in which
. 1987. Letter to the Editor. Pre/Text 8.1/2: 155. Comments on Strawson makes reference his starting point, how this single topic is
Daniell 1986. embedded in contemporary debates within the philosophies of lan-
. 199299. Faith and Contexts. 4 vols. Ed. homas J. Farrell and guage and logic, Strawsons own signiicant contributions to those
Paul A. Soukup. Atlanta: Scholars Press. Volumes now distributed by debates, and some replies he makes to his critics. Also noted will be
Rowman & Littleield. Bertrand Russell and Wittgenstein as prime movers in Strawsons
. 2002a. An Ong Reader: Challenges for Further Inquiry. Ed. homas thought and W. V. O. Quine as a stern and characteristic critic.
J. Farrell and Paul A. Soukup. Cresskill, NJ: Hampton. he best known of all Strawsons writings is probably his early
. 2002b. Orality and Literacy: he Technologizing of the Word. 2d ed.
article On referring (1950) in which he addresses the issue of
New York: Routledge.
singular reference and predication and their objects, a matter with
Opland, Jef. 1983. Xhosa Oral Poetry. Cambridge: Cambridge University
which he was concerned throughout his working life. his article
Press.
Parry, Milman. 1971. he Making of Homeric Verse: he Collected Papers was written in response to Bertrand Russells theory of deinite
of Milman Parry. Ed. Adam Parry. New York: Oxford University Press. descriptions contained in On denoting (1905). For Strawson,
Renevey, Denis. 2001. Language, Self, and Love: Hermeneutics in the we use a variety of expressions to refer to some individual person,
Writings of Richard Rolle and the Commentaries on the Song of Songs. object, or event. We use singular descriptive pronouns (this and
Cardif: University of Wales Press. that), proper names (Winston Churchill), and singular pronouns
Riesman, David, with Rueul Denny and Nathan Glazer. 1950. he Lonely (I, you, it), and for what are called deinite descriptions, we use
Crowd: A Study of the Changing American Character. New Haven, the deinite article followed by a noun in the singular, e.g., the
CT: Yale University Press. king of France. Suppose someone at present utters the sentence
Scholes, Robert, and Robert Kellogg, with a chapter by James Phelan.
he king of France is wise (S). For Bertrand Russell, S is signii-
2006. he Nature of Narrative. 2d ed. New York: Oxford University
cant; that is, it may be true or false. But he claims that to show the
Press.
true logical form of S, it needs to be rewritten as
Smith, David M. 1997. World as event: Aspects of Chipewyan ontology.
In Circumpolar Animism and Shamanism, ed. Takako Yamada and (1) here is a king of France.
Takashi Irimoto, 6791. Sapporo, Japan: Hokkaido University Press.
Turner, Victor. 1969. he Ritual Process: Structure and Anti-Structure. (2) here is not more than one king of France.
Chicago: Aldine. (3) here is nothing that is the king of France and is not
van Gennep, Arnold. 1960. he Rites of Passage. Trans. Monika B. wise.
Vizedom and Gabrielle L. Cafee, introduction by Solon T. Kimball.
Chicago: University of Chicago Press. hus, someone uttering S today would be saying something
van t Spijker, Ineke. 2004. Fictions of the Inner Life: Religious Literature signiicant but false. For Russell, we must distinguish deinite
and Formation of the Self in the Eleventh and Twelfth Centuries. descriptions such as the king of France from logically proper
Turnhout, Belgium: Brepols. names, for example, Winston Churchill. he latter alone can be
Walker, Jefrey. 2000. Rhetoric and Poetics in Antiquity. New York: Oxford subjects of sentences of a genuine subject-predicate form and
University Press. have some single object for which they stand.
Williams, Bernard. 1993. Shame and Necessity. Berkeley and Los
Strawson thinks that Russell is wrong in this, since his account
Angeles: University of California Press.
of sentences 13 is neither completely nor even partially correct.
A correct account must begin by distinguishing among
ORDINARY LANGUAGE PHILOSOPHY
a sentence
Within the analytic tradition of contemporary Anglophone phi-
losophy, ordinary language philosophy is set in contrast to the a use of a sentence
view that the prescriptions of formal logic provide the means an utterance of a sentence
574
he sentence he king of France is wise can be uttered at distinction when we say that a statement is untrue or that it is
various times and for various purposes. We cannot say that the inconsistent. If a deductive argument is valid, if the premises are
sentence is true or false, only that it may be used to make a true true, then the conclusion is judged necessarily true under pain of
or false assertion. At the heart of Strawsons position is the claim inconsistency or self-contradiction. But in all of this we must also
that referring is not something that an expression such as the consider the context of statements that are made. Asked if the
king of France does. Referring is, instead, characteristic of the results of the recent election pleased me, I may signiicantly reply
use of an expression. Meaning is a function of the sentence or that they did and they didnt. Words such as vehicle and enter-
expression, but mentioning and referring and truth and falsity tainment have only approximate boundaries for their appropri-
are functions of the use of the sentence or expression. ate use. he uses and therefore the meanings of various words
Russells claim is that someone at present uttering he king and expressions are subject to expansion and contraction.
of France is wise (S) would a) be making a true or false state- hus, as in On referring, logical appraisal is properly applied
ment, and b) be asserting that there exists at present one and to statements, not sentences. We need, therefore, to approach
only one king of France. Strawson inds Russell wrong on both the relation between formal logic and ordinary language with
counts. For Strawson, the sentence is signiicant, since it could caution. In formal logic, a formula is an expression such that by
be true or false, and it could refer to a particular person. But that substituting words or phrases for the variables we can obtain
does not mean that any particular use of the sentence must be sentences that could be used to make statements. In the formula
either true or false. Ordinarily, a person uttering S presupposes x is a younger son, to substitute Tom for x would yield a
the existence of the king, and his uttering S neither asserts nor sentence that would have meaning, while to substitute he
entails the kings existence. hus, presupposition must be care- square root of 2 would not. hus, some variables would yield
fully distinguished from both assertion and entailment. sentences, but not signiicant statements. We can talk about the
here is, moreover, a need to distinguish rules for referring range of admissible values for a variable, but, unlike formal logic,
from rules for ascribing and attributing. hat distinction roughly in ordinary language there are no precise rules for what is admis-
corresponds to the grammatical distinction between subject and sible. Once again, statements have a contextual component, and
predicate. For Strawson, that irreducible distinction has been that goes beyond the reach of formal logic.
blurred by logicians in their desire to reduce or to eliminate he limits of formal logic are also manifest in its use of sym-
altogether the referring use. He inds a prime example of that bols for truth- functional connectors. Consider particularly the
attempted elimination in Gottfried Leibnizs efort to establish logical symbol . Consider If it rains, the party will be a failure.
individual identity through the use of complete individual con- hat suggests conditions that are neither logical nor linguis-
cepts done in exclusively general terms. Strawson thinks that tic, but are instead discovered in our experience of the world.
Russell also strives to make logic in a narrow sense adequate for Compare the function of the connector in that sentence with the
referring to individuals. function of the same connector in If he is a younger son, then he
It is particularly noteworthy that Strawsons fundamental dis- has a brother. Similar limitations are evident in the use of other
tinctions between sentence and utterance, and between referring connectors. Consider the question of the connector that is the
and describing, are a challenge to the votaries of modern logic. appropriate one for unless.
Consider such nonuniquely referring expressions as all, no, some, Again, when logicians choose the pattern for their represen-
and some are not, that is, the four types of standard form cat- tative rules, they employ common uses drawn from ordinary lan-
egorical propositions: A, E, I, and O. For the modern, only I and O guage, and then proceed to make standard what is common. In
propositions have existential import. In consequence, the mod- this way, a rigidity is imposed that is foreign to the uses of ordi-
ern must deny some traditional doctrines, such as the square of nary language.
opposition and the validity of some forms of the syllogism. he he logician is not a lexicographer but is concerned only with
moderns dilemma is for Strawson a bogus one. We may sim- general principles that are indiferent to subject matter. he dif-
ply say that the question of whether or not the quantiicational iculty here is that sometimes diferent expressions may have the
expressions are being used to make true or false statements just same uses; all, the, and a may have the same use in describing
do not arise except that when the existential conditional is ful- the basic move of the pawn in chess. Similarly, the same expres-
illed for the subject term, then all of the laws of traditional logic sion may have diferent uses; not and not may be used as a
hold good. If we ask a literal-minded and childless man if all of double negation, to emphasize, or to show necessity. he logi-
his children are asleep, he will not answer either yes or no cian would eliminate this complexity and clutter and impose a
because the question simply does not arise. systems rules to cure the perceived deiciencies of ordinary lan-
For Strawson, neither Aristotelian nor Russellian rules give guage. he logician is content with all, some, and no but has no
the exact language of any expression of ordinary language, since use for most or few, despite the usefulness and common employ-
ordinary language has no exact logic. ment of those terms in our ordinary reasoning.
In light of On referring, Strawson sets out in Introduction here are further challenges to any claim for a sovereignty of
to Logical heory (1952) to remedy the failures of modern logi- formal logic over the workings of ordinary language. Consider
cians to address adequately the relationships between formal the notion of logical form as a sort of verbal skeleton that remains
logic and the logical features of ordinary language. He begins by when all expressions, except the selected logical constants, are
noting diferences among the various ways we make judgments eliminated from a sentence that might be used as a statement
about what someone says. To say that a statement is logical is and are replaced by variables. For Strawson, this notion of logi-
ordinarily a commendation. here is a further and more complex cal form is viable, but it may lead us to the mistaken conclusions
575
that a statement must have just one logical form, or that logical Not all have agreed. Quine is notable for his long-standing
form makes the work of the lexicographer superluous, or that disagreement on the issue of singular terms and reference, the
logical features need not take into account the relevant subject issue which is central in Strawsons On referring and perva-
matter, or that validity depends upon form, rather than the other sive in all of his later writings. For Quine, singular terms are at
way about. best superluous, to be eliminated without loss; to be is to be the
he claim may be made that appropriate caution will enable value of a variable (1972, 234). Here is the great divide that, on
us to avoid such mistakes, and that the relation between ordin- Strawsons account, separates him from both Quine and Russell.
ary language and formal logic might be seen minimally as peace- Quines concern that singular terms are ambiguous in their ref-
ful coexistence, and more truly as a separation of powers that is erence is set aside by Strawson on the ground that such terms do
necessary and useful to both sides. For Strawson, the ongoing not refer at all; they are, instead, used by persons to make refer-
diiculty here is that the logician is not content with being con- ence. If the reference is ambiguous, the responsibility rests with
sistent but seeks the completeness of a system. hat ideal is com- the statement maker, not the term. For that matter, ambiguity
promised by the fact that the typical truth-functional connectors has its own uses and, indeed, its own occasional sweetness in the
defy a single ordinary use, and the attendant complexities run ordinary language of daily life.
counter to a mathematical model taken as the paradigm for the Strawson found strong support for his views on ordinary lan-
whole of logic. hat paradigm appeals, but its seduction mis- guage in Wittgensteins transition from his positions in Tractatus
leads, with profound consequences for the study of metaphysics to those in the Philosophical Investigations. his is manifest in
and epistemology. Strawsons review of the latter work in Mind (1954). In sections
Of all of the identiications between the truth-functional con- 38137 of the Investigations, Strawson inds an evident rejection
nectors of formal logic and ordinary words, Strawson inds con- of the logical atomism that characterizes the Tractatus. In that
junction and negation least troublesome, but even here there are earlier work, Wittgenstein had been concerned with the idea of
limitations. By the laws of formal logic, pq and qp are equiva- the genuine names of a language, and with the idea of the simple
lent, but in ordinary language, the order may be essential to the indestructible elements of reality that are only to be named, not
meaning. Most troublesome in the identiication of logical con- described or deined, and which are the meanings of those genu-
nectors and ordinary words is . he falsity of the antecedent ine names. hese primary elements are Russells individuals,
suices in material implication for the truth of the statement, but and the objects of the Tractatus. hose elements are connected
not in the corresponding hypothetical statement. to the belief that the clariication of ordinary language depends
he workings of the class system of modern quantiicational on an analysis in which ambiguous sentences are replaced by
logic further compound these diiculties about the relation ones that relect exactly the logical form of the fact under con-
between ordinary language and truth-functional logic. Modern sideration. Logic then seems to be pure, exact, and general, the
orthodoxy claims that once the older Aristotelian system is essence of the thoughts that mirror the structure of the empirical
cleaned up, it is simply a small part of todays quantiicational world. hat leads us to the illusion that this process of analysis
logic. Conversely, Strawson contends that with only a few res- is inite, that there is a single completely resolved form for every
ervations, the traditional rules dating from Aristotle conform to expression.
the use of words in ordinary language, and indeed avoid some For Wittgenstein in the Philosophical Investigations, and for
of the incongruities of the moderns practice. Standard criticism Strawson, the cure for this illusion is to give up the search for the
of tradition rests on the question of existential import, that is, for very essence of language and to direct our attention, instead, to
the four moods A (all), E (no), I (some) and O (some are not ), the various ways in which language actually functions, In a well-
whether there is a commitment in the tradition to the actual known example, Wittgenstein asks us what is common to all of
existence of the members of the terms. he moderns assump- the proceedings we call games. We cannot say that they must all
tion is that only I and O have such import. But consider some- have something in common, an essence, simply because they
ones saying All Johns children are asleep. Again, if John is have a common name. here is no single element they all share.
fatherless, the existential import question simply does not come here are only family resemblances, a network of overlappings
up. he existence of those children is a necessary precondition of and crisscrosses.
the statement being either true or false. he modern goes wrong What is true of games is true of linguistic activity; there is no
in failing to distinguish sentence from statement. he sentence single use, only family resemblances. here is no exact bound-
may be true or false, that is, meaningful, but in its use as a state- ary of use, although a ixed boundary could be set to serve some
ment, the question of existential import is determined by the particular purpose. A word or a linguistic practice need not be
context. exact in order to be understood and acted upon; stand roughly
In sum, for Strawson there are two kinds of logic, the entail- here may be serviceable enough. To say in dispraise that it is
ment rules of formal logic, which abstract from the time and inexact misses the mark. he demand for absolute and ixed
place of utterance, and the referring rules, which lay down the meanings is senseless. Whether or not there is enough preci-
contextual requirements of what a sentence presupposes. In the sion is determined by whether the concept is used with general
study of those referring rules, we do not ind the elegance and agreement.
system of formal logic, but Strawson does ind a ield of intellec- he consequence is that we are not to provide ordinary lan-
tual study unsurpassed in richness, complexity, and the power guage with a necessary revision and reduction; we are simply to
to absorb. he two kinds of logic are interrelated, and both are describe the ways it works. If we do so, we eliminate the puzzles
necessary in human communication. that arise when language goes on holiday, when we consider
576
Origins of Language
words and sentences in abstraction from their ordinary uses. For Direct Versus Indirect Selection
Wittgenstein, philosophys proper task is simply the assembling he notion that language constituted an evolutionary adaptation
of a series of reminders of actual uses, with the purpose of dispel- sensu stricto in other words, that it arose through some selec-
ling confusions that arise in speciic contexts. tive pressure acting directly upon pre-existing genetic material
hat conception of philosophy is one that Strawson shares up had been around since Darwin but is most cogently expressed
to a point, but he also inds that an appropriate philosophy of lan- by Steven Pinker and Paul Bloom (1990). Arguing against the
guage provides the basis for a descriptive metaphysics, one that suggestion, made by Stephen Jay Gould among others, that lan-
is content to give an account of the actual structure of the world guage could be a spandrel an accidental by-product of other
of our experience. his is set in contrast to a revisionary meta- evolutionary developments these authors pointed out that the
physics that vainly strives to do better. Descartes, Leibniz, and intimate interconnections among the various parts of language
George Berkeley are revisionary; Kant and Aristotle are descrip- parallel a similar interconnectivity in the eye, an object univer-
tive. hat contrast in many ways mirrors the distinction between sally agreed to have evolved through natural selection. Although
ordinary language philosophy and those formalist attempts that their approach entailed a gradual process of evolution, they did
only mar whats well. not address the initial stage of that process nor discuss in detail
possible adaptive pressures (beyond suggesting that competi-
Cliford Brown
tion among humans was probably more inluential than envi-
ronmental factors).
While few if any scholars would deny that selective pressures
Austin, J. L. 1962 . How to Do hings with Words. Oxford: Clarendon. have played a role in the development of many prerequisites for
Brown, C. 2006. Peter Strawson. Montreal: McGill-Queens University
language, some still suggest that the emergence of language itself
Press.
was not speciically selected for. he notion that language was an
Quine, W. V. O. 1972. Methods of Logic. New York: Holt, Rinehart and
invention by human ancestors with expanded brains is still held
Winston.
Russell, B. 1905. On denoting. Mind 14.4: 47993. by some (e.g., Donald 1991). Others propose that laws of form
Strawson, P. 1950. On referring. Mind 59: 2152. afecting brain structure and growth played a more signiicant
. 1952. Introduction to Logical heory. London: Methuen. role than natural selection (Jenkins 2000; see also biolinguis-
. 1954. Wittgensteins Philosophical Investigations. Mind, n.s., tics). Alternatively, a mutation or the modiication of some prior
63.249: 7099. nonlinguistic faculty might have yielded recursion, the capac-
. 1974. Subjects and Predicates in Logic and Grammar. London: ity to generate ininite structures from inite materials (Hauser,
Methuen. Chomsky, and Fitch 2002), and recursion added to prehuman
conceptual structure might have suiced to produce language.
hese last two proposals imply that language emerged in more
ORIGINS OF LANGUAGE or less its current form, without any intermediate stage between
his term and language evolution are sometimes used inter- animal communication and true language. Approaches of
changeably. Here, origins of language will mean the earliest this type would be strengthened if the required laws of form,
emergence of a system structurally distinct from the communi- mutations, or changes of function could be precisely speciied;
cation systems of other animals and having at least some of the this has not yet been done.
attributes of human language; the term will not include further
evolutionary developments leading to the emergence of fully
Selective Pressures
Among those who see language as an adaptation, explanations
modern language or the subsequent diversiication of human
for the selective pressure involved have changed over time. Until
languages (see historical linguistics). Although the topic
the 1980s, it was widely assumed that language arose for pur-
is one that has engaged the human imagination throughout his-
poses of tool making and/or cooperative hunting. However, eco-
tory, an orgy of armchair speculation following Darwin caused
logical studies revealed complex cooperative hunting patterns in
it to fall into disrepute. However, now that advances in a vari-
nonhuman species, while anthropological studies showed that
ety of sciences have made possible more informed (if still inevi-
preliterate peoples made tools and taught tool making largely
tably speculative) approaches, the attention devoted to it has
without using words. Moreover, ethological studies of ape spe-
increased annually, with perhaps an overly ebullient prolifera-
cies showed highly complex societies in which individuals com-
tion of theories.
peted with and sought to deceive and outwit one another (Byrne
he issues may be more sharply deined by considering sepa-
and Whiten 1992). An inluential essay (Humphrey 1976) had
rately three major questions to which the topic gives rise:
already suggested that higher cognitive faculties, including lan-
(1) Was language directly selected for, or an emergent prod- guage, had most likely been generated through intense within-
uct of other faculties? group competition.
(2) If it was selected for, what pressure(s) selected for it? he view that language arose from social intelligence is
nowadays shared by a majority, but it has problems. Social
(3) What form did its earliest emergence take?
competitiveness is far from unique to humans; so why has no
Other issues involve the timing of language emergence, the form of language, however rudimentary, evolved in other pri-
modality it originally employed, and whether or not language mate species? A unique adaptation suggests a unique pres-
evolved directly from prior means of animal communication. sure. Furthermore, there must surely have been a stage when
577
Origins of Language
language was limited to a handful of symbols with which it A synthetic protolanguage faces many diiculties, however
would have been impossible to express any socially signiicant (Tallerman 2007). Whereas a compositional protolanguage
meaning. What, in such a situation, would have reinforced lan- enables basic functions of language, such as creating new infor-
guage use? mation, asking questions, and negating statements, a synthetic
Advocates of some form of social adaptive pressure whether protolanguage allows for none of these. Predication and dis-
for gossip (see grooming, gossip, and language), sexual placement are equally impossible. Other problems arise at the
display, or social manipulation have so far failed to address stage of reanalysis into a compositional system. For instance,
such problems adequately. An alternative proposal is that some unless a given holophrase is equivalent to just one sentence in a
primitive form of language developed for exchanging informa- compositional language, no two people would necessarily agree
tion about food sources among small groups of extractive for- as to the meanings of its analyzed segments; yet if such equiva-
agers (Bickerton 2002). Carcasses of megafauna, in particular, lence exists, a compositional language must already exist, at least
would have required the rapid recruitment of signiicant num- mind-internally so why is a holophrastic stage necessary? he
bers for eicient exploitation. Nobody doubts that language, precise nature of protolanguage has been, and will doubtless
once it had emerged, would have been used for a variety of social continue to be, hotly debated, a debate to which experimental
functions; such functions, in turn, would have expanded lan- evidence will hopefully contribute (Bowie 2006).
guage. he real, and still unanswered, question is exactly what
led to its initial emergence. Other Issues
he issue is rendered still more problematic by the fact that A further controversy revolves around whether language was orig-
words are cheap tokens (Zahavi 1975). Since they take so little inally spoken or signed. Given that sign languages develop as
efort to produce, and since primate species constantly engage in naturally among the deaf as do spoken ones among the hearing,
deception, why would anyone have believed them, and if no one and that the hands of our closest primate relatives are more agile
believed them, who would have persevered in their use? and under more volitional control than their vocal organs, the
notion of a signed protolanguage is not unreasonable and has
Initial Structure been vigorously defended (Corballis 2002). However, even if the
While some (as noted) believe that language has always pos- original modality could be determined (and for all we know, pro-
sessed its present structure, most researchers would probably tolanguage could originally have mixed signs and vocalizations
agree that some simpler form developed irst a stage generally indiscriminately), this would not answer the questions discussed
termed protolanguage (Bickerton 1990) and subsequently grew here or tell us how language came to acquire the properties that
more complex. Until recently, it was assumed that protolanguage, distinguish it from other modes of communication.
like early-stage pidgins, consisted of a small quantity of units Another unresolved issue concerns the timing of emergence.
(roughly equivalent in semantic coverage to modern words) None of the evidence from the fossil record is unambiguous.
that could be concatenated, without any consistent grammatical Endocasts of Homo habilis suggest a developed brocas area,
structure, to form brief propositions; in other words, proto- and this has been taken to indicate an early (~2.5 million years
language was compositional. his view is now challenged by ago) beginning for language. But since, even today, Brocas area
the proposal that protolanguage was synthetic, with holophrastic subserves both linguistic and nonlinguistic functions, we can-
units (like the units of animal communication systems) roughly not know what functions it performed in antecedent species.
the semantic equivalents of complete propositions and not divis- Symbolic artifacts are sometimes used to date language origins,
ible into smaller meaningful units the whole thing means the but while these indicate that language already existed, they can-
whole thing (Wray 2002, 118). not tell how long before their appearance it began. Absent reli-
Defenders of a synthetic system note that (in contrast with able evidence, estimates of when language originated tend to
a compositional system) there would nowhere be any break in be determined by researchers positions on other issues. For
continuity between language and the prelinguistic communica- instance, those who believe that language emerged abruptly
tion system of hominids (assumed to be similar to those of other more or less in its present state favor a recent date coinciden-
primates; see primate vocalizations), which it would at irst tal with the emergence of anatomically modern humans (~140
resemble except for productivity (holophrastic units could be thousand years ago), or even later. Conversely, those who take
multiplied indeinitely). At a subsequent stage, chance phonetic an adaptationist approach argue for a much earlier date, any-
similarities between portions of holophrases would cause the lat- thing up to a few million years ago. he origin of language is
ter to be reanalyzed into wordlike segments; these could then be probably associated with some speciation event, but this issue,
recombined to form a modern, compositional language. like most others, is unlikely to be resolved without new sources
It is claimed that a synthetic protolanguage would be less sub- of evidence.
ject to ambiguities than a compositional one and would be bet- he question of continuity with prelinguistic systems is
ter adapted for manipulation of other group members. Support somewhat clearer. hat language evolved from some prior com-
has come from computational linguists, many of whose municative system was, to Darwin, an article of faith, and some
simulations of language evolution begin with units that repre- subsequent authors have assumed that a commitment to grad-
sent propositional rather than lexical units (Briscoe 2002). hose ual evolution entails such continuity, discounting the possible
who, following Darwin and Otto Jespersen, assume a common capacity of mutations, changes in function, and interactions
origin for language and music are more or less obliged to adopt between diferent faculties to produce evolutionary novelties.
some form of the synthetic hypothesis. But the only plausible continuist scenario is the holophrastic,
578
Overregularizations
synthetic model of Wray, discussed previously. If objections to the irregular form, and that experience eventually settles in favor
this are overcome, the case for continuity could be maintained; of the irregular form.
otherwise, the diferences between language or even protolan- In a crucial 1986 paper, however, D. E. Rumelhart and J. L.
guage and any nonlinguistic system suggest a sharp discontinu- McClelland showed that newly developed connectionist
ity between the two. networks could simulate the rise and decline of overregulariza-
tions in childrens speech without the use of general rules or syn-
Derek Bickerton
tactic symbols like verb. Briely stated, connectionist networks
hypothesize connective paths between the constituent features
of present and past forms. hrough feedback about correctness,
Bickerton, Derek. 1990. Language and Species. Chicago: University of eventually the right feature-to-feature connections get sorted
Chicago Press. out. With skillful design, such networks can simulate the tem-
. 2002. Foraging versus social intelligence in the evolution of pro- poral courses through which children pass without ever using
tolanguage. In he Transition to Language, ed. Alison Wray, 20726.
any general rule statement at all. In such a model, there is an
implicit competition between irregular and past forms, but the
Bowie, Jill. 2006. he evolution of meaningful combinatoriality. Paper
presented at the Sixth International Conference on the Evolution of
competition is really among connections of features. here is no
Language, Rome. general rule, no general reference to verb as a category. Irregular
Briscoe, Ted, ed. 2002. Linguistic Evolution through Language and regular forms are produced by a single overall network pro-
Acquisition: Formal and Computational Models. Cambridge: Cambridge cess. So these are called single process models versus dual process
University Press. (general rule vs. individual lexical entry).
Byrne, Frank, and Andrew Whiten. 1992. Cognitive evolution in pri- Rumelhart and McClellands paper instigated a series of
mates. Man 27: 60927. simulations, arguments, criticisms, and new simulations that
Corballis, Michael. 2002. From Hand to Mouth: he Origins of Language. continue to this day (e.g., McClelland and Patterson 2002;
Princeton, NJ: Princeton University Press. Plunkett and Marchman 1991, 1993; Pinker 1999). Perhaps the
Donald, Merlin. 1991. Origins of the Modern Mind. Cambridge: Harvard
most prominent empirical data were introduced by G. Marcus
University Press.
and colleagues (1992), who analyzed longitudinal studies of four
Hauser, Marc, Noam Chomsky, and Tecumseh Fitch. 2002. he fac-
ulty of language: What is it, who has it, and how did it evolve? Science
children and cross-sectional studies of hundreds more. hey
298: 156979. argued that in any competition account, one would expect that
Humphrey, Nicholas K. 1976. he social function of intellect. In overregularizations would originally occur at a high rate before
Growing Points in Ethology, ed. P. P. G. Bateson and R. A. Hinde, experience wore them down. But in their analysis of the longitu-
30317. Cambridge: Cambridge University Press. dinal and cross-sectional subjects, they found that overall pre-
Jenkins, Lyle. 2000. Biolinguistics: Exploring the Biology of Language. school year rates seemed very low, around .04 to .06 (or .02 to .10,
Cambridge: Cambridge University Press. depending on the method).
Kirby, Simon, and Morton H. Christiansen, eds. 2003. Language Evolution. his means, they argue, that in actuality, children probably
Oxford: Oxford University Press. A collection of position papers by know the irregular form is correct as soon as they learn it. his
leading scholars in the ield.
knowledge is available because an innate general heuristic
Pinker, Steven, and Paul Bloom. 1990. Natural language and natural
called blocking tells children that if two forms are possible but
selection. Behavioral and Brain Sciences 13: 70726.
Tallerman, Maggie. 2007. Did our ancestors speak a holistic protolan-
only one is heard, choose the heard one. he actually heard
guage? Lingua 117: 579604. irregular form thus has an innate heuristic preferred status.
Wray, Alison. 2002. Dual processing in protolanguage: Performance Overregularizations only occur if the child does not know the
without competence. In he Transition to Language, ed. A. Wray, irregular form, or if the child temporarily cannot remember the
11337. Oxford: Oxford University Press. irregular form and the regular rule intrudes itself. Such retrieval
Zahavi, Amotz. 1975. Mate selection a selection for a handicap. errors are posited to be inherently rare, for some unstated
Journal of heoretical Biology 53: 20514. reason.
he blocking hypothesis requires a general reference to alter-
native rules and, in practice, to regular rule versus irregular indi-
OVERREGULARIZATIONS
vidual lexical patterns. So blocking contradicts connectionist
Overregularizations like runned and mans have played a major formulations in many ways. If there is no competition, connec-
role in the language development literature for more than 40 years tionist models cannot be correct, as they presuppose competi-
(see also childrens grammatical errors; morphology, tion of some sort. If blocking in particular is correct, it requires
acquisition of; syntax, acquisition of). Once brought statement at general symbolic and rule levels, and so network
into focus, overregularizations comprised prime examples of formulations are inadequate.
the way in which childrens use of general grammatical rule M. Maratsos (2000), however, has argued against the empir-
knowledge (the regular past tense rule of adding -ed) could pro- ical conclusions of Marcus and colleagues (1992). Using sam-
ductively overwhelm the word-speciic knowledge they gained pling arguments, he notes that for frequent irregular verbs,
form actual input, a prime example of using rules to go beyond which dominate overall tabulations, even in a competition
the input. Less attention was paid to the way in which children model children would probably hear hundreds of correct inputs
would get rid of overregularizations, but results indicated that within a week or few weeks after the competition started, and
there is a verb-by-verb competition between the regular rule and so overregularizations would fall to near zero very quickly; the
579
Overregularizations Parable
result would be an overall rate of near zero in a sample of two to Rumelhart, D. E., and J. L. McClelland. 1986. On learning the past tenses
three years. Our samples are so small (usually an hour a week) of English verbs. In Parallel Distributed Processing: Explorations in
that they would fail to catch these occurrences. Only less fre- the Microstructure of Cognition. Vol. 2: Psychological and Biological
quent verbs, discounted by Marcus and colleagues, might show Models. Ed. J. L. McClelland, D. E. Rumelhart, and the PDP Research
Group, 21671. Cambridge, MA: Bradford Books/MIT Press.
evidence of strong overregularization. In fact, that they do was
shown for R. Browns two low-overregularizing subjects Adam
and Sarah (see, e.g., 1973). For Adam, for example, the average
overregularization rate was a strong 55 percent for his 21 lower-
frequency verbs. he same rate was found even in samples after
P
the child irst produced the correct irregular form of a verb.
Arguments from sampling considerations indicated that such PARABLE
overregularizations were still persisting after tens or even hun-
dreds of uses. Recent work from a more intensively recorded Standard deinitions, such as the one given in the Oxford English
subject, Peter (Maslen et al. 2004), has strongly supported these Dictionary, conceive of parable as a literary term; it is said to be
analyses and extended them to noun plurals. hese data indi- the expression of one story through another. Literary historians
cate that overregularizations do often appear frequently after have modiied this conception by placing limits on the kind of
the irregular past is known, contrary to blocking. Our samples story that counts as parable, attempting to distinguish it from, for
restrictions just make it diicult to capture them for the more example, fable or allegory.
frequent irregular verbs whose numbers dominate overall here are, however, even among literary scholars, some who
rates. see parable as a much larger phenomenon, belonging not merely
Suppose these analyses do indeed indicate that the low-rate to expression and not exclusively to historical genres but, rather,
blocking account is incorrect. Do they also show that the connec- as C. S. Lewis (1936, 44) observed, to mind in general. (See also
tionist account is therefore correct? Actually, they only indicate Louis MacNeices discussion of literary critical perspectives on
that a competition process of some sort is involved. As noted, parable in MacNiece [1963] 1965, 5.)
older rule-based models also assumed a competition between For the language sciences, parable is not only, or even chiely,
regular rule and individual entry. he current association of a kind of story; it is not an expression at all but, rather, a men-
competition with connectionism and with non-rule models thus tal faculty that allows the human mind to integrate two concep-
relects current disputes, not the basic analytic problem. he tual stories or narratives into a third story, thereby creating a
conlict between connectionist and rule-based approaches will conceptual blending network that has emergent meaning.
thus have to be resolved ultimately, if it can be, using other data Straight history, or the observation of human interaction,
and arguments. often can serve as the material for such parabolic blending. For
example, Sun Tzus he Art of War treats 13 aspects of warfare.
Michael Maratsos It has been studied in the West by military strategists since the
eighteenth century. Written in the sixth century b.c. in China, it
WORKS CITED AND SUGGESTIONS FOR FURTHER READING precedes by a couple of millennia the origin of modern business
Brown, R. 1973. A First Language: he Early Stages. Cambridge: Harvard management. But in the 1980s, it underwent extensive para-
University Press. bolic rendering in numerous books and articles for the purpose
Maratsos, M. 2000. More overregularizations after all: New data and dis- of ofering guidance to twentieth-century graduate students of
cussion on Marcus, Pinker, Ullman, Hollander, Rosen & Xu. Journal business and investment on how to conduct their professional
of Child Language 27: 183212. lives.
Marchman, V., and E. Bates. 1994. Continuity in lexical and morpho-
Parable frequently blends two stories that have strong con-
logical development: A test of the critical mass hypothesis. Journal of
licts in their content. It is a scientiic riddle why human beings
should be able to activate two conlicting stories simultaneously,
Marcus, G., S. Pinker, M. Ullman, M. Hollander, T. Rosen, and F. Xu.
1992. Overregularizations in language acquisition. Monographs of given the evident risks of mental confusion, distraction, and
the Society for Research in Child Development 57, serial no. 228. error. Yet, uniquely among species, human beings can evidently
Maslen, R. J., A. L. heakston, E. V. Lieven, and M. Tomasello. 2004. not only activate fundamentally conlicting stories simultane-
A dense corpus study of past tense and plural overregulariza- ously and construct connections between them but also blend
tion in English. Journal of Speech, Language, and Hearing Research them to create emergent meaning. his ability to blend two con-
47: 131933. ceptual arrays with strong conlicts in their framing structure is
McClelland, James, and Karalyn Patterson. 2002. Rules or connections central to higher-order human cognition and is a hallmark of the
in past-tense inlection: What does the evidence rule out? Trends in cognitively modern human mind. It is known as double-scope
blending (Fauconnier and Turner 2002).
Pinker, S. 1999. Words and Rules: he Ingredients of Language. New
Consider a parable from the Fourth Gospel. In John 10:1118,
York: Basic Books.
Jesus presents Himself as the good shepherd, who lays down
Plunkett, K., and B. Marchman. 1991. U-shaped learning and frequency
efects in a multi-layered perceptron: Implications for language acqui- His life for the sheep, in contrast to the hired hand, who does
sition. Cognition 28: 73193. not care for the sheep and lees in the face of the wolf. He says
. 1993. From rote learning to system building: Acquiring verb mor- the Father loves Him because He lays down his life and that no
phology in children and connectionist nets. Cognition 48: 3559. one takes it from Him. Rather, He has the power to lay it down
580
Parable Paralanguage
and take it up again. he clash between the story of the shep- . 2001. Cognitive Dimensions of Social Science: he Way We hink
herd and the blend Jesus proposes is astonishing. It is quite About Politics, Economics, Law, and Society. New York: Oxford
implausible that a shepherd would choose to die defending the University Press.
sheep, because then the sheep would be without a defender.
Yet this consequence is not projected to the blend: he actual
shepherd cannot return after being killed to look out for the
PARALANGUAGE
lock, but in Jesuss blend, He can. he emergent struc- Nuances, connotations, and innuendos, which are integral char-
ture in the blend is crucial: Jesuss narrative blends dying acteristics of verbal communication, are given the vague term
with physical manipulation of an object. (Physical manipula- paralanguage. hese meanings arise from sources both within
tion is at the root of human understanding. See Chapter 4 of and outside of standard linguistic structure. Linguistic elements
Turner 1996, Actors Are Movers and Manipulators.) In the words, word order, semantics, grammar can be utilized
story of manipulation, we can lay down an object and pick it for paralinguistic communication. hese combine with variations
back up. Blending manipulation of a physical object with the of speech melody in ways that often defy structural description.
state of being alive or dead, Jesus achieves the remarkable Paralanguage (as the term implies) both draws on and lies over
ability of self-revival. the known and describable ortholinguistic levels of phonetics,
As discussed in Chapter 4 (Analogy) of Cognitive Dimensions phonology, morphology and lexicon, syntax, and seman-
of Social Science (Turner 2001), almost all the mental achieve- tics. All of these elements can be harnessed for paralinguistic
ments analyzed by analogy theorists as analogy involve con- communication, as is well known from baby talk, from connota-
siderable unrecognized blending. In general, analogy involves tional meaning diferences in terms such as skinny, slim, slender,
dynamic forging of mental spaces, construction of connec- and from word-order choices such as Herman Melvilles hat
tions between them, and blending of the mental spaces to create inscrutable thing is chiely what I hate. In addition, emotion,
a conceptual integration network of spreading coherence, whose attitude, intention, mood, psychological state, personality,
inal version contains a set of what are recognized, after the fact and personal identity can be communicated without referring
in the rearview mirror, as systematic, even obvious analogical to words. Because of the power of the intonational contribu-
connections. But those analogical connections are more often tion to paralanguage, the notion of two channels in the speech
the outcome of conceptual blending than its preconditions. Put signal has been invoked, but their intimate interplay has been
diferently, what is commonly discussed as analogy manifests emphasized (Bolinger 1964). Words can communicate emotions,
the faculty for parable. but when prosody does so using a diferent channel, the paralin-
It is also important to recognize that a parable is not, in gen- guistic intent overrides the ortholinguistic content, as in Im not
eral, a conceptual metaphor for understanding one con- angry! spoken with increased pitch, amplitude, and rate.
ceptual domain in terms of another. Consider 2 Samuel 12. he Much of paralanguage is carried over longer stretches of
prophet Nathan creates an elaborate blend in which a rich man utterance than the phonetic element or the single word. Lexical
is blended with King David, a poor man is blended with Uriah and syntactic choices may interact with intonational features
the Hittite, Uriahs wife Bathsheba is blended with a favored ewe with a cumulative efect. Formulaic and nonliteral expressions
lamb, and there is a traveler who comes to dinner. he point of may be called into play. It has come to my attention that your
the complex blend is that David has done wrong. he source stonewalling is holding up the works contains conventional and
and target are complicated, drawing on many conceptual metaphoric utterances that build to a message more fraught
domains, and the principal connection is that in both of them, with paralinguistic content than Ive learned that your hesita-
one man abuses another and deserves punishment. No general tion is contributing to a delay. Although subtle contrasts can be
conceptual metaphor provides this set of cross-space connec- conveyed on short utterances (see Nine ways of saying yes in
tions. Most of them are not metaphoric. Crystal 1995), paralanguage prefers a larger canvas. Repetition of
Parable as a form of literary expression might be of interest words (Shakespeares a little, little grave) may have a powerful
to historians, anthropologists, and critics. But parable as a spe- paralinguistic efect. Movement from low to high pitch across an
cies-speciic mental faculty that can activate, connect, and blend intonational unit displays surprise or amazement; temporal units
sharply conlicting stories to produce new emergent meaning is are stretched to express sadness or disappointment; increased
a far larger and more fundamental topic, posing one of the cen- intensity signals aggression or thematic emphasis; voice quality
tral riddles of the cognitive and language sciences. becomes creaky to communicate victimization or breathy to
Mark Turner signal excitement.
Prosody, a major vehicle of paralanguage, can be decom-
WORKS CITED AND SUGGESTIONS FOR FURTHER READING posed into measurable elements: timing, pitch, amplitude, and
voice quality. hese measures combine into complex patterns,
Fauconnier, Gilles, and Mark Turner. 2002. he Way We hink: Conceptual
such that associating acoustic cues with paralinguistic meanings
Blending and the Minds Hidden Complexities. New York: Basic
is far from straightforward. John didnt drive the car can be
Books.
intoned with sadness, happiness, fear, or disgust, and may enfold
Lewis, C. S. 1936. he Allegory of Love. Oxford: Oxford University Press.
MacNeice, Louis. [1963] 1965. he Varieties of Parable [he Clark attitudes such as incredulity, relief, perplexity, or amusement.
Lectures]. Cambridge: Cambridge University Press. Contradiction or denial, and conversational presumptions, such
Turner, Mark. 1996. he Literary Mind: he Origins of hought and as sincerity and truthfulness, are carried by phrasal intonation.
Language. New York: Oxford University Press. Take a common paralinguistic trope, sarcasm (see irony), in
581
Paralanguage Parameters
hat was a good efort. We know sarcasm when we hear it, but PARAMETERS
exactly what in the signal conveys that the speaker is intending
he term parameter is used in linguistics on analogy with its
to communicate the opposite of the usual lexical meanings is dif-
usage in mathematics and engineering. In mathematics, the
icult to specify. In one version, the sarcastic utterance utilizes
parameters of a function are those aspects of the function that
higher pitch and greater amplitude on the irst word followed
are held constant when deining a particular function, but which
by falling intonation, pharyngeal voice quality, tensed vocal
can vary in a larger context so as to characterize a family of sim-
tract, and spread lips. While morphological, lexical, and syntac-
ilar functions. For example, the function for a line in analytic
tic meanings can be structurally analyzed using units, features,
geometry is f(x) = mx + b, with x the variable and m and b param-
and rules, paralinguistic meanings constitute a brew of unstable,
eters of the function (the slope and the y-intercept). In the dei-
leeting, and subjective qualities. hese paralinguistic qualities
nition of any one line, the parameters m and b are held constant,
shade into one another, and they impinge on purely linguistic
while the value of x varies, giving the diferent points on the same
uses of prosodic contrasts, as in question and statement intona-
line. In a broader context, however, the parameters m and b can
tion. he auditory-acoustic cues that comprise paralanguage are
vary so as to deine a family of similar functions: the set of all
graded, in that they are not perceptually allocated by the listener
lines. In the same way, parameters in linguistics are properties
into discrete, contrastive categories as are the acoustic signals
of a grammatical system that are held constant when character-
for phonetic and lexical elements. Using deft combinations and
izing one particular human language, but which are allowed to
placements of prosodic cues, a speaker can communicate more
take diferent values in a broader context so as to characterize
or less fear, gradations of perplexity, and degrees of denial.
a whole family of possible human languages. he idea that the
he development of the pragmatics of communication,
observed variation in human languages can be understood as
a branch of linguistics that studies language use in conversation
the ixing of certain parameters within an otherwise innate and
(see conversation analysis), jokes (see verbal humor),
invariant system of principles (universal grammar) is most
and storytelling, has advanced understanding of paralanguage.
commonly associated with the Chomskyan approach to formal
Communicative elements such as turn-taking, inference, and
generative linguistics (see generative grammar). As a result,
theme (topic of the discourse), and how they are signaled by the
this approach is sometimes called the principles and param-
speaker and comprehended by the listener, are investigated. he
eters theory. he idea is, however, a very general one, and it
ields of prosody and pragmatics have provided another valuable
can also be used in the context of other views about the nature of
impetus for the productive study of paralanguage: investigation
the human language faculty.
of the communicative competence of right hemisphere lan-
his notion of linguistic parameters was introduced into lin-
guage processing in humans. While it has long been known
guistic theory by Chomsky (1981) and Rizzi (1982), during the
that the left hemisphere modulates language processing,
government and binding period. he paradigmatic case
studies of pragmatics and prosody indicate involvement of the
was the pro-drop parameter (or null subject parameter). It was
right hemisphere in processing emotions and attitudes, infer-
observed that languages like Spanish and Italian difer from
ence and theme. he notion of two channels, ortholinguistic and
French and English in several ways that appear to be interre-
paralinguistic, is supported by the model that allocates process-
lated. First, Spanish and Italian allow the subject pronoun of a
ing to left and right hemispheres, respectively. Paralinguistic
inite clause to be omitted, whereas French and English do not:
nuances are intimately woven into the propositional message, so
much so that synthesized speech is often judged as unpleasant. (1) a. Verr. (Italian: He/she will come)
A goal of speech synthesis is to produce more natural-sounding b. *Came. (English)
speech, which means infusing paralanguage, a challenging but
Second, Spanish and Italian allow the subject to come after the
worthy goal.
verb as well as before it, whereas French and English generally
Diana Van Lancker Sidtis do not:
WORKS CITED AND SUGGESTIONS FOR FURTHER READING (2) a. Verr Gianni (Italian, Will-come Gianni)
b. *Came John (English)
Bolinger, Dwight. 1964. Around the edge of language: Intonation. In
Intonation, ed. D. Bolinger. Harmondsworth, UK: Penguin Books. hird, the subject of an embedded sentence in Spanish and
Crystal, David. 1995. Nine ways of saying yes. he Cambridge Italian can be moved to the beginning of the sentence as a whole,
Encyclopedia of the English Language, ed. David Crystal, 248. even when there is an overt complementizer, whereas in French
Cambridge: Cambridge University Press. and English some sort of accommodation is needed in sentences
Kreiman, Jody, Diana Van Lancker Sidtis, and Bruce Gerratt. 2005. like these:
Perception of voice quality. In Handbook of Speech Perception, ed.
David Pisoni and Robert Remez, 33862. Maldon, MA: Blackwell. (3) a. Chi credi che verr? (Italian)
Van Lancker Sidtis, Diana. 2007. he relation of human language to Who you-think that will-come
human emotion. In Handbook of the Neuroscience of Language, b. *Who do you think that came? (English)
ed. Brigitte Stemmer and Harry Whitaker. San Diego, CA: Academic
Press. Although these are clearly three distinct properties of the lan-
Williams, C., and K. Stevens. 1972. Emotions and speech: Some guages in question, they have a common theme: Informally put,
acoustical correlates. Journal of the Acoustical Society of America French and English require that there be an overt noun phrase
52.4B: 123850. in the canonical subject position immediately before the inite
582
Parameters
verb, whereas Spanish and Italian do not. his diference in the parameters that are intended to account for the large-scale dif-
syntax of subjects was also related to a morphological differences among the major classes of languages discovered by
ference: he agreement morphology on the inite verb is rich typology. he head directionality parameter is a parameter of
enough to uniquely identify which pronoun would be in the this sort. Another early example was Ken Hales (1983) noncon-
subject position in Spanish and Italian, whereas in French and igurationality parameter, which was designed to explain why
English it is not. he universal syntactic condition, then, is that Australian languages like Warlpiri tolerate free word order and
inite clauses require subjects (the extended projection prin- discontinuous phrases, whereas languages like English do not.
ciple); the parameter concerns exactly what kind of subject is Similarly, parameters have been proposed to capture the difer-
necessary to fulill this condition. In Italian and Spanish, the rich ence between ergative languages (like Basque and Eskimo), in
agreement on the verb means that null or displaced subjects are which the object of a transitive clause is treated in some respects
permissible because (roughly) much of the information con- like the subject of an intransitive clause, and accusative lan-
cerning the sort of subject it was is locally available on the inite guages (like English and most Indo-European languages), in
verb. In French and English, the agreement on the verb is of little which all subjects are treated similarly. hese proposals range
help, and so an overt subject in the canonical subject position from radical diferences in how syntactic structure is initially
is required. A parameter, then, is a way of attributing a uniied constructed (Marantz 1984) to relatively minor diferences in
theoretical account of the systematic diferences that distinguish how case and agreement morphology are assigned in a simple
one class of languages from another. sentence (Bittner and Hale 1996). Mark Baker (1996) proposes
While the pro-drop parameter was the irst important param- a polysynthesis parameter that attempts to give a uniied charac-
eter to be proposed, it is by now not considered the best case. A terization of the diference between many native American lan-
look at a wider range of languages both nonstandard dialects guages, in which a large part of the expressive burden is placed
of the Romance languages and languages from other families on verbal morphology, and languages like English, in which the
quickly showed that the properties in (1)(3) do not correlate primary expressive burden is borne by syntactic combination.
with one another as closely as was thought (Jaeggli and Sair Taken together, some set of parameters such as these might
1989). his implies that the pro-drop parameter as it was origin- characterize the major linguistic types we observe.
ally conceived is either false or highly oversimpliied. Other parameters operate on a smaller scale, deining the dif-
hat does not mean that the idea of a parameter was ill-con- ferences between historically-related languages or dialects.
ceived, however. he current paradigmatic example is what is he pro-drop parameter was a parameter of this sort, distinguish-
sometimes called the head directionality parameter (terminolo- ing French from Italian. Another example is the parameter that
gies vary). his can be stated as an open factor in the principles determines whether the subject of a clause moves from its origi-
of phrase structure (see x-bar theory). Roughly put, nal position inside the verb phrase to the highest position in the
when a word-level category x merges with a phrase Y to create clause or not; this accounts for the diference between English,
a phrase of type X, there are two ways that the elements can be which has subject-inite verb-object word order, and Celtic lan-
ordered: he order can be X-Y within XP, or it can be Y-X. Setting guages like Welsh, which have inite verb-subject-object word
the parameter in the irst way gives head-initial languages like order (Koopman and Sportiche 1991). Jean-Yves Pollock (1989)
English, in which complementizers come before embedded argues that there is a parameter that says that verbs move to a
clauses, tense particles come before verb phrases, verbs come higher position in French than they do in English; this accounts
before their objects, prepositions come before their objects, and for a cluster of subtle word-order diferences having to do with the
so on: placement of verbs, negation, and adverbs in the two languages
(e.g., John kisses often Mary is normal French but bad English).
(4) John will think that Mary showed a picture to Sue.
A third example is Jonathan Bobaljik and Dianne Jonass (1996)
Setting the parameter in the second way gives head-inal lan- proposal that some Germanic languages have an extra posi-
guages like Japanese, in which complementizers come after tion available for subjects that other Germanic languages dont
embedded clauses, tense particles come after verb phrases, verbs have; this makes sentences like here have some trolls eaten
come after their objects, prepositions come after their objects, Christmas pudding possible in some Germanic languages but
and so on: not others, among other things. (See Baker 2001 for a general
overview of these parameters and several others.)
(5) Taroo-ga Hiro-ga Hanako-ni syasin-o miseta to omotte iru.
In the early days of parametric theory, it was thought that
Taro SUBJ Hiro SUBJ Hanako to picture OBJ show that think-
virtually any syntactic principle could be parametrized, and
ing be
parameters were proposed that were relevant not only to X-bar
Taro is thinking that Hiro showed a picture to Hanako.
theory but also to movement, the theory of binding, and even
In this way, a parametric theory can account for many of the the projection principle. On that view, there would be a mod-
most robust Greenbergian universals (Greenberg 1963; Dryer est number of parameters (dozens or perhaps hundreds), each
1992) concerning word order in an elegant way. hese two of which would have a relatively large impact on the language
very common and stable language types fall out of a simple and generated. But this view has been questioned in more recent
unity choice that is made in the precise formulation of a univer- work. Hagit Borer (1984) proposed almost immediately that the
sal principle of language. syntactic principles themselves are invariant, and what is param-
Parameters vary widely in the range and scope of the efects eterized is the features associated with individual lexical items.
that they are supposed to capture. Some theorists have proposed Rather than saying that the syntax of French is diferent from
583
Parameters Parietal Lobe
the syntax of English in that verbs raise to the tense/inl node in Jaeggli, Osvaldo, and Kenneth Sair. 1989. he null subject parameter
French, this view says that the lexicon of French is diferent from and parametric theory. In he Null Subject Parameter, ed. Osvaldo
the lexicon of English in that French has tenses that require the Jaeggli and Kenneth Sair, 144. Dordrecht, the Netherlands: Kluwer.
verb to move into them, whereas English does not. Kayne, Richard. 2005. Some notes on comparative syntax, with spe-
cial reference to English and French. In he Oxford Handbook of
Borers view has the conceptual advantage that it largely
Comparative Syntax, ed. Guglielmo Cinque and Richard Kayne, 369.
reduces the learning of syntax to the learning of individual lexical
New York: Oxford University Press. A detailed discussion of general
items. It also suggests that there might be thousands of param- considerations and very small-scale parameters.
eters, rather than dozens, because each distinct lexical item is Koopman, Hilda, and Dominique Sportiche. 1991. he position of sub-
a possible locus of parametric variation (see especially Kayne jects. Lingua 85: 21158.
2005). Each individual parameter, however, will afect only a Marantz, Alec. 1984. On the Nature of Grammatical Relations. Cambridge,
relatively narrow part of the grammar since it is limited to those MA: MIT Press.
structures in which a particular item appears. his view is com- Pollock, Jean-Yves. 1989. Verb movement, universal grammar, and the
patible with the fragmentation of the pro-drop parameter, which structure of IP. Linguistic Inquiry 20: 365424.
is now seen as a cluster of small-scale distinctions, each of which Rizzi, Luigi. 1982. Issues in Italian Syntax. Dordrecht, the
can vary independently of the others, giving one the lexibility to Netherlands: Foris.
describe the various intermediate patterns found in the dialects

of southern France and northern Italy. As a result, Borers view PARIETAL LOBE
has been championed by Richard Kayne (2005) as the one that
is supported by his methodology of comparing closely related Anatomy
dialects. he parietal lobe is situated superior to the occipital lobe and
Baker (1996, 2008), however, argues that there may also be posterior to the frontal lobe. More speciically, it extends from the
syntactic parameters in more or less the original sense, in addi- central sulcus anteriorly, to the imaginary boundary of the pari-
tion to the ine-grained lexical parameters. Taken strictly, Borers etal-occipital issure posteriorly, to the sylvian issure (peri-
view does not really account for the unity of the head directional- sylvian cortex) inferiorly. he parietal lobe(s) can be further
ity parameter. Even the smaller-scale parameters do not seem to subdivided into three main areas. hese include: 1) the soma-
vary lexical item by lexical item. For example, it is not the case tosensory strip, also known as the postcentral gyrus (Brodmanns
that some tenses trigger verb-adverb-object order in French and area [BA] 1, 2, 3, 43), 2) the superior parietal lobule (BA 5), and 3)
others do not; rather, all the diferent tenses trigger that order in inferior parietal lobule (includes BA 39-angular gyrus and 40-su-
French, whereas none of the English tenses do. Perhaps, then, the pramarginal gyrus). he latter two areas are separated by the
proper locus of much parameterization is neither the individual intraparietal sulcus (see Figure 1). Medially, the parietal lobe(s)
lexical item nor the syntactic principle, but rather a natural class comprises the postcentral gyrus extension of the paracentral lob-
of lexical items. How to state this and what its implications are ule, the precuneus, and part of the cingulate gyrus (see Figure 2).
continue to be topics of discussion.
Physiology
Mark C. Baker
here are two parietal lobes, one in each hemisphere, which are
divided functionally on the basis of dominance. he dominant
lobe is typically the left one and the nondominant the right.
Baker, Mark. 1996. he Polysynthesis Parameter. New York: Oxford here are many diferent non-language functions performed
University Press. by the parietal lobe, for example, perception and localization of
. 2001. he Atoms of Language. New York: Basic Books. A broad touch, pressure, pain, and temperature on the opposite side of
overview of the notion of a parameter in linguistic theory, written for
the body, and visuospatial processing. he variety of language-
a general audience.
related functions associated with the parietal lobe will be espe-
. 2008. he macroparameter in a microparametric world. In
cially highlighted in the context of non-language functions.
he Limits of Syntactic Variation, ed. heresa Biberauer, 35174.
Amsterdam: John Benjamins. Provides an overview and argument for he dominant parietal lobe is involved primarily in integrat-
large-scale parameters. ing sensory information to create a particular perception. he
Bittner, Maria, and Kenneth Hale. 1996. Ergativity: Toward a theory of a inferior portion of this lobe, particularly the supramarginal gyrus
heterogeneous class. Linguistic Inquiry 27: 531604. and angular gyrus, is involved in structuring information for
Bobaljik, Jonathan, and Dianne Jonas. 1996. Subject positions and the reading and writing (see writing and reading, neurobiol-
roles of TP. Linguistic Inquiry 27: 195236. ogy of), performing mathematical calculations, and perceiving
Borer, Hagit. 1984. Parametric Syntax: Case Studies in Semitic and objects normally.
Romance Languages. Dordrecht, the Netherlands: Foris. Damage to the dominant lobe can result in apraxia (motor
planning deicit) aphasia (language disorder), agnosia (abnormal
the Netherlands: Foris. he reference that started it all.
perception of objects), and sensory impairment (e.g., touch, pain).
Dryer, Matthew. 1992. he Greenbergian word order correlations.
Lesions to the inferior portion of the dominant lobe involving the
Language 68: 81138.
Greenberg, Joseph. 1963. Universals of Language. Cambridge, MA: MIT angular gyrus can result in Gerstmanns syndrome, which is char-
Press. acterized by leftright confusion, diiculty pointing to named
Hale, Kenneth. 1983. Warlpiri and the grammar of nonconigurational ingers (inger agnosia), impaired writing ability (agraphia), and
languages. Natural Language and Linguistic heory 1: 549. inability to perform mathematical calculations (acalculia).
584
Parietal Lobe
Figure 1.
Figure 2.
he nondominant parietal lobe, however, is involved in a dif- of the research involving humans with brain damage and animal
ferent set of functions that are mostly non-language related. In studies using rhesus monkeys. Sir William Turner (1873) is con-
particular, this region is responsible for visuospatial functions as sidered the irst to describe in detail the intraparietal sulcus (BA
it receives and integrates input from the visual system (occipital 40). Before being scientiically discredited, phrenologists pro-
lobe) to make sense of the spatial order of the world around us. posed that damage or disease to the parietal lobe(s) was a major
M. A. Eckert and colleagues (2005) found that Williams syndrome, cause of melancholia (depression), and parietal eminence was
whose phenotype (visuospatial deicits) and genotype (deletion believed to relate to cautiousness (Hollander 1902). Due to the
on chromosome 7) are well characterized, is linked to superior wide variety of symptoms reported from brain damage studies,
parietal impairment. Williams syndrome, thus, may provide a the parietal lobes were accurately but vaguely thought to be a
valuable system for understanding parietal lobe function. general association area combining all the information from
Damage to the right parietal lobe can result in a constellation various functions, speciically visuospatial and attention; how-
of deicits involving spatial and body relations. Bilateral lesions ever, details as to how this function occurred physiologically
may result in Balints syndrome, which afects both visual atten- were lacking until modern times.
tion and motor skills. If both the parietal and temporal lobes are Josef Gerstmann ([1924] 1971) irst described inger agnosia
damaged, memory impairments and personality changes may in a patient with a left parietal stroke, and the efects of various
result. Speciically, if this damage occurs on the dominant (left) lesions on the parietal cortex were identiied and cataloged in
side, it may result in verbal memory deicits and diiculty in the detail by John McFie and Oliver L. Zangwill (1960). Much of the
retrieval of strings of numbers. If the damage is on the right side, early parietal research was pioneered by scientists Macdonald
it will afect nonverbal memory functions and will signiicantly Critchley and later Juhani Hyvarinen in their respective works
impair personality. he Parietal Lobes (1953) and he Parietal Cortex of Man and
Monkey (1982).
History A great deal of neurological investigation has been conducted
For more than a century, the exact role of the parietal lobe has on rhesus monkeys, and there appears to be signiicant overlap
been debated by neuroanatomists and psychologists, with much between the human and monkey parietal lobe in both function
585
Parietal Lobe
and form, though it is noteworthy that diferences have been however, recent evidence (Wagner et al. 2005) suggests that the
identiied, such as larger parietal cortex, asymmetry of the lobes, parietal lobes may have a role to play in it as well. he role of
and more neural subdivisions in humans (Kolb and Whishaw declarative memory in language has been attributed to word
1990). learning or vocabulary storage. Wagner and his colleagues sug-
gest three theories explaining the contributions of the parietal
Language lobe in episodic memory retrieval. hey highlight that, indeed,
Continuing the classical connectionist tradition of Hugo the parietal lobe does not play an independent role in this
Liepmann, Norman Geschwind (1965) championed the simpli- retrieval; rather, it mediates the major pathways in which the
ied yet controversial position that the parietal lobe acts as the MTL subserves episodic memory.
association area of association areas. Neural tissue damage to In sum, a number of neurolinguistic positions have been
this area often results in the classical disconnection syndromes, taken from available neuropsychological and brain-mapping
for example, apraxia and others. data (Stein 1989). hese include the parietal lobe as a) a senso-
Aleksandr Romanovich Luria (1973) considered the parietal rimotor association area such that the posterior parietal cortex
cortex one piece in his two-part model of mental activity, stat- (PPC) becomes a junction of somaesthetic and visual informa-
ing that it was important for understanding reception, analysis, tion that interacts in a complex fashion, b) a sensorimotor inte-
and storage of information. Lesions to the left parietal lobe were gration area, which is very similar to the previous theory except
understood to result in aferent motor aphasias (diiculty in for the addition of an actual integral function, and c) a com-
inding the correct articulatory positions for speciic phonemes), mand apparatus that is actually able to initiate a motor activity
particularly lesioned primary and secondary sensory areas from the accumulated sensory information. he authors propose
afecting speech motor control and lesioned tertiary sensory that although it is possible that the parietal lobe is involved in
area resulting in aphasia (the loss of speech production and/or some motor processes, it is more likely that the process is one
comprehension). of maintenance than of initiation and is d) a region for directing
Recent research, such as that of Gregory Hickok (2000), sug- attention to stimuli of interest. Here, the PPC and the pathways
gests that the inferior parietal lobe serves as the connection it receives are postulated to direct attentional focus to the tar-
between phonological representations and motor control for get stimulus while coordinating and communicating with the
those representations, that is, the auditory-motor interface, inferotemporal cortex. J. Stein advocates that the PPC does not
which is part of a larger network of interfaces and systems have a single narrow neurocognitive focus; nevertheless, it could
subserving language function. Marco Catani, D. Jones, and have a common underlying function that integrates its multifac-
H. Dominic (2005) in a signiicant paper, conirm the analysis eted involvement in cognitive as well as automatic linguistic and
that includes the inferior parietal lobe in the use and possibly extralinguistic processes.
the acquisition of language via a new circuit connecting the Overall, the parietal lobe is crucial for several language func-
traditional language areas of broca and wernicke. It has tions, most importantly, naming, semantic processing, and
been labeled Geschwinds territory in honor of Geschwinds phonological shaping of words, as well as reading and writing.
original proposal that the parietal lobe is critical to language In addition, it mediates attention and memory, both essential at
function. diferent levels of language processing.
In sum, the left parietal cortex has particular areas that are
Yael Neumann, Hia Datta and Daniel P. Rubino
responsible for various linguistic functions. However, there are
other extralinguistic processes that the parietal lobe is known for
as well.
Baddeley, Alan, Susan Gathercole, and Costanza Papagno. 1998. he
Extralinguistic Processes phonological loop as a language learning device. Psychological Review
105: 15873.
ATTENTION. he function of the parietal lobe in attention mecha-
Catani, Marco, Derek K. Jones, and H. Dominic. 2005. Perisylvian lan-
nisms has been discussed over a period of time. Michael Posner
guage networks of the human brain. Annals of Neurology 57: 816.
and Steven E. Peterson (1990) outlined the diferent subsystems Critchley, Macdonald. 1953. he Parietal Lobes. London: Edward Arnold.
of attention: a) orientation to sensory events (not conscious), b) Eckert, M. A., B. S. Hu, S. Eliez, U. Bellugi, A. Galaburda, J. Korenberg, D.
signal detection for focal processing (conscious), and c) main- Mills, and A. L. Reiss. 2005. Evidence for superior parietal impairment
tenance of a vigilant state (conscious). From the available neu- in Williams syndrome. Neurology 64: 1523.
rocognitve evidence, the researchers assert that the posterior Gerstmann, Josef. [1924] 1971. Fingeragnosie: Eine umschriebene
parietal lobe plays an important role in attention mechanisms, Strung der Orientierung am eigenen Krper. Wiener Klinische
speciically in orientation and signal detection that are essential Wochenschrift 37: 101012. Trans. in Archives of Neurology
for linguistic processing. Earlier, Luria (1973) identiied this par- 24: 4756.
ietal region that mediates attention as an involuntary orienting Geschwind, Norman. 1965. Disconnection syndromes in animals and
man. Brain 88: 23794.
system. However the posterior parietal attentional mechanisms
Hickok, Gregory. 2000. Speech perception, conduction aphasia, and
are greatly impacted by the frontal regions that subserve alerting
the functional neuroanatomy of language. In Language and the
mechanisms as well. Brain: Representation and Processing, ed. Y. Grodzinsky, L. Shapiro,
and D. Swinney, 87104. San Diego, CA: Academic Press
MEMORY. Traditionally, episodic memory, or declarative mem- Hollander, Bernard. 1902. Scientiic Phrenology. London: Grant
ory, has been attributed to the medial temporal lobe (MTL); Richards.
586
Parsing, Human Parsing, Machine
Kandel, Eric R., James H. Schwartz, and homas M. Jessell. 1991. WORKS CITED AND SUGGESTIONS FOR FURTHER READING
Principles of Neural Science, 3d ed. New York: Elsevier.
Battison, Robbin. 1978. Lexical Borrowing in American Sign Language.
Kolb, Bryan, and Ian Q. Whishaw. 1990. Fundamentals of Human
Silver Spring, MD: Linstok.
Neuropsychology. New York: Freeman.
Bresnan, Joan. 1982. he Mental Representation of Grammatical Relations.
Joseph, Rhawn. 2000. Neuropsychiatry, Neuropsychology, Clinical
Neuroscience. New York: Academic Press.
Browman, Catherine P., and Louis Goldstein. 1990. Articulatory gestures
Juhani Hyvarinen. 1982. he Parietal Cortex of Monkey and Man: Studies
as phonological units. Phonology 6: 20151.
of Brain Function. Berlin: Springer Verlag.
Charniak, Eugene. 1993. Statistical Language Learning. Cambridge,
Luria, Aleksandr Romanovich. 1973. he Working Brain: An Introduction
MA: MIT Press.
to Neuropsychology. New York: Basic Books.
Chomsky, Noam. 1965. Aspects of the heory of Syntax. Cambridge,
McFie John, and Oliver L. Zangwill. 1960. Visual-constructive disabili-
MA: MIT Press.
ties associated with lesions of the left cerebral hemisphere. Brain
Fowler, Carol A., and Julie M. Brown. 2000. Perceptual parsing of acous-
83: 24360.
tic consequences of velum lowering from information for vowels.
Posner, Michael, and Steven E. Peterson. 1990. Attention system of the
Perception and Psychophysics 62: 2132.
human brain. Annual Review of Neuroscience 13: 2542.
Frazier, Lyn. 1987. Sentence processing: A tutorial review. In Attention
Stein, J. F. 1989. Representation of egocentric space in the posterior pari-
and Performance. Vol. 12: he Psychology of Reading. Ed. M. Coltheart,
etal cortex. Experimental Physiology 74: 583606.
55986. Hillsdale, NJ: Lawrence Erlbaum.
Turner, William. 1873. he Convolutions of the Brain in Relation to
Frazier, Lyn, and Janet Dean Fodor. 1978. he sausage machine: A new
Intelligence. Yorkshire: he West Riding Lunatic Asylum Medical
two-stage parsing model. Cognition 6: 291325.
Reports.
Hale, John. 2006. Uncertainty about the rest of sentences. Cognitive
Wagner, Anthony D., Benjamin J. Shannon, Itamar Kahn, and Randy
Science 40: 64372.
L. Buckner. 2005. Parietal lobe contributions to episodic memory
Jurafsky, Daniel, and James H. Martin. 2000. Speech and Language
retrieval. Trends in Cognitive Sciences 9.9: 44553.
Processing: An Introduction to Natural Language Processing,
Computational Linguistics, and Speech Recognition. Upper Saddle
River, NJ: Prentice Hall.
PARSING, HUMAN McClelland, James, and Mark St. John. 1989. Sentence comprehen-
In general, parsing refers to breaking something into its sion: A PDP approach. Language and Cognitive Processes 4: 287336.
constituent parts. hus, machines (see parsing, machine) and Steedman, M. 1999. Connectionist sentence processing in perspective.
humans can decompose a message (such as print or spoken lan-
Tabor, W., and M. Tanenhaus. 1998. Dynamical models of sentence pro-
guage) into phrases, words, and morphemes. Most commonly
cessing. Cognition 23: 491515.
human parsing has been considered in the context of sentence
processing, particularly its syntactic and semantic aspects.
Language spoken, written, and signed can also be described PARSING, MACHINE
in terms of smaller functional units, including syllables, pho-
he query Over which strait in North Wales did homas Telford
nemes, features, and gestures.
build a suspension bridge? illustrates the fact that natural lan-
An understanding of grammatical constraints has guided the
guages have complex syntactic structures. Comparison of the
development of descriptive representations (e.g., sentence dia-
question with the answer He built a suspension bridge over the
grams) and formal systems of language structure and use. Parsing
Menai Strait reveals that the phrases including strait, occur in
models also have been inluenced by linguistic, psycholinguistic,
diferent positions in the two utterances, and that the verb posi-
and cognitive theory and by techniques used in computational
tions are quite diferent, leading linguists to propose a constit-
linguistics, natural language processing, and speech recogni-
uent structure like (1) for the question:
tion (Chomsky 1965; Bresnan 1982; Jurafsky and Martin 2000).
Representative approaches include linguistic, statistical, connec- (1) [[over/Preposition [which/Determiner [strait/Noun [in/
tionist, and dynamical systems models (Charniak 1993; Hale Preposition [North/Noun Wales/Noun ]NN ]PP ]N1 ]NP ]
2006; McClelland and St. John 1989; Steedman 1999; Tabor and PP [did/Vaux homas_Telford/NP [build/Verb [a/
Tanenhaus 1998; see also self-organizing systems). Determiner [suspension/Noun bridge/Noun ]Noun ]
he scale at which we can break the signal into pieces depends NP t/PP ]VP ]Sinv ]Sq
upon both our attention to detail and to our descriptive goals, as
A parser is a program that analyzes sentences in order to igure
can be seen in numerous psychological studies that range from
out their structure, using a list of rules describing the structure of
ambiguity resolution (Frazier and Fodor 1978; Frazier 1987) to
the language, such as:
assessment of our ability to perceive, produce, and use informa-
tion at various levels of description. Parsing linguistic information (2) S NP VP
is not restricted to sound and print but can include a consider- Sq PP Sinv
ation of the gestures underlying the production of language by Sinv Vaux NP PP
voice (the coordinated movement of speech articulators, such as PP Preposition NP
the tongue body, tongue tip, jaw, and lips; see speech produc- NP Determiner N1
tion) and sign (manual, facial, and body orientation) (Battison etc.
1978; Browman and Goldstein 1990; Fowler and Brown 2000).
Most parsers begin by determining the part of speech of each
Philip Rubin word (see word classes). A bottom-up parser then attempts to
587
Passing Theories
group the words into phrases and phrases into clauses, according irst, meaning. A nice derangement of epitaphs thus literally
to the grammar rules, keeping track of multiple possible analyses means a nice arrangement of epithets if uttered with the relevant
because of the extensive ambiguity of natural languages. Top- semantic intention and understood accordingly. According to
down parsers, though perhaps less intuitive, are more frequently Davidson, this does not obliterate the distinction between lit-
used: hey essentially work by attempting to generate the input eral meaning and speakers meaning; speakers meaning for
sentence. instance, metaphorical meaning always comes later in the
Standard parsing algorithms for analyzing any artificial order of intentions.
language (such as programming languages) have been devel- According to Davidson, Tarski-style theories of truth
oped and can be used with any context-free grammar. hus, (T-theories) can be used as formal semantic theories. To spe-
linguists can write the grammar rules: hey do not need to be cify the literal meaning of any utterance, be it ever so idiosyn-
programmers. But linguists grammars of natural languages cratic, a full T-theory is required. In the case of malapropisms
often make use of additional devices, such as agreement or and other novel or idiosyncratic use, these theories will be of a
subcategory features (as with Sinv and Vaux in (1) and (2), denot- transient, passing character; they might not hold for more than
ing inverted sentences and auxiliary verbs). generative a single utterance. If they hold for a certain utterance, Davidson
grammars, therefore, usually augment constituent structure speaks of speaker and hearer sharing a passing theory (for
with additional information: here may be labels to uniquely that utterance). Prior theories, on the other hand, specify the
identify individuals or additional levels of information, such as interpretations speakers expect hearers to make, and hearers are
meanings. Work on feature-based frameworks such as lexical- prepared to make, prior to actual utterances (cf. Davidson [1986]
functional grammar and head-driven phrase struc- 2005, 101f).
ture grammar has gone hand in hand with the development Davidson then uses the terminology of prior and passing
of complementary parsing methods. theories to renew his argument against any account of linguistic
competence essentially involving the prior mastery of a system
John Coleman
of shared semantic and syntactic conventions or rules: [S]har-
ing such a previously mastered ability [is] neither necessary nor
SUGGESTION FOR FURTHER READING
suicient for successful linguistic communication (Davidson
Jurafsky, Daniel, and James H. Martin. 2000. Speech and Language [1994] 2005, 110; cf. also [1982] 1984). To model successful lin-
Processing. Upper Saddle River, NJ: Prentice Hall. guistic communication, systematic semantic theories of pass-
ing and prior nature are required, but sharing of prior theories
is not suicient for successful linguistic communication. Even if
PASSING THEORIES
speaker and hearer share a prior theory, the ability to interpret
Passing theories are utterance-speciic formal semantic theo- in accordance with that theory does not account for those cases
ries; they specify the correct interpretation, or literal meaning, of successful communication where words are used in novel or
of particular linguistic utterances: sentences uttered by par- idiosyncratic ways. Nor is a shared prior theory necessary for
ticular speakers at particular times. he expression passing communication to succeed all that is necessary is that the pass-
theory was coined by Donald Davidson in his 1986 paper A ing theory be shared. Sharing passing theories, however, does
Nice Derangement of Epitaphs, which was part of his attack on not amount to sharing a previously mastered ability: In conclu-
accounts of linguistic communication essentially involving con- sion, then, I want to urge that linguistic communication does not
ventionally determined, shared meanings ([1986] 2005). require, though it very often makes use of, rule-governed rep-
According to Davidson, expressions like language, meaning, etition; and in that case, convention does not help explain what
or sentence are theoretical terms used for describing, or explain- is basic to linguistic communication, though it may describe a
ing, successful linguistic communication (cf. [1992] 2001, 108f). usual, though contingent feature (Davidson [1982] 1984, 280).
For communicative success, regular or conventional use of Davidsons 1986 paper has been heavily criticized, among
words is not necessary; what is necessary is only that the hearer others by Michael Dummett. Part of the criticism is due to the
understand what the speaker intends to mean. For instance, if by provocative formulation Davidson gives there to his conclu-
the words a nice derangement of epitaphs the speaker intends sion: [T]here is no such thing as a language, not if a language
to mean a nice arrangement of epithets and the hearer under- is anything like what many philosophers and linguists have sup-
stands that, we have a case of successful linguistic communica- posed ([1986] 2005, 107). A controversy between Davidson and
tion. Davidson suggests characterizing communicative success Dummett ensued regarding the questions of whether the notion
in terms of the semantic intentions of the speaker. hese he con- of an idiolect is to be explained in terms of a communal language
strues as intentions to be interpreted in a particular way on a par- or the other way around, and whether meaning is essentially nor-
ticular occasion and by a particular hearer. Moreover, they are of mative or prescriptive. Davidson argues that any obligation we
a Gricean, self-referential form (see communicative inten- owe to conformity is contingent on the desire to be understood
tion): A semantic intention is an intention to achieve the end ([1994] 2005, 118), and he explicitly opposes those forms of social
of being interpreted in a certain way by means of the intentions meaning externalism (such as Tyler Burges), according to
being recognized by the hearer (Davidson [1986] 2005, 92 f). which the literal meaning of words is essentially a matter of the
Any utterance is made with a number of intentions that can be linguistic practices of the community surrounding the speaker
ordered in terms of means to ends; the irst intention in such (Davidson [1994] 2005, 119). Just as for Gricean accounts of
a sequence (as ordered by in order to) speciies its literal, or meaning, there are also issues of psychological realism that arise
588
Performance
for Davidsons account of successful linguistic communication speech community, deined as an organization of diversity that
in terms of the complicated semantic intentions of the speaker. had to be constituted and managed via performances, rather
than as a preexisting homogeneous entity.
Kathrin Gler
In his concern with how language functions in society,
Hymes was inspired by the work of the prewar Prague School
(192938) and, in particular, by Roman Jakobson (18961982).
Bar-On, Dorit, and M. Risjord. 1992. Is there such a thing as a language? Working against Russian Formalisms emphasis on the inner
Canadian Journal of Philosophy 22: 16390. laws and formal structure of text without regard for context,
Davidson, Donald. [1982] 1984. Communication and convention. the Prague School focused attention on the multifunctionality
Inquiries into Truth and Interpretation, 26580. Oxford: Clarendon.
of language. Jakobson (1960), building on work by Karl Bhler
. [1986] 2005. A nice derangement of epitaphs. In Truth, Language,
and Jan Mukaovsk, identiied six constitutive factors of a com-
and History, 89108. Oxford: Clarendon.
. [1992] 2001. he second person. In Subjective, Intersubjective,
municative event and postulated that each factor was associated
Objective: 10722. Oxford: Clarendon. with a particular language function. Jakobsons constitutive fac-
. [1994] 2005. he social aspect of language. In Truth, Language, tors include addresser, addressee, context, message, contact,
and History, 10926. Oxford: Clarendon. and code; he termed their associated functions expressive (or
Dummett, Michael. A nice derangement of epitaphs: Some comments emotive), conative, referential, poetic, phatic, and metalingual.
on Davidson and Hacking. In Truth and Interpretation: Perspectives hus, for instance, an utterance (such as eee-gads!) that directs
on the Philosophy of Donald Davidson, ed. E. Lepore, 45976. attention to the addresser (speaker) would be associated with
Oxford: Blackwell. the expressive function, and so on. his model provided a basis
Gler, Kathrin. 2001. Dreams and nightmares: Conventions, Norms, from which scholars could investigate the relationships among
and Meaning in Davidsons Philosophy of Language. In Interpreting
form, function, and meaning.
Davidson, ed. P. Kotatko, P. Pagin, and G. Segal, 5374. Stanford,
In attending to speaking as a social accomplishment, the eth-
CA: CSLI.
Pietroski, Paul. 1994. A defense of derangement. Canadian Journal of
nography of speaking opened the way for studies of language
Philosophy 24: 95118. as an arena for the performance of social identities (see iden-
tity, language and). Earlier studies tended to focus on the
organization of communicative life in small, often face-to-face
PERFORMANCE
communities, highlighting the diferential distribution of lin-
he study of performance investigates communicative practices guistic resources by age, gender, ethnicity, or other status
in their sociocultural contexts from three perspectives. First, it markers (see Bauman and Sherzer 1974). Later works consider
foregrounds the performativity of communicative forms and how particular linguistic performances are both embedded in
practices as modes of action or means of accomplishing social and help to shape wider political or cultural formations, such
ends. Second, it directs attention to the poetics of communica- as race relations, subcultural or national identities, multicul-
tive practice or to the forms of verbal artistry through which com- turalism, secularism, and the like. Linguistic anthropologys
municative acts are crafted and communicative skill is displayed. historical emphasis on ways of speaking, strategies of voicing,
hird, it focuses attention on performances as a special class of participation structures, and orientation to audiences made the
events, such as rituals, spectacles, festivals, or fairs, in which a ield especially amenable to the approach of the Russian liter-
societys symbols and values are publicly displayed, interpreted, ary theorist Mikhail Bakhtin (1981), whose work on dialogism
and transformed. Within language study, the irst and second and heteroglossia inspired studies in areas including lan-
perspectives have been foregrounded. guage ideology, genre, and intertextuality (Silverstein
he contemporary focus on the poetics and performance of and Urban 1996).
communicative practice emerged in the subdiscipline of linguis- By foregrounding speaking as a social performance, the eth-
tic anthropology from a line of inquiry called the ethnography of nography of speaking countered an alternative use of the term
speaking. Developed by Dell Hymes and his students during the performance proposed by Noam Chomsky (1965). Drawing
1960s and 1970s, the ethnography of speaking highlights perfor- on a distinction made by Ferdinand de Saussure ([1907] 1959)
mance in two linked ways: as speaking practice and as artfully between language (langue) and speech (parole), Chomsky
marked ways of speaking (Bauman and Sherzer 1975). Its cen- deined performance as the incomplete and imperfect realization
terpiece is what Hymes called the speech event or communica- of language by particular speakers. He opposed performance to
tive event, a framework that allowed scholars to analyze multiple competence, an internalized set of general rules that constitute
components of language in use, including setting, participants, ones knowledge of a language, abstracted from particularities of
ends (goals, purposes), act sequences, key (tone, tenor), instru- performance. In contrast, theorists of performance, along with
mentalities (channel, code), norms, and genres (the SPEAKING many linguistic anthropologists and sociolinguists, empha-
acronym provides a mnemonic) (Hymes 1967). he interest was size communicative competence, understood not as a hypothet-
not simply in cataloging these components but, rather, in under- ical capacity for language but as the contextually grounded and
standing how speakers use language within the conduct of social culturally acquired ability to speak in socially appropriate ways
life. In highlighting the emergent and creative nature of speech (Bauman 1977, 11). Here, speaking is understood as a creative
performance, the ethnography of speaking focused attention and emergent act through which social life is accomplished. As
on linguistic forms as resources for living, in Kenneth Burkes such, speaking is inherently risky; it involves skill and account-
([1941] 1973) sense. Further, it proposed a new unit of study, the ability and is subject to critical evaluation.
589
Performance
Richard Bauman highlights the dimensions of risk, respon- to forms of social relationship aligned with the democratic
sibility, and accountability in what has become a classic deini- aspirations of their movement (the Berber Cultural Movement
tion of performance: Performance as a mode of spoken verbal was a minority ethnolinguistic, subnational, and secular
communication consists in the assumption of responsibility to opposition movement in a majority Arabo-Islamist nation).
an audience for a display of communicative competence (1977, A chorus ofered a way of teaching children new gender roles
11). Inspired by Hymes, Bauman has been particularly inter- while displaying new modes of gender interaction to the wider
ested in the forms of verbal artistry through which communica- community. To accomplish this, the young men created a new,
tive skill is put on display. His work generated a pivotal shift in highly marked event within the already marked wedding: hey
folklore studies from a classiicatory concern with texts indepen- mounted a stage, rented microphones, hung lights, and thus
dent of their contexts of use to an interest in the performance of conigured an entirely new relationship between performers
verbal art as a constitutive ingredient of social life. Performance and audience, placing the guests in an unfamiliar spectator
in this sense may range from sustained, full performance to a role. he children sang political songs that, while well known,
leeting breakthrough into performance, with hedged or negoti- were not typically associated with weddings. his repertoire
ated performance lying somewhere in between (Bauman 2004, provided the backdrop for yet a third performance: An ado-
110; the phrase breakthrough into performance comes from lescent girl recited a poem on gender relations written by her
Hymes 1981). Both Bauman and the interactional sociologist brother (the chorus director) a novel form of verbal art that
Erving Gofman have been interested in how performances are until then had no possibility of public performance in the vil-
framed or keyed, but whereas Gofmans approach is dramatur- lage. Yet the girl appeared to be only partially invested in serv-
gical, highlighting how social actors move from back stage ing as a spokesperson for her brothers text (she animated the
regions to perform the face work associated with an array of text, in Gofmans sense); at one point, she stumbled over the
social roles (Gofman 1959), Baumans interest lies in poetics, words, and her brother prompted her, mouthing the words
voice, and genre as verbal resources for the accomplishment of from the sidelines. he event culminated in a rousing dance in
social ends. which the children spontaneously organized themselves into
hus far, performance has been considered from two related malefemale couples, a transformation of gender roles in dance
vantage points, each grounded in particular disciplinary per- that galvanized the audience for nearly an hour.
spectives: Performance as speaking practice has been a focus his multilayered performance highlights the use of verbal
of linguistic anthropology and sociolinguistics; performance art (songs, poems) alongside other performance modes to efect
as verbal art has been highlighted in folkloristics and linguistic a transformation of the social relations of gender. It also illus-
anthropology. A third approach views performance as a special trates diferential relations to linguistic resources and linguis-
class of marked events in which a societys symbols are dis- tic authority (a concern of the ethnography of speaking): he
played for commentary, interpretation, or transformation. his childrens chorus had access to political repertoires but not to
approach, pioneered by Victor Turner (1967, 1969), is less con- womens traditional songs or henna poems. A young man could
cerned with language per se. hrough its focus on collective rep- fashion himself as the author of a poem; a young woman could
resentations, cultural symbolism, and collective efervescence, only animate it, and was subjected to her brothers corrective
or communitas, it is located in a Durkheimian paradigm, with voicing from the sidelines. Further, it shows how the participant
inspiration from Arnold Van Genneps work on rites of passage. structure (irst made salient in Hymess SPEAKING model) was
Increasingly, however, scholars are drawing on aspects of all both creatively altered for political ends and amenable to mul-
three approaches. One example of how the three approaches tiple interpretations. Putting girls on a public stage constituted
may be productively considered together is Jane Goodmans a display of political commitment to democracy for the young
analysis of a childrens performance in the Kabyle Berber region men; for the girls, in contrast, their appearance on stage was a
of Algeria (2005). highly controversial and far more ambivalent deviation from the
he performance in question took place at a wedding, social norms of female performance. Beyond gender consider-
understood as a festive occasion in which villagers suspended ations, this performance clearly reoriented what was typically
interpersonal or political conlicts and came together to collec- framed as a purely local event to wider ethnolinguistic and sub-
tively celebrate the new union. he wedding was set apart from national concerns. Yet embedding this political orientation into
everyday life by various formal markers: location (an outdoor the already sanctioned frame of the wedding entailed less risk
public square), timing (late evening), dress, music (traditional (and ensured greater audience) than mounting a stand-alone
band), and activities (dance). Special forms of verbal art also political event might have done.
marked the occasion: A hired poet recited a poem after henna In sum, the study of performance provides a point of entry
was applied to the grooms hand; older village women sang for research into social life as it is constituted, critiqued, and
traditional songs to mark transitions. Wedding guests danced transformed through communicative practices. It highlights the
to show support for the new couple. In this village, men and emergent, creative, and transformative nature of language use in
women shared the same dancing space but typically danced a sociocultural context. Finally, performance ofers a compelling
sequentially rather than concurrently; in no case did they vantage point on the mutually constitutive relationship between
dance as couples. One summer, however, village youth active seemingly microlevel practices and wider processes, ideologies,
in the national Berber Cultural Movement formed a mixed- and political formations.
gender childrens chorus as a way of changing gender relations
in the community and, more broadly, fostering a commitment Jane E. Goodman
590
Performative and Constative
WORKS CITED AND SUGGESTIONS FOR FURTHER READING perform an action, as opposed to constatives, which describe a
Bakhtin, Mikhail. 1981. he Dialogic Imagination. Ed. Michael Holquist,
state of afairs.
trans. Caryl Emerson and Michael Holquist. Austin: University of Texas he seminal source for this distinction is Austin (1962), pub-
Press. lished posthumously as a written record of lectures delivered in
Bauman, Richard. 1977. Verbal Art as Performance. Prospect Heights, 1955 (and based on earlier, largely unpublished ideas). his is
IL: Waveland. important for two reasons. First of all, much of Austins thinking
. 2004. A World of Others Words: Cross-Cultural Perspectives on is actually contemporary with (though probably largely uninlu-
Intertextuality. Malden, MA, and Oxford: Blackwell. enced by) Ludwig Wittgensteins ideas on language-games.
Bauman, Richard, and Joel Sherzer, eds. 1974. Explorations in the Secondly, the 1962 monograph records an evolution in Austins
Ethnography of Speaking. London and New York: Cambridge University thinking, in which he starts from a distinction between two utter-
Press.
ance classes and ends up drawing the conclusion that this dis-
. 1975. he ethnography of speaking. Annual Review of
tinction is untenable and that all utterances perform. In order
Anthropology 4: 95119.
Burke, Kenneth. [1941] 1973. he Philosophy of Literary Form: Studies in
to understand this major shift, it is necessary to trace the evolu-
Symbolic Action. 3d ed. Berkeley and London: University of California tion in his model in some detail.
Press. Constatives are deined as utterances that have truth condi-
Chomsky, Noam. 1965. Aspects of the heory of Syntax. Cambridge, tions, the prototype case being desciptions of states of afairs
MA: MIT Press. (e.g., he sun comes up in the East). In contrast, Austins origi-
Hymes, Dell. 1967. Models of the interaction of language and social set- nal performatives do not have truth conditions, in that they do
ting. Journal of Social Issues 23.2: 828. not commit the speakers beliefs to the proposition expressed.
. 1981. In vain I tried to tell you: Essays in Native American ethno- Utterances such as I hereby baptize this child John Doe do
poetics. Philadelphia: University of Pennsylvania Press.
things: hey perform actions, in that they change reality from
Gofman, Erving. 1959. he Presentation of Self in Everyday Life. New
one in which a child named John Doe does not exist to one in
York: Doubleday.
which such a child does exist. Performatives do not have truth
Goodman, Jane. 2005. Berber Culture on the World Stage: From Village to
Video. Bloomington: Indiana University Press. conditions but, rather, felicity conditions; that is, they are
Gumperz, John J., and Dell Hymes, eds. 1986. Directions in only performed successfully (or happily, to use Austins term) in
Sociolinguistics: he Ethnography of Communication. Oxford and New speciic circumstances. For instance, baptizing is only performed
York: Blackwell. happily if the speaker has the proper authority to perform the
Jakobson, Roman. 1960. Concluding statement: Linguistics and poetics. procedure (e.g., a priest), if the procedure is carried out correctly
In Style in Language, ed. T. A. Sebeok, 35077. Cambridge, MA: MIT and completely (using the appropriate verbal format), and if the
Press. parties involved carry out any necessary subsequent conduct.
. 1990. On Language. Ed. Linda R. Waugh and Monique Monville- he question arises whether performatives and constat-
Burston. Cambridge: Harvard University Press.
ives have any formal identifying features, such as grammatical
Saussure, Ferdinand de. [1907] 1959. Course in General Linguistics.
or lexical devices, that provide cues for the hearer about their
Ed. Charles Bally and Albert Sechehaye, trans. Wade Baskin. New
pragmatic status. Austin originally thought that so-called per-
York: Philosophical Library.
Silverstein, Michael, and Greg Urban. 1996. Natural Histories of Discourse. formative verbs might be a good candidate; in an utterance like
Chicago: University of Chicago Press. I (hereby) promise Ill inish the essay on time, the matrix verb
Turner, Victor. 1967. he Forest of Symbols: Aspects of Ndembu Ritual. promise marks the performance of the action of promising.
Ithaca, NY: Cornell University Press. Constatives appear to lack such a marker. Since performative
. 1969. he Ritual Process: Structure and Antistructure. verbs are easily identiied (they are irst person present tense
Chicago: Aldine. indicative, they collocate with hereby, etc), they may function as
powerful cues and can be said (following Searle 1969) to function
as illocutionary force indicating devices (or IFIDs).
PERFORMATIVE AND CONSTATIVE
he assumption that performatives must have performative
he distinction between performative and constative utterances verbs proves untenable, however. A slightly variant formula-
was irst introduced by J. L. Austin, and is illustrative of a reaction tion of the aforementioned promise, such as Ill inish the essay
within language philosophy to the doctrine of logical posi- on time, is functionally very similar, if not identical, to the ver-
tivism: his paradigm holds that sentence meaning can be sion with promise. he problem is that the second version does
captured in terms of truth conditions (see truth conditional not contain a performative verb; therefore, if one assumes that
semantics) and logical relations, and that sentences that can- both versions perform the same action, it must necessarily follow
not be thus veriied are essentially meaningless. In contrast, that performative utterances do not need to have explicit perfor-
ordinary language philosophy, as conceived by philoso- mative verbs. his leads Austin to posit a distinction between
phers such as J. L. Austin, Peter Strawson, and H. P. Grice (see explicit and implicit (or, as he called them, primary performa-
cooperative principle), examines language in use, and thus tives) performatives.
lies at the basis of the development of modern pragmatics. We have now lost any kind of formal marking of performatives
Austin observes that there are utterances, such as I (hereby) since both constatives and implicit performatives lack overt per-
bequeath my watch to my brother, for which any evaluation in formative verbs. In fact, once one posits the existence of implicit
terms of truth and falsity is irrelevant; this type of utterance he performatives, the possibility is raised that a constative such as
labeled (at least at the outset) performatives, or utterances that he sun rises in the East is, in fact, an implicit version of the more
591
Performative and Constative
explicit I (hereby) state that the sun rises in the East (since state (ii) Directives (e.g., requests, suggestions, commands),
here has all the characteristics of a performative verb). What is which consist of attempts by the speaker to get the hearer to
more, one could claim that statements also have felicity condi- do something;
tions in the sense that they are only uttered happily if the speaker (iii) Commissives (e.g., promising, threatening, ofering),
is reasonably sure about the truth of the proposition expressed. which commit the speaker to some future course of action;
Conversely, many performatives need to bear some relation to
(iv) Expressives (e.g., apologizing, congratulating, thanking),
actual facts and, thus, have at least some propositional content.
which express a psychological state;
he question thus arises whether constatives are similar to per-
formatives in that they also perform an action. Austin admits (v) Declarations (e.g., declaring war, baptizing, chisten-
that they do, namely, the action of committing the speaker to the ing), which efect immediate changes in some institutional
truth of the propososition: Once we realize that what we have state of afairs, typically relying on elaborate extralinguistic
to study is not the sentence but the issuing of an utterance in a institutions.
speech situation, there can hardly be any longer a possibility of his classiication, despite having been hugely inluential,
not seeing that stating is performing an act (Austin 1962, 139). raises some serious problems. First of all, some speech-acts seem
he performativeconstative distinction thus becomes unten- to belong to more than one category: A complaint such as Im
able, and one can only conclude that all utterances are actions. upset that you forgot to put the trash out presumably expresses the
Austins original two utterance classes are then merely sub- speakers psychological state (expressive) but might also be inter-
classes of acts performed through language, or speech-acts, preted as an attempt to get the hearer to take the trash out (direc-
which consist of three distinct types of act: locution, illocution, tive). Secondly, illocutionary acts relect the communicative
and perlocution. intention of the speaker, who hopes that this intention will be
he locutionary act can be more or less equated to the recognized and interpreted accurately by the hearer. Again, this
semantic meaning of the utterance and is roughly equivalent raises the question as to how hearers are able to do so. Searles
to uttering a certain sentence with a certain sense and refer- answer is the performative hypothesis, whereby every utterance U
ence (Austin 1962, 109). Illocutionary acts are utterances has an underlying format of the form I (hereby) Vp you (that) U,
which have a certain (conventional) force (1962, 109), such Vp representing the (explicit or implicit) performative verb. his
as baptizing, promising, and all of Austins original performa- still begs the question how hearers know that, for instance, he
tives, but also former constatives such as informing and stating. door is standing wide open is the implicit version of I apologize for
Perlocutionary acts, inally, are what we bring about or achieve leaving the door open, rather than of I am complaining that you
by saying something (1962, 109), that is, the consequences that left the door open (or, for that matter, an indirect version of the
utterances trigger (which may, but need not be verbal), such as request Could you close the door?) he three traditional sentence
convincing, deterring, or frightening. In short, all utterances per- types (declarative, interrogative, imperative) potentially ofer
form three diferent acts: the act of saying something (locution), some help by functioning as IFIDs, as may some lexical relexes
what the speakers intention is in saying something (illocution), associated with certain illocutionary acts (e.g., please appears to
and what its consequences are by saying something (perlocu- co-occur exclusively with directives). However, the fact remains
tion). In much of the subsequent literature, the term speech-act that most speech-act realizations contain neither a performative
has become virtually synonymous with the illocutionary force of verb nor any other IFID. Such utterances, which exhibit no overt
the utterance, but it is important to stress that, for Austin, per- structural marking of their speech-act status (as in he door is
forming a speech-act involves performing all three kinds of act standing wide open when intended as a request), Searle labels
simultaneously. indirect speech-acts. However, since most usages of speech-acts
Since all utterances are now considered to be performative appear to be indirect rather than direct, it remains unexplained
speech-acts, the question is raised as to how many diferent how hearers are capable of computing the speakers intended
classes of speech-acts can be distinguished on linguistic grounds. illocutionary force in the absence of structural signals. A possible
Out of the three acts involved, locution does not provide any use- explanation is that people rely on contextual cues, working out
ful distinguishing criteria since the same propositional content the implicit meaning by relying on Grices cooperative principle
can be employed for creating various speech-acts; neither does through conversational implicatures.
perlocution since the perlocutionary efect of a speech-act is dif- he fact remains that Searles classiication ofers little help in
icult to predict. However, utterances do difer systematically assigning speech-act status to stretches of verbal interaction in
with regard to their illocutionary force and, thus, presumably ethnographic data. Ultimately, it could be argued, the interpreta-
have diferent felicity conditions. It should, therefore, be possible tion of an utterance will depend on the speech event in which it
to develop a new taxonomy of illocutionary acts based on these occurs, that is, the culturally recognized social activity in which
felicity conditions or the linguistic realizations thereof. language plays a speciic, and often rather specialized, role
Austin did, in fact, develop a rudimentary taxonomy, but it (Levinson 1983, 279). In classroom interactions, for instance,
was left to his pupil J. R. Searle to come up with a more systematic teacher questions regularly violate Searles sincerity condition
classiication (see Searle 1979). Searle distinguishes ive major since the speaker already knows the answer and is thus not sin-
classes of illocutionary act: cere in trying to obtain a missing piece of information.
(i) Representatives (e.g., stating, describing, concluding), A radically diferent approach to the interpretation prob-
which commit the speaker to the truth of the expressed lem (i.e., how the speakers communicative intention is rec-
proposition; ognized in the absence of linguistic cues) is ofered by the
592
Perisylvian Cortex
ethnomethodological paradigm of conversation analysis. 1861; Wernicke [1874] 1969), usually the left in right-handed
Consider the following exchange: individuals (Broca 1865). his cortex surrounds the sylvian is-
sure and runs from the pars triangularis and opercularis of the
S: Another glass of wine would hit the spot.
inferior frontal gyrus (Brodmans areas [BA] 45, 44: Brocas area),
H: I dont think so mate, youve had enough.
through the angular and supramarginal gyri (BA 39 and 40) into
Ss utterance, despite being a declarative, is clearly not inter- the superior temporal gyrus (BA 22: Wernickes area) in the dom-
preted as simply stating a fact by H; rather, Hs response (a inant hemisphere (Figure 1).
refusal to comply) shows that it was interpreted as a request
for another glass of wine. he basis for interpretation here lies
in the conversational sequencing of the two contributions: hey
Classical Clinical Models of the Functional Neuroanatomy
are conditionally reliant upon each other, by virtue of being two
of Perisylvian Cortex for Language
he irst theories of the functional neuroanatomy of language
parts of a request-refusal adjacency pair. he question as to
pertained to this cortical region. he pioneers of aphasiology
the intended illocutionary force of Ss turn becomes moot in this
Paul Broca, Karl Wernicke, John Hughlings Jackson, and other
approach; what matters is that H has clearly interpreted it as
neurologists described patients with lesions in the left inferior
request-like, having provided an appropriate second part to the
frontal lobe whose speech was hesitant and poorly articu-
adjacency pair. Of course, H might provide an incorrect interpre-
lated, and other patients with lesions more posteriorly, in the
tation, but if this is the case, it will become appararent in the sub-
superior temporal lobe, who had disturbances of compre-
sequent interaction. Such an inductive approach avoids some of
hension and luent speech with sound and word substitutions
the pitfalls inherent in attempts to classify speech-acts according
(see aphasia). hese correlations led to the theory that lan-
to the nonobservable, and therefore unfalsiiable, intentions of
guage comprehension went on in unimodal auditory association
the speaker.
cortex (wernickes area, BA 22) adjacent to the primary audi-
Ronald Geluykens tory cortex (Heschls gyrus, BA 41), and motor speech planning
went on in unimodal motor association cortex in brocas area
WORKS CITED AND SUGGESTIONS FOR FURTHER READING (BA 44 and 45) adjacent to the primary motor cortex (BA 4).
hese theories incorporated the only principle that has ever been
Austin, J. L. 1962. How to Do hings with Words. Oxford: Oxford University
articulated regarding the localization of a language operation.
Press.
According to this principle, language operations are localized in
Levinson, S. C. 1983. Pragmatics. Cambridge: Cambridge University
Press. relation to their sensory-motor requirements. Speech planning
Searle, J. R. 1969. Speech Acts: An Essay in the Philosophy of Language. goes on in Brocas area because Brocas is immediately adjacent
Cambridge: Cambridge University Press. to the motor area responsible for movement of the articulators,
. 1979. Expression and Meaning: Studies in the heory of Speech Acts. and Wernickes area is involved in comprehension because it
Cambridge: Cambridge University Press. is immediately adjacent to the primary auditory cortex. hese
ideas and models were extended by Norman Geschwind and his
colleagues in the 1960s and 1970s. Geschwind (1965) added the
PERISYLVIAN CORTEX
hypothesis that word meaning was localized in the inferior
Beginning in the late nineteenth century, the application of dei- parietal lobe (BA 39 and 40) because word meanings consist
cit-lesion correlations based on autopsy material to the problem of associations between sounds and properties of objects, and
of the regional specialization of the brain for language yielded the inferior parietal lobe is an area of multimodal association
the fact that human language requires parts of the association cortex to which ibers from unimodal association cortex related
cortex in the lateral portion of one cerebral hemisphere (Broca to audition, vision, and somasthesis project.
Inferior parietal lobe
Figure 1. A depiction of the left hemisphere of the brain

Brocas area showing the main language areas.
Wernickes area
593
Perisylvian Cortex
Despite its widespread clinical use, however, this model has evidence that this wave originates in the inferior temporal lobe
serious limitations. It deals only with words, not other levels of (Nobre and McCarthy 1995), though perhaps more posteriorly
the language code. From a linguistic and psycholinguistic than the lesion studies would suggest. Other brain areas that
point of view, the syndromes are all composed of many process- have been suggested as loci for semantic processing (the infe-
ing deicits, which are diferent in diferent patients. he syn- rior frontal lobe: Petersen et al. 1988; Dapretto and Bookheimer
dromes themselves do not provide a guide to the localization of 1999) are much less clearly related to this function.
more speciic components of the language processing system. In the past two decades, studies of impairments of word
As reviewed in the following, Geschwinds critical contribu- meaning and functional neuroimaging have suggested a iner-
tion regarding the role of the parietal lobe receives no empirical grained set of distinctions within the class of objects. Both dei-
support. cits and functional activation studies have suggested that there
are unique neural loci for the representation of categories such
Linguistically Oriented Models of the Functional as tools (frontal association cortex), animals and foods (lateral
Neuroanatomy of the Perisylvian Cortex for Language inferior temporal lobe), and faces (medial inferior temporal lobe)
Since approximately 1975, psychologists and linguists have (see Caramazza and Mahon 2006, for review). Debate continues
approached language disorders and their neural basis in a more as to whether such divisions and localizations relect diferent
systematic fashion, informed by models of language structure co-occurrences of properties of objects within these classes or
and function. I briely review two areas of work that relate these innate, neurally localized human capacities to divide the world
models to the functional neuroanatomy of the persiylvian cortex along these lines.
and other brain regions.
SYNTACTIC PROCESSING. Most researchers also subscribe to
LEXICAL SEMANTIC PROCESSING. As noted, traditional neu- localizationist views regarding aspects of syntactic process-
rological models of the neural basis for word meaning main- ing. A well-known hypothesis is the trace deletion hypothesis
tained that the meanings of words consist of sets of neural (Grodzinsky 2000), which claims that patients with lesions in
correlates of the physical properties that are associated with a Brocas area have deicits afecting certain moved constituents
heard word (Wernicke [1874] 1969), all converging in the infe- (traces in Chomskys theory). he evidence supporting these
rior parietal lobe (Geschwind 1965). It is now known that most models is based on correlating deicits in syntactic comprehen-
lesions in the inferior parietal lobe do not afect word meaning sion to lesions. However, there are two issues that such data must
(Hart and Gordon 1990), and functional neuroimaging stud- face. First, it is often not clear whether a patient has a deicit in
ies designed to activate word meanings do not tend to activate a particular parsing operation or a reduction in the resources
this region (see the following). A. Damasio (1989), therefore, available to accomplish syntactically based comprehension.
modiied this model, suggesting that the meanings of words Second, there is virtually no consistency in an individual patients
included retroactivation of neural patterns in unimodal performance across tasks, raising questions about whether a
association primary sensory cortex. Evidence for this comes patient who fails on a particular structure has a parsing deicit
from functional neuroimaging results that reveal activation (Caplan, DeDe, and Michaud 2006 and Caplan, Waters, Dede, et
for diferent classes of words in diferent areas, each related al. 2007).
to the sensory-motor associations of the word (frontal cortex Assuming that patients performances relect deicits in
for verbs and manipulable objects; inferior temporal cortex for particular parsing operations, the relation of these deicits to
concrete nouns) (see Caramazza and Mahon 2006, for review). lesions does not support invariant localization models. We have
However, it is not clear that these activations relect the mean- recently reported the most detailed study of patients with lesions
ing of words, rather than properties commonly associated with whose syntactic comprehension has been assessed (Caplan,
words. Word meanings include much more than sensory and Waters, Kennedy, et al. 2007). Lesion size in multiple, cytoarchi-
motor associations; the essence of word meaning is itself quite tectonically diferent small areas of cortex both within and out-
mysterious (Fodor 1998). In any event, word meanings are part side the perisylvian and non-perisylvian area, not connected by
of a network that relates a word to a complex set of concepts major iber tracts, predicted performance, ruling out invariant
and contexts (Tulving 1972). localization as the mode of neural organization for the opera-
here is evidence that a critical part of this semantic network tions supporting this function that were assessed. At the same
is located outside the perisylvian cortex, in the anterior inferior time, patients who performed at similar levels behaviorally had
temporal lobes. Patients with semantic dementia, a degenera- lesions of very diferent sizes in larger areas of the brain (such
tive disease that afects the anterior inferior temporal lobe, and as the perisylvian association cortex, or the entire left hemi-
herpes encephalitis, with somewhat more posterior lesions, have spheric cortex) in which it has been suggested that syntactic
initially selective and ongoing major problems with many aspects processing might be distributed, and patients with equivalent
of semantic memory (Davies et al. 2005; Gorno-Tempini et al. lesion sizes in these larger areas varied greatly in their level of
2004; Warrington and Shallice 1984). Activation studies have performance, arguing that syntactic processing in comprehen-
implicated the inferior temporal cortex in representing concepts sion is not distributed in these areas. he data are consistent
and word meanings (Caramazza and Mahon 2006). Some stud- with a model in which the neural tissue that is responsible for
ies of the neural generators for the N400 event-related potential the operations underlying sentence comprehension and syntac-
(ERP) wave, which relects some aspect of semantic processing tic processing is localized in diferent neural regions in diferent
(Kutas and Hillyard 1980; Holcomb and Neville 1990), present individuals.
594
Perisylvian Cortex Perlocution
Functional neuroimaging studies have been said to provide Caramazza, A. and B. Mahon. 2006. he organization of conceptual
evidence for the localization of speciic parsing and interpretive knowledge in the brain: he futures past and some future directions.
operations in Brocas area (Ben-Shachar et al. 2003; Ben-Shachar, Cognitive Neuropsychology 23: 1338.
Palti, and Grodzinsky 2004; Bornkessel, Fiebach, and Friederici Damasio, A. 1989. Time-locked multiregional retroactivation: A sys-
tems-level proposal for the neural substrates of recall and recogni-
2005; Fiebach, Schlesewsky, and Lohmann 2005). However,
tion. Cognition 33: 2562.
most neuroimaging studies actually show multiple cortical areas
Dapretto, M., and S. Y. Bookheimer. 1999. Form and con-
of activation in tasks that involve syntactic processing, and dif- tent: Dissociating syntax and semantics in sentence comprehension.
ferent areas have been activated in diferent tasks. Overall, these Neuron 24: 42732.
data also suggest variation in the localization of the areas that Davies, R. R., J. R. Hodges, J. R. Kril, K. Patterson, G. M. Halliday, and J.
are suicient to support syntactic processing within the language H. Xuereb. 2005. he pathological basis of semantic dementia. Brain
area across the adult population, although invariant localization 128: 198495.
models are not ruled out (Caplan, Chen, and Waters 2008). Fiebach, C. J., M. Schlesewsky, and G. Lohmann. 2005. Revisiting the
role of Brocas area in sentence processing: Syntactic integration ver-
Overview sus syntactic working memory. Human Brain Mapping 24: 7991.
he left perisylvian association cortex appears to be the most Fodor, J. A. 1998. Concepts. Oxford: Oxford University Press.
Geschwind, N. 1965. Disconnection syndromes in animals and man.
important brain region supporting human language. However,
Brain 88: 23794, 585644.
it is not the sole area involved in these abilities. How this area
Gorno-Tempini, M. L., N. F. Dronkers, K. P. Rankin, J. M. Ogar,
and other brain regions act to support particular language oper- L. Phengrasamy, H. J. Rosen, J. K. Johnson, M. W. Weiner, and B. L.
ations is not yet understood. here is evidence for both locali- Miller. 2004. Cognition and anatomy in three variants of primary pro-
zation of some functions in subparts of this region and other gressive aphasia. Annals of Neurology 55: 33546.
brain areas, and for either multifocal or distributed involve- Grodzinsky, Y. 2000. he neurology of syntax: Language use without
ment of brain areas in other language functions. It may be that Brocaa area. Behavioral and Brain Sciences 23: 47117.
some higher-level principles are operative in this domain. For Hart, J., Jr., and B. Gordon. 1990. Delineation of single-word seman-
instance, content-addressable activation and associative opera- tic comprehension deicits in aphasia, with anatomical correlation.
tions such as those that underlie phoneme recognition, lexical Annals of Neurology 27: 22633.
Holcomb, P. J., and H. J. Neville. 1990. Auditory and visual semantic
access, and lexical semantic activation, may be invariantly
priming in lexical decision: A comparison using event-related brain
localized, while combinatorial computational operations such
potentials. Language and Cognitive Processes 5.4: 281312.
as those that constitute the syntax of natural language may not
Kutas, M., and S. A. Hillyard. 1980. Reading senseless sentences: Brain
be. However, many aspects of these topics remain to be studied potentials relect semantic incongruity. Science 207: 2034.
with tools of modern cognitive neuroscience. Nobre, A. C., and G. McCarthy. 1995. Language-related ield poten-
tials in the anterior-medial temporal lobe: II. Efects of word type and
David Caplan
semantic priming. Journal of Neuroscience 15: 10908.
Petersen, S. E., P. T. Fox, M. Posner, M. Minton, and M. Raichle. 1988.
WORKS CITED AND SUGGESTIONS FOR FURTHER READING Positron emission tomographic studies of the cortical anatomy of
Ben-Shachar, M., T. Hendler, I. Kahn, D. Ben-Bashat, and Y. Grodzinsky. single-word processing. Nature 331: 5859.
2003. he neural reality of syntactic transformations: Evidence from Tulving, E. 1972. Episodic and semantic memory. In Organization
fMRI. Psychology Science 14: 43340. of Memory, ed. E. Tulving and W. Donaldson, 381403. New
Ben-Shachar M., D. Palti, and Y. Grodzinsky. 2004. he neural correlates York: Academic Press.
of syntactic movement: Converging evidence from two fMRI experi- Warrington, E., and T. Shallice. 1984. Category speciic semantic impair-
ments. Neuroimage 21: 132036. ments. Brain 107: 82953.
Bornkessel I., C. Fiebach, and A. Friederici. 2005. On the cost of syntactic Wernicke, K. [1874] 1969. he aphasic symptom complex: A psychologi-
ambiguity in human language comprehension: An individual difer- cal study on a neurological basis. Breslau: Kohn and Wegert. Repr. in
ences approach. Cognitive Brain Research 21: 1121. Boston Studies in the Philosophy of Science. Vol. 4. Ed. R. S. Cohen and
Broca, P. 1861. Remarques sur le sige de la facult du parole articul, M. W. Wartofsky, 3497. Boston: Reidel.
suivis dune observation daphmie (perte de parole). Bulletin de l
Socit dAnatomie (Paris) 36: 33057.
PERLOCUTION
Broca P. 1865. Sur le sige de la facult du langage articul. Bulletin de
la Socit danthropologie 6: 33793. In pragmatics, perlocution refers to the efect speech-acts
Caplan, D., E. Chen, and G. Waters. 2008. Task-dependent and task- have on the hearer (H). J. L. Austin (1962) distinguishes three
independent neurovascular responses to syntactic processing. Cortex types of act that utterances perform simultaneously: locution
44: 25775. (roughly equivalent to the meaning in a propositional sense),
Caplan, D., G. DeDe, and J. Michaud. 2006. Task-independent and task-
illocution (the intended force of the speech-act), and perlocu-
speciic syntactic deicits in aphasic comprehension. Aphasiology
tion. Austin characterizes perlocution as follows: Saying some-
20: 893920.
thing will often, or even normally, produce certain consequential
Caplan, D., G. Waters, G. DeDe, J. Michaud, and A. Reddy. 2007. A study
of syntactic processing in Aphasia I: Behavioral (psycholinguistic) efects upon the feelings, thoughts, or actions or actions of the
aspects. Brain and Language 101: 10350. audience, or of the speaker, or of other persons: and it may be
Caplan, D., G. Waters, D. Kennedy, N. Alpert, N. Makris, G. DeDe, done with the design, intention, or purpose of producing them
J. Michaud, and A. Reddy. 2007. A study of syntactic processing in (1962, 101). Hs reaction to an illocutionary act might be verbal
aphasia II: Neurological aspects. Brain and Language 101: 15177. (e.g., asking a question might prompt an answer), or nonverbal
595
Person
(e.g., an insult may result in a slap in the face), but also an internal groups including speakers but excluding addressees. Further
psychological or emotional state (e.g., a threat might result in H divisions include an impersonal category and a sentient/non-
being frightened or angry). sentient third person opposition. Like number and gender
Although Austin intended perlocution to be an integral part marking, person can also be indicated on agreeing elements,
of a speech-act, later developments of speech-act theory have particularly inite verbs. Present tense English verbs show only
focused almost exclusively on illocution, that is, the speakers third person singular agreement (walk-s), while agreement
mental state or intention (e.g., Searle 1969). As a result, the term on Italian indicative verbs distinguishes three persons in both
speech-act has become virtually synonomous with illocution- singular (parl-o I speak, parl-i you speak, parl-a he/she/it
ary force. his is perhaps unsurprising, given that perlocutions speaks) and plural.
do not always consist of observable behavior (and might therefore Linguistic phenomena related to person include morpho-
be argued to fall outside a linguistic theory of pragmatics; but see logical categories of pronouns and agreement; partial mor-
Gu 1993). Moreover, perlocutions are hard to classify: Not only phological syncretisms among person categories, in pronouns
do certain illocutions allow for a range of possible perlocutions or agreement; interactions of person with the ordering of pro-
(a request, for instance, may result in either compliance or rejec- nominal clitics; interactions of person with case, agreement,
tion by the hearer); there is often no way of knowing whether the or structural position; and surprising restrictions on person com-
achieved perlocution is actually the one the speaker (S) intended binations, usually involving direct and indirect objects (the *me
to achieve (a warning, say, may be intended to make the hearer lui efect, or Person Case Constraint). Such phenomena form the
(H) take evasive action but may only result in frightening him/ empirical basis of morphosyntactic theories of person.
her). Nevertheless, it is easy to demonstrate that perlocutions are here are three principal theoretical approaches to person. A
intrinsic parts of speech-acts since their successful performance traditional insight represents person categories within a hierar-
often depends on them. As Austin points out, an utterance such chy of nominals inluencing pronoun morphosyntax, for exam-
as I bet you 10 dollars the Knicks will win by 5 points is felicitous ple, case and agreement marking in transitive clauses (Dixon
only if it receives uptake, that is, if H acknowledges and accepts 1994, 85). Cross-linguistically, third person is the least marked,
the bet (see felicity conditions). ranking below irst and second. For example, in Georgian, irst
conversation analysis (Sacks 1992) ofers a potential and second person objects are indexed by verbal morphology,
alternative, inductive approach to (verbal) perlocutions based on while verbs with third person objects resemble intransitives. In
local sequential organization. Consider the following exchange: Dyirbal, irst and second person pronouns have nominative/
accusative case marking, while third person pronouns, proper
S: Have a cookie
names, and common nouns show an ergative/absolutive oppo-
H: ehm no thanks Ive just had dinner sition. Some scholars rank irst person highest (Zwicky 1977),
while others regard the ranking of irst and second person as
In this exchange, Ss contribution can be labeled an ofer by vir-
variable.
tue of its being the irst part of an ofer-refusal (or ofer-accep-
Another approach seeks to derive morphosyntactic efects by
tance) adjacency pair. If H recognizes Ss utterance as such,
representing person as a complex category built from elemental
he/she will have to provide a sequentially appropriate response
features. One such feature analysis locates person features
(or perlocution). he second part is thus conditionally reliant on
such as [participant], [speaker], and [addressee] within a univer-
the irst part.
sal geometry of privative pronominal features, in which the avail-
Ronald Geluykens ability of one feature may depend on the presence of another.
An inluential paper by Heidi Harley and Elizabeth Ritter (2002)
WORKS CITED AND SUGGESTIONS FOR FURTHER READING outlines this approach. Another type of analysis treats person
features as binary rather than privative; this allows the grammar
Austin, J. L. 1962. How to Do hings with Words. Oxford: Clarendon
Press. to refer to negative values, such as [speaker]. Robert Rolf Noyer
Gu, Yueguo. 1993. he impasse of perlocution. Journal of Pragmatics (1997) makes a signiicant case for the binary-feature analysis.
20: 40532. A third approach, potentially compatible with the second,
Sacks, Harvey. 1992. Lectures on Conversation. Oxford: Blackwell. associates diferent persons with diferent syntactic representa-
Searle, J. R. 1969. Speech Acts: An Essay in the Philosophy of Language. tions (Ritter 1995; Dchaine and Wiltschko 2002; Bejar 2003).
Cambridge: Cambridge University Press. Within the featural approach, most commentators assume
the existence of features corresponding to irst and second per-
son. However, third person is widely treated as simply lacking
PERSON
such features (Zwicky 1977; Noyer 1997). his analysis correctly
Person is a morphosyntactic property of nominal phrases (nouns predicts certain limits on the typology of person categories
and pronouns) used to indicate the discourse role of their ref- (Greenberg 1966). As noted, some languages have separate cat-
erent. English personal pronouns show three person distinc- egories for inclusive and exclusive we, whose use depends on
tions: irst person, indicating speakers (I, we); second person, whether addressees are included. hus, [addressee] is a distinc-
indicating addressees (you); and third person, indicating dis- tive feature; inclusive ([speaker, addressee]) has it, while irst
course nonparticipants (he, she, it, they). Some languages also person ([speaker]) does not. However, there is no parallel con-
distinguish inclusive and exclusive we: Ojibwa has kiinawint for trast between categories whose use depends on whether nonpar-
groups including speakers and addressees, and niinawint for ticipants are included. For example, no known languages have
596
Person Philology and Hermeneutics
separate categories for inclusive and exclusive plural you, whose Bruening, Yoonjung Kang, and Martha McGinnis, 42549. Cambridge,
use depends on whether nonparticipants are included. Such MA: MITWPL.
observations imply that there is no third person feature, there- Harley, Heidi, and Elizabeth Ritter. 2002. Person and number in pro-
fore no categories such as [speaker, addressee, nonparticipant], nouns: A feature-geometric analysis. Language 78: 482526.
McGinnis, Martha. 2005. On markedness asymmetries in person and
[speaker, nonparticipant], or [addressee, nonparticipant]. hird
number. Language 81: 699718.
person pronouns thus refer to nonparticipants by default, lack-
Noyer, Robert Rolf. 1997. Features, Positions, and Aixes in Autonomous
ing the features that allow reference to discourse participants. Morphological Structure. New York: Garland.
Nevertheless, some phenomena seem to require reference to Ritter, Elizabeth. 1995. On the syntactic category of pronouns and agree-
nonparticipants, for example, syncretism in Mam pronominal ment. Natural Language and Linguistic heory 13: 40543.
enclitics (Noyer 1997) or the Spanish spurious se rule (Bonet Zwicky, Arnold M. 1977. Hierarchies of person. Chicago Linguistic
1991). An obvious solution is to permit limited reference to nega- Society 13: 71433.
tive values, such as [speaker, addressee]. he success of the
privative approach depends on identifying plausible alternative
PHILOLOGY AND HERMENEUTICS
analyses for such cases.
Although the [speaker] and [addressee] features are sui- his entry briely outlines some aspects of the study of linguistics
cient to generate the four main person categories attested cross- leading up to the twentieth century. As two of the earliest, most
linguistically, there is evidence for an additional [participant] thoroughgoing attempts in the West to understand written texts
feature, shared by irst and second person (Farkas 1990; Noyer and spoken discourse, philology and hermeneutics represent
1997; Halle 1997). For example, while Winnebago agreement dis- vital precursors of todays language science. Still synonymous
tinguishes irst and second person, free personal pronouns only with classical studies and historical linguistics, philol-
distinguish participants from nonparticipants (nee, I or you, ogy as both word and practice can be traced to ancient Greece
ee he/she). and Rome. While it is likewise based on a Greek word and while
he argument against a [nonparticipant] feature also applies the problem of interpretation engaged many ancient thinkers,
to the [addressee] feature in languages without an inclusive cat- hermeneutics is often narrowly associated with vigorous philo-
egory (McGinnis 2005). Such languages treat the inclusive as irst sophical debates centered in late eighteenth-century Germany
person, not second (Zwicky 1977; Noyer 1997). hus, in such and originating in Reformation treatises on the right interpreta-
languages, [addressee] is non-distinctive: here can only be an tion of scripture. Today, the heritage of philology and hermeneu-
opposition between [speaker] and non[speaker] participants, tics persists in the modern organization of university disciplines,
not between [addressee] and non-[addressee] participants. If as well as in many indispensable scholarly monuments, such as
[nonparticipant] is nonexistent because it is never distinctive, he Oxford English Dictionary.
then [addressee] is likewise nonexistent in languages without Philology implies love of language and once stood for lin-
an inclusive category. his suggests that the morphosyntactic guistics. Hermeneutics can be deined more speciically as the
contrast between irst and second person is suicient to activate art (or science) of interpretation. he progress from amateur art
[speaker], while [addressee] can be activated only by an addi- to professional science marks the history of both. In their hey-
tional contrast between inclusive and irst person. In such cases, days, philology and hermeneutics were deemed central to all dis-
[addressee] is indeed necessary to capture widespread (and non- ciplines, whether scientiic or humanistic; at other times, either
default) syncretisms between inclusive and second person discipline could also be reduced to trivial pedantry. Among their
most famously identiied in Algonquian languages but common more prescient discoveries are Sir William Joness hypothesis of a
among languages with an inclusive category. For example, the common genetic origin for the evolution of all Indo-European
inclusive pronoun in Ojibwa (kiinawint) shows syncretism with languages nearly a century before Charles Darwins On the
both second person (kiin, plural kiinawaa) and irst (niin, plural Origin of Species and the hermeneutic circle, the feedback-like
niinawint), but not with third (wiin, plural wiinawaa). cycle of interpretation formulated by Friedrich Ast almost 150
years before the birth of cybernetics.
Martha McGinnis
he historical survey to follow highlights the respective ori-
gins, development, and interrelations of philology and her-
meneutics and is singularly appropriate, given the historical
Bejar, Susana. 2003. Phi-syntax: A theory of agreement. Ph.D. diss., predilection of both ields. Because of the limitations of space, the
University of Toronto. focus remains on the European intellectual tradition. However,
Bonet, Eullia. 1991. Morphology after syntax: Pronominal clitics in the theme emphasized here, that early investigations of language
Romance. Ph.D diss., Massachusetts Institute of Technology. sometimes uncannily anticipated modern scientiic paradigms,
Dchaine, Rose-Marie, and Martina Wiltschko. 2002. Decomposing
applies equally to non-Western traditions. In South Asia, for
pronouns. Linguistic Inquiry 33: 40942.
instance, the classical Sanskrit grammar of Panini (ca. sixth to
Dixon, R. M. W. 1994. Ergativity. Cambridge: Cambridge University
Press.
ifth cent. b.c.) strongly preigures generative grammar.
Farkas, Donka. 1990. Two cases of underspeciication in morphology. Interest in the nature and origins of human language goes back
Linguistic Inquiry 21: 53950. to the earliest Western literature, such as the Tower of Babel in
Greenberg, Joseph. 1966. Universals of Language. he Hague: Mouton. Genesis. here is also a fascinating folktale retold by Herodotus,
Halle, Morris. 1997. Distributed morphology: Impoverishment in which an Egyptian pharaoh isolates two children from birth
and ission. In MITWPL 30: Papers at the Interface, ed. Benjamin in order to see what language they will speak presumably the
597
Philology and Hermeneutics
worlds oldest. Nevertheless, although both philology and her- verbal tense and aspect. All these advances were authorita-
meneutics have Greek roots, neither was avowed as a primary tively compiled by Dionysius hrax in his Treatise on Grammar
concern of leading classical philosophers such as Plato and (ca. 100 b.c.), which was so inluential that it was often called
Aristotle. In classical Greek, the keyword logos signiied dis- simply he Manual (and thereby probably subject to extensive
course in many diverse senses, including speech (both lan- later revision by others). Some of this work, such as the eightfold
guage and oration), argument (a single proposition or an division of parts of speech and the treatment of Greek nominal
entire line of reasoning), prose, story, history, reason, and verbal systems, still appears in twentieth-century textbooks.
and thought. Eventually, philologia, like philomatheia, would In Rome, the Greek grammatical heritage was appropriated by
imply studiousness, love of learning in general, since all learn- writers from Varro (On the Latin Language, irst cent. b.c., only
ing at that time revolved around gaining written (and mathe- partially preserved) down to Priscian (ifth to sixth cent. a.d.),
matical) literacy, but Socrates could be called a philologos in the whose exhaustive Principles of Grammar (ca. 500), fortuitously
more original sense of fond of speaking he famously refused designed to assist the Greek speakers of the longer-lived eastern
to write down his ideas. Plato, on the other hand, had fewer com- Roman empire, would become the ultimate authority for learn-
punctions about writing. (In order to elevate written dialogue to ing Latin throughout medieval Europe. Although (as is so often
full-blown dialectic, Plato may himself have coined philosophia the case) much Latin grammatical theory slavishly followed
philosophy as a more rigorous alternative.) he irst classi- Greek models, it was impossible to ignore obvious diferences
cal igure to embrace the title philologos was Eratosthenes, the between the two languages (e.g., Latins lack of an article, one
second Plato, who was one of the librarians of Alexandria and past tense fewer, and one additional case).
a true philomath: He wrote on such diverse ields as geometry, Since the dominant unit of linguistic analysis of the time was
history, philosophy, poetry, and literary criticism. the word, and less so the sentence, the primary achievements
In the classical era, hermeneia interpretation (sometimes of classical language science lay in its descriptive and pragmatic
in the sense of translation) was a secondary philosophi- dimensions, particularly in linguistic pedagogy and the accurate
cal concern, recalling the subsidiary status of the messenger preservation, understanding, and annotation of written texts. For
god Hermes (Roman Mercury). Today, readers of Aristotles instance, we have the Hellenistic era to thank for the invention of
On Interpretation (probably not Aristotles title) may be disap- such scholarly staples as footnotes, commentaries, critical edi-
pointed to ind that this short treatise deals exclusively with the tions, dictionaries, encyclopedias, and library catalogs. On the
logic of propositions. Similarly, Platos dialogue Cratylus is mired other hand, investigations of phonetics and syntax, though
in a shortsighted attempt to show that the names of things may found in some early classical theorists, remained rudimentary.
be both conventional and natural, as if individual letters could And sadly, despite the story of King Mithridates of Pontus (or
somehow coherently imitate reality. (Socrates commitment to Mithradates VI, 12063 b.c.), who was luent in all 22 languages
sound symbolism is satirized in Aristophanes comedy Clouds.) of his subjects, there was almost no formal ethnographic study
Nevertheless, the ancient world made great strides in one par- of the many other now-extinct languages of the Mediterranean
ticular area, namely, grammar (grammatike), which ranged region; non-Greek speakers were simply barbarians (bar-
from the teaching of literacy (including to non-native speakers), baroi, babblers). Lexicographical work was driven by the need
to scholarly description and cataloging of word forms, to literary to translate Greek and Latin, as well as to comprehend archaic
and textual criticism, to more rariied philosophical concerns. texts (e.g., Homer), and many word lists have been preserved as
Like philology, grammar could entail a very wide disciplinary hermeneumata translations and lists of glosses (glossaries,
spectrum. In addition to the question of whether language was from glossai, unfamiliar words). Although prodigious efort
a product of nature (physis) or convention (nomos), an equally from Socrates on was invested in etymology (the pursuit of a
central and ultimately more fruitful debate among grammar- words etymon, truth), this was almost a complete failure since
ians revolved around whether language should be understood ancient philologists did not yet grasp how important phonol-
in terms of analogy or anomaly: Analogia implied that lan- ogy and rules of sound change are for tracing the historical roots
guage was ultimately patterned and governed by regularity, of words. he results ranged from the fanciful to the ridiculous.
and anomalia that language was irreparably disorganized and hus, Latin lignum wood hid potential ignis ire; lepus hare
marred by exceptions. To analogy can be traced the systematicity was light-foot (compounding levis + pes); and words could
that still dominates language science (to say nothing of the leg- stem from their opposites: bellum war was so named for being
acy of prescriptivist correctness in language use), and anomaly not at all bellum beautiful. Much of this dubious heritage was
can be thanked for introducing an honestly empirical dimension compiled by Isidore of Seville (sixth to seventh cent. a.d.), whose
to linguistic studies. Etymologiae remained inluential throughout the medieval
In the century after Plato, Stoic philosophers elevated the period. Many such classical and medieval compilations remain
study of language to a separate philosophical concern, but secondarily valuable, however, because they often preserve the
their treatises have largely been lost. Under the Ptolemies, the sole remaining fragments of hundreds of ancient texts.
Hellenistic librarians of Alexandria reined and advanced all As a time of consolidation and preservation of the Greco-
earlier knowledge of language in their quest to amass, catalog, Roman heritage, the Middle Ages made relatively few signiicant
and edit as many texts in as many ields of knowledge as pos- contributions to the study of language, as for many centuries
sible. his included gathering descriptive word lists of various the Latin culture of Europe lagged behind the Greek learning of
Greek dialects, as well as making detailed analyses of orthogra- the eastern Roman or Byzantine empire and the Arabic schol-
phy, parts of speech (see word classes), morphology, and arship of Moorish Spain. Based on Varros lost writings on the
598
Philology and Hermeneutics
disciplines, Martianus Capellas Marriage of Philology and of languages. In 1440, Lorenzo Valla used historical-linguistic
Mercury (ifth cent. a.d.) formalized the division of the seven lib- evidence to demonstrate that he Donation of Constantine, a
eral arts (the lettered trivium of grammar, logic, and rhetoric, lucrative grant to the church, was a forgery, thereby founding the
and the numeric quadrivium of geometry, arithmetic, music, forensic philology of diplomatics. he Renaissance humanists
and astronomy), cementing the philological basis of Western also revived the learning of Greek, along with Arabic and Hebrew
education for more than a millennium. Capella personiied phi- (considered the original human language), and in the wake of
lology as the mother of the liberal arts (from Latin ars, better Dante, various vernacular languages of Europe and even some
translated today as science), and the irst art was grammar, the languages of foreign lands received grammars of their own. he
learning of literacy through the close study and imitation of clas- languages of the world began to be surveyed, and Joseph Justus
sic texts. he advent of Christianity did not entirely displace the Scaliger sharpened Dantes analysis of the language families of
pagan past, but instead brought new urgency to the problem of Europe (Diatriba de Europaeorum Linguis, 1599). Meanwhile, the
how to comprehend this legacy in the context of the new world- fundamentals of human thought explored by Ren Descartes and
view. One result was the famous multileveled system of allegory, John Locke also inspired such works as the Port-Royal General
a hermeneutics that invited medieval thinkers to integrate three and Rational Grammar (1660) and utopian attempts at invent-
competing cultural systems: the Hebrew Bible and the Jewish ing universal communication systems, such as John Wilkinss
religion; Greco-Roman mythology, literature, and history; and Essay towards a Real Character and a Philosophical Language
orthodox Latin Christianity. Also known as typology, allegori- (1668). he Italian rhetorician Giambattista Vico argued for what
cal interpretation was not limited to biblical texts but could be he called he New Science (1725; revised 1744), an ambitious
extended to read types (emblems, characters) everywhere in philological recreation of the history of human mental and cul-
Gods creation, including the natural world (the second book tural development via a succession of master tropes embodied
after the Bible). in ancient language, laws, and other social institutions. In short,
Although it foreshadows modern linguistic procedures, the Enlightenment brought a return to Eratosthenes multidis-
medieval allegory now seems as empty as classical etymologiz- ciplinary philology: he famous French Encyclopdie of Denis
ing. It is not overly unfair to the philology of the Latin Middle Diderot and others (17512) cites philologie as a universal disci-
Ages to say that it is bracketed by its two greatest authors, its irst pline bridging the sciences and the humanities.
and its last: Augustine and Dante. Certainly, there were import- he year 1768 is justly remembered as a watershed in the his-
ant contributions to the understanding of language in between tory of linguistics: It is the date of the famous paper of the legal
these landmarks, such as the brilliant attempt at orthographical scholar William Jones to the Royal Asiatic Society in Calcutta.
reform via phonetic analysis by the so-called First Grammarian Assigned as a colonial judge in the subcontinent, Jones had set
of twelfth-century Iceland, but it seems typical that this work was about learning Sanskrit, the ancient language in which Indias
forgotten until the nineteenth century. And there were the scho- religious and legal texts are preserved, much as Latin had done
lastic authors of speculative grammars (often under the rubric for Europe. After only a few months of study, Joness brilliant
of modi signiicandi, the means of signifying) who began the surmise was that certain obvious similarities among Sanskrit,
ongoing search for universal principles in language. Yet long Latin, Greek, and other European languages implied a common
before, around the ifth-century fall of Rome, Augustine bril- ancestor, which, crucially, might no longer survive. Such group-
liantly anticipated modern semiotics in On Christian Doctrine, ings had been noticed before, as by Dante and J. J. Scaliger, but
and he was the irst ever to consider the problem of childhood had been explained by the mechanisms of borrowing or decay,
language acquisition in the autobiographical Confessions. rather than by the process of gradual and divergent evolution
Augustine also helped Christianize Capellas seven pagan lib- from a now-dead proto-language. he modern discipline of
eral arts. Meanwhile, the Latin language itself was undergoing historical and comparative linguistics had been born, and the
change, and Augustine could no longer hear the vowel quantities Enlightenments passionate but efete search for language ori-
that underlaid Virgils poetic meter: the Romance languages gins was given a fresh scientiic direction: the problem of proto-
were slowly diferentiating across Europe. A millennium later, linguistic reconstruction.
Dante (who also took the theory and practice of medieval alle- he year 1768 also marked the birth of Friedrich
gory to new heights in his Divine Comedy and elsewhere) wrote Schleiermacher, so inluential in the ield of hermeneutics. Since
a milestone work on language entitled On the Eloquence of the the Reformation, increasing philological concern had been
Vernacular (ca. 1305). hough necessarily and paradoxically brought to bear on the text of the Bible. hough a philological
written in Latin, this uninished treatise argued for the propriety monument in its own right, Jeromes Latin Vulgate (trans. ca.
of using vernacular languages like Italian in literature, and is the 380405) was no longer suicient for the new commentaries and
earliest mapping of European languages based on diferences vernacular translations desired by the Reformers who knew the
that seem to have evolved over time. It was the irst articulation original Hebrew and Greek. his biblical hermeneutics would
of the problem of language change. develop into the inluential higher criticism, one of the trou-
he fact that Johannes Gutenberg worked simultaneously bling scientiic advances that precipitated the Victorian crisis
on printing his famous Bible alongside an edition of the still- of faith. Higher criticism described scripture not as an inspired
ubiquitous Latin grammar of Donatus (fourth cent. a.d.) reveals and inerrant document but as a layered tissue of competing
how the classical world still dominated the early Renaissance. sources that had been edited together at some intermediate
Soon, the rediscovery and promulgation of less-digested ancient time. Stemming from the patterns of stylistic diferences in bibli-
texts and ideas caused a surge in textual criticism and the study cal accounts (e.g., the varying names for God) irst noticed by
599
Philology and Hermeneutics Phonetics
Reformation commentators, the documentary hypothesis sug- Another great paradigm split was marked by the publication
gests that the canonical ive books of Moses (the Pentateuch) of the one-time philologist Ferdinand de Saussures Course in
are carefully patched together from a number of distinct source General Linguistics (1916). Perhaps a victim of its own success,
texts. diachronic philology, which so carefully traced the evolution of
Just as Dante had thought it to apply sacred allegory to his parole, eventually yielded its disciplinary headship of language
own secular literary production, so did Enlightenment students study to Saussures synchronic langue (see synchrony and
of the Bible acknowledge that no special method of interpreta- diachonry and structuralism).
tion should be required for the word of God. As hermeneutic Presently partitioned among various university disciplines,
theorist Johann August Ernesti put it in 1761, the verbal sense of philology and hermeneutics still govern the ields of medieval
Scripture must be determined in the same way in which we ascer- and classical studies, historical linguistics, literary theory and
tain that of other books (quoted in Palmer 1969, 38). he paral- criticism, textual editing, lexicography, prosody and metrics,
lel development of secular higher criticism was also underway. and many others (see Cerquiglini [1989] 1999; Gumbrecht 2003).
In his Introduction to the Correct Interpretation of Reasonable Today, though the normal science of language emphasizes
Discourses and Books (1742), Johann Martin Chladensius became such synchronic contexts as society, psychology, and the brain,
the irst hermeneuticist to argue for the importance of point of there is little doubt that philology and hermeneutics will persist
view (Sehe-Punckt) in interpreting historical texts. Similarly, the and reappear, like Hermes and Mercury, in many new guises in
classical scholar Friedrich August Wolf, who famously insisted the future.
on taking his doctoral degree in philology, rather than philoso-
Christopher M. Kuipers
phy, published his Prolegomena to Homer (1795), which asked the
still-vexed Homeric question: Was there really a single author
behind the Iliad and Odyssey? he concept of the linguistic fam-
ily tree of William Jones also found application in secular textual Boeckh, August. [1886] 1968. On Interpretation and Criticism. Ed. and
editing, as Karl Lachmann (17931851) perfected the method of trans. John Paul Pritchard. Norman: University of Oklahoma Press.
stemmatics to posit nonextant archetypes from which various Cerquiglini, Bernard. [1989] 1999. In Praise of the Variant: A Critical
History of Philology. Trans. Betsy Wing. Baltimore: Johns Hopkins
groups of manuscripts descended and thus to help eliminate a
University Press.
texts accumulated errors.
Gumbrecht, Hans Ulrich. 2003. he Powers of Philology: Dynamics of
While Wolf and others developed Altertumswissenschaft Textual Scholarship. Urbana: University of Illinois Press.
(classical scholarship) and biblical critics analyzed scripture, Mueller-Volmer, Kurt, ed. 1985. he Hermeneutics Reader: Texts of the
Schleiermacher, who himself published on both classical and bib- German Tradition from the Enlightenment to the Present. New York:
lical philology, elevated hermeneutics to a general practice that Continuum.
would ultimately bring it far away from traditional philological Ormiston, Gayle L., and Alan D. Shrift, eds. 1990. he Hermeneutic
concerns. (he primary source for Schleiermachers general her- Tradition: From Ast to Ricoeur. Albany: State University of New York
meneutics are detailed outlines he prepared for his university Press.
lectures, notes partly published in 1819). Hermeneutics followed Palmer, Richard E. 1969. Hermeneutics: Interpretation heory in
this philosophical direction throughout the nineteenth cen- Schleiermacher, Dilthey, Heidegger, and Gadamer. Evanston,
IL: Northwestern University Press. Still the standard introduction.
tury; Wilhelm Dilthey, for example, located hermeneutics as the
Robins, R. H. 1997. A Short History of Linguistics. 4th ed. London:
supporting discipline for the universitys Geisteswissenshaften
Longman.
(human sciences, literally sciences of the spirit). Following
the phenomenology of Edmund Husserl, the hermeneutic pro-
ject was furthered by Martin Heidegger and has continued down PHONEME
to the present in a debate between Hans-Georg Gadamer and
he phoneme is the smallest unit of speech that discriminates
Jrgen Habermas, with wider theoretical ripples still being felt in
one word from another in a particular language. Phonemes
the French and Anglo-American discourses of modernism and
are represented by symbols between slashes thus /p/ or /b/.
postmodernism. General hermeneutics grew to be concerned
Phonemes may have alternate forms, called allophones. For
not only with interpretation per se but also with the very nature
example, in English, the same phoneme /p/ is produced difer-
of understanding, being, and reality itself. Today, hermeneut-
ently in pit and spit. Minimal pairs are used to determine whether
ics has grown more at home with the purer varieties of literary
two speech sounds are allophones or separate phonemes. For
theory and aesthetics than with traditional philologys lower
example, in English, the phonemes /p/ and /b/ distinguish the
criticism.
word pull from bull, and /t/ and /d/ distinguish the word bat
Philology became an increasingly technical mode of histori-
from bad.
cal linguistics during the nineteenth century. Comparative phi-
lologists such as Rasmus Rask, Jacob Grimm, and Franz Bopp Miwako Hisagi
assembled exhaustive phonological and morphological data on
modern and ancient global languages in order to trace their devel-
PHONETICS
opment and interrelationships (sometimes with a troublingly
Orientalist attitude; see Boeckh [1886] 1968, 10, 44). Eventually, What Is Phonetics?
philologys scientiic hypertrophy drove the foundation of sepa- Phonetics is the area of language science research that studies
rate humanistic departments devoted to texts as literature. the articulation, acoustic properties, and auditory perception of
600
Phonetics
speech units (see speech production, acoustic phonet- lawful, albeit complex, physical properties. A second puzzle that
ics, articulatory phonetics , and speech perception , has been much discussed in phonetic research is the puzzle of
respectively). More speciically, phonetics can be understood lack of invariance. his refers to the diiculty of reconciling the
as linguistically informed speech science, and research phone- linguists view that language calls on a small ixed set of pho-
ticians are generally trained linguists who bring to bear their nological (or contrastive) units in organizing its words with the
knowledge of the structural properties of language. Rather than observation that there are no invariant properties of these units
focusing only on one particular language or on universal ana- in the speech signal. Indeed, experiments using sinewave syn-
tomical properties of hearing or articulation, a phonetician thesis have shown that even signals completely lacking nor-
has a special interest in understanding the full range of dis- mal speech cues can nevertheless be perceived as speech and
tinct possibilities in human speech or signed communication. understood. One, but not the only, source of lack of segment-
Because of the important role that linguistics plays in phonetic ability and lack of invariance is the phenomenon of coarticula-
study, most often phonetics inds itself housed academically tion. his refers to the fact that neighboring speech sounds are, in
as a linguistic discipline, though sometimes it inds its home fact, articulatorily coproduced in time and thus interact with one
in engineering, psychology, or in a language-speciic setting. another and mutually shape the speech signal. Consequently,
he most prominent textbook used in educating phoneticians phonological units in natural speech are realized in a highly vari-
is P. Ladefogeds A Course in Phonetics (2006), now in its ifth able, context-dependent fashion.
edition. Speech perception research in both children and adults
Within linguistics, phonetics is related to the ield of probes, in part, how human listeners are able to recover pho-
phonology, another area of theoretical linguistic research. nological units from the speech signal (see, e.g., speech per-
Linguists vary in their opinions regarding the degree of dis- ception in infants) and engage in lexical access (word
tinctness and areas of overlap between the phenomena con- identiication; see word recognition, auditory) (Pisoni
sidered to be the objects of phonetic versus phonological and Remez 2005). his involves understanding how listeners deal
research. Both are concerned with the component speech units with variability in phonetic form and how prior speech and lan-
or building blocks into which words can be divided. However, guage experience shapes these processes. Investigation of these
the general view is that phonetics investigates measurable, puzzles has informed phoneticians theoretical views regarding
physical properties of these speech units, such as the precise the fundamental nature of speech units.
articulation of speech units, their detailed and contextually
dependent acoustic properties, and cross-linguistic variation Ways of Doing Phonetic Research
in these physical properties. Phonology, in contrast, is gener- here are a number of areas of inquiry in the ield of phonetics,
ally concerned with how these speech units are combined or and these generally fall under the purview of articulatory pho-
organized into acceptable word forms within a language (e.g., netics, acoustic phonetics, or speech perception. We encounter
allowable sequences) and with the underlying principles of some issues related to each of these areas in the following, but
organization shared across languages (see phonology, uni- irst, it is worthwhile to consider the two general approaches to
versals of ). On analogy to chemistry, phonetics investigates phonetic research. he irst focuses on the description, classii-
subatomic structure, and phonology studies the formation of cation, and transcription of speech sounds, the second on exper-
molecules out of basic atoms. Traditionally, the phonological imental phonetics.
structure has been viewed as cognitive or grammatical, while Traditionally, the irst approach was done by ear, thanks to
the phonetic structure has been viewed as purely physical and the carefully trained abilities of phoneticians, often trained in
implementational. However, the dividing line between cogni- a direct line of descent from one practitioner to another. he
tive and physical has blurred or dissolved over the years (e.g., International Phonetic Association is a more than century-old
Browman and Goldstein 1995). organization whose aim is to promote the scientiic study of
phonetics and its practical applications. he association has
Well-Known Theoretical Puzzles in Phonetics provided, with regular updates over the years, a consensus
here are a number of well-known puzzles in the area of pho- International Phonetic Alphabet (referred to as the IPA, as is
netics whose empirical and theoretical consideration has helped the association itself) that serves as a notational standard for the
lead to our current understanding of some fundamental aspects phonetic transcription of all sounds known to exist contrastively
of the linguistic speech system. As one example, phoneticians in the worlds languages (and many noncontrastive variations of
have an abiding interest in understanding how to reconcile a these sounds) (IPA 1999). he latest version of the IPA was pub-
linguistic view of speech as being composed of concatenated lished in 2005 and is displayed in Figure 1.
symbolic units with its physical realization in articulation and his transcription system is a standard reference in the ield
acoustics in which there are no silences, separations, or obvious of phonetics and has been an important tool for description and
criteria for segmentation between these units. We can refer to classiication. Phoneticians doing work of this sort must deter-
this puzzle as lack of segmentability. It is famously acknowledged mine what the linguistically relevant speech categories are that
in Charles Francis Hocketts (1955) Easter egg analogy, which is, what counts as linguistically the same and diferent and what
describes the phonetic speech production processes as making a principled (or idiosyncratic) variation is observed among these
smeared mess out of neat Easter eggs moving through a wringer. speech units. As can be seen from the IPA chart, phoneticians
Gradually, however, the ield has come to understand that rather have identiied important dimensions of variation, in particular,
than a mess, the speech produced by humans is governed by for consonants:
601
Phonetics
Figure 1. The IPA Chart. Reprinted with permission from the International Phonetic Association. Copyright 2005 by
International Phonetic Association.
place in the vocal tract at which a consonant is articulated or height [high-mid-low], related to the lowest resonant fre-
creates its constriction; quency (the irst formant) of the vowel; and
manner of articulation, which refers generally to the type of backness [front-central-back], related to the distance between
constriction: complete closure for stops, narrow closure for the irst and second resonant frequencies (formants) of a
fricatives, constrictions having nasal or lateral airlow; and vowel.
voicing, whether the vocal folds are vibrating or not. Rounding or lip protrusion or compression is also encoded in
the symbol choice itself.
For vowels, the variations are captured in a continuous plane
whose dimensions can be identiied with auditory properties In addition, in order to adequately describe speech units, the
called: mechanism by which the air moves in the vocal tract must be
602
Phonetics
6000
5000
Frequency (Hz)
4000
Figure 2. A spectrogram of the sentence There are

3000
no silences here. In a spectrogram, time is displayed
2000 on the x-axis, frequency (in Hz) on the y-axis, and
amplitude in grayscale darkness.
1000
0 1450
Time (ms)
identiied. All languages use pulmonic sounds with air lowing nonlinguistic variables, such as age, gender, speaking rate and
out from the lungs, but some languages also move air by laryn- style, afect, or language background, inluence these detailed
geal (glottalic) or tongue (velaric) maneuvers. Other important speech properties.
linguistic properties of speech units can include distinctions In experimental phonetics, the development of speech syn-
in the tone (i.e., placement in and/or movement through the thesis played a critical role in researchers ability to design and
speakers pitch range), phonation type (i.e., the mode or quality execute speech perception experiments by allowing for stimuli
of vocal fold vibration and amount of laryngeal airlow), and VOT with well-controlled acoustic properties. his ushered in a new
(voice onset time: the temporal coordination of an oral constric- era of experimental speech perception research that examines
tion with a laryngeal event). how humans utilize all of the myriad informational cues pres-
Much of the most important phonetic work of this sort has ent in the acoustic signal. Another particular body of experimen-
been done in the ield by phoneticians working with native tal work called laboratory phonology seeks to inform questions
speakers of languages (Ladefoged 2003), often languages that of linguistic representation and processes in phonology via
are poorly documented or possibly endangered. he most experimental phonetic data. his work generally takes a cogni-
authoritative description of the consonants and vowels of the tive science perspective and has been archived in the multivol-
worlds languages can be found in he Sounds of the Worlds ume Papers in Laboratory Phonology collection (arising from a
Languages (Ladefoged and Maddieson 1996). In addition to regular Conference on Laboratory Phonology, which has met
describing the range of possible variation in the units used to every other year since 1987). Browman and Goldstein (1991) and
build human speech, phoneticians also address the question of Beckman and Edwards (1994) provide classic examples of this
universal properties of human speech systems. Finally, descrip- type of phonetics.
tive phonetics can also address variation within a language, such
as geographical dialect variation. his is one type of socio- Other Areas of Phonetic Inquiry
phonetics (other types include investigations of gender, age, or Other important areas of inquiry in the ield of phonetics include
class, for example.) For English, an impressive example of this investigation of the biomechanics or functional behavior and
type of phonetic investigation can be found in the Atlas of North coordination of the moving vocal tract (Saltzman and Munhall
American English (Labov, Ash, and Boberg 2006). 1989; Guenther 1995), the role of audition and auditory processing
Whereas descriptive phonetics was traditionally done by ear, in speech communication (e.g., psychophysics of speech),
a wide variety of current instrumental techniques is brought and the vocal tract as a sound-producing device, often charac-
to bear as well. Instrumental phonetics might utilize acoustic terized in terms of source-ilter theory (Stevens 1998; Fant 1960).
analysis such as digitized waveforms, and spectrograms (see Source-ilter theory has provided a sophisticated mathematical
Figure 2); pitch and formant tracking; articulatory analysis, such understanding of how noise sources at the larynx and along the
as provided by laryngoscopy, palatography, magnetometry, vocal tract are shaped by the geometry of the vocal tract and its
ultrasound, and MRI; and perceptual information such as that particular resonance properties to yield the output speech. he
provided by discrimination and categorization experiments and nonlinear properties of the articulatory-acoustic mapping have
even eye-tracking and neuroimaging. been argued to be important in understanding constraints on
Work on the other general type of phonetic research, exper- the sound inventories of languages (Stevens 1989). Other pho-
imental phonetics, also utilizes a wide variety of instrumen- neticians focus on listener-oriented motivations, such as maxi-
tal approaches, but in this case, the data characterize human mizing auditory distinctions in shaping sound systems, rather
behavior in the processes of producing and perceiving speech, than speaker-generated inluences. Clearly, speech systems are
or relect quantitative rather than purely qualitative prop- adaptive to communicative and situational demands (Lindblom
erties of speech. Experimental phonetics often investigates 1990). Speakerlistener interactions may give rise to change in
how linguistic variables, such as segmental context, sylla- word forms over time, that is, diachronically (see phonology,
ble structure, or prosody, inluence the detailed properties evolution of; syncrhony and diachrony), and they may
of speech, such as its timing, articulation, spectral character- give rise to synchronic adjustments speciic to the interlocutors
istics, or intonation. Alternatively, it might examine how and the situation.
603
Phonetics
Connections to Other Fields understanding dyslexia. It is critical for educational success

Phonetics is an interdisciplinary area of linguistics; for exam- that reading teachers are made aware of the importance of
ple, we have sketched its connection to phonology. It also can characteristic diferences between speech and reading, of how
closely tie into other areas of experimental linguistics, particu- speech knowledge can be leveraged in the teaching of reading,
larly psycholinguistic research on spoken language pro- and of how interference from the phonetic properties of native
duction and processing and neurolinguistic research on brain languages can inluence the acquisition of reading in non-native
function. Knowledge of the phonetic properties of languages languages (Rayner et al. 2002).
and of the characteristics of the speech signal is critical to the A synergistic relationship exists between phonetics and the
design of experimental linguistic and neurolinguistic research ield of biomedical imaging. Advances in imaging of the vocal
programs that examine speech production and processing (see tract and larynx have greatly illuminated our understanding of
phonetics and phonology, neurobiology of; brain speech production. In turn, new techniques for upper airway
and language). Such speech-related work might address lexi- and laryngeal imaging and image analysis have been developed
cal access, speech production planning, neural localization of by phoneticians. hese techniques can be incorporated into the
various functions related to speaking and speech understanding ield of clinical phonetics and speech pathology. Traditional
(see lexical processing, neurobiology of), the integra- types of descriptive and instrumental phonetics have also
tion of visuofacial and auditory information, and the relation of found utility in the understanding of clinical challenges such
action and perception (see mirror systems, imitation, and as apraxia, stuttering, phonological disorders, and voice dis-
language). orders. Indeed, the National Institute on Deafness and Other
Furthermore, there are many ields outside of linguistics Communication Disorders is one of the largest funding sources
on which phonetics has a direct bearing. In the area of speech for phonetic research. Currently, there is enormous interest in
technology, linguistic phonetic knowledge can contribute making cochlear implants as successful as possible for their user
sometimes directly, sometimes indirectly to machine speech populations. Knowledge of the acoustic properties of speech and
synthesis and recognition (see voice interaction design). of methods for assessing perception adds to the broad body of
And conversely, much early work in acoustic phonetics grew technological, engineering, and audiological knowledge cur-
out of the eforts of speech engineers, for example, at Bell rently contributing to this efort.
Laboratories, Haskins Laboratories, Massachusetts Institute Phonetics is one of the foundational areas of linguistic research
of Technology, the Joint Speech Research Unit in England, the and language science. It focuses on the descriptive, quantitative,
Speech Transmission Laboratory in Sweden, and the Advanced and behavioral aspects of speech production, transmission, and
Telecommunications Research (ATR) Institute International perception. Phonetic knowledge helps guide our understanding
in Japan. Currently, in the ield of speech engineering, there is of the phonological representations and patterning observed in
interest in capturing linguistic knowledge in ways that will allow human language. Phonetics also makes interdisciplinary con-
better system performance with conversational interfaces and tact with speech technology, biomedical imaging, forensics, and
with audiovisual speech. pedagogical and clinical ields.
Phonetic science also has utility in forensics, and forensic
Dani Byrd
phonetics is a recognized area of applied science (see forensic
linguistics). Forensic experts bring both instrumental and
expertlistening techniques to the determination of whether a
suspects voice is a likely or unlikely match to forensic evidence Beckman, Mary E., and Jan Edwards. 1994. Articulatory evidence for dif-
that investigators have in hand. It should be noted, however, that ferentiating stress categories. In Phonological Structure and Phonetic
there is no unique identiier in the voice of an individual that is Form: Papers in Laboratory Phonology. Vol 3. Ed. Patricia A. Keating,
733. Cambridge: Cambridge University Press.
analogous to a ingerprint. Phoneticians are frequently called on
Browman, C. P., and L. Goldstein. 1991. Tiers in articulatory phonology,
in such speaker-identiication cases to provide expert knowledge
with some implications for casual speech. In Papers in Laboratory
and testimony as to the many subtle properties that may distin- Phonology. Vol. 1: Between the Grammar and the Physics of Speech.
guish one individuals speech from that of another. Ed. J. Kingston and M. E. Beckman, 34176. Cambridge: Cambridge
Another ield outside of linguistics is often, in fact, an indi- University Press.
viduals irst contact with phonetics second language pedagogy . 1995. Dynamics and articulatory phonology. In Mind as
(see bilingual education). An accurate understanding of Motion: Explorations in the Dynamics of Cognition, ed. Robert F. Port
how a languages speech sounds are articulated proves helpful and Timothy Van Gelder, 17593. Cambridge, MA: MIT Press.
in pronunciation instruction. Instrumental techniques for dis- Fant, Gunnar. 1960. Acoustic heory of Speech Production. he
playing feedback on articulation, speech acoustics, or linguistic Hague: Mouton.
categorization can also help in training production and percep- Guenther, F. H. 1995. Speech sound acquisition, coarticulation, and rate
efects in a neural network model of speech production. Psychological
tion of non-native linguistic contrasts.
Review 102: 594621.
he paramount area of the inluence of theoretical phonol-
Hardcastle, William J., and John Laver. 1997, eds. he Handbook of
ogy and phonetics on pedagogy is in the teaching of reading Phonetic Sciences. Oxford: Blackwell.
(see teaching reading). Linguists from diverse backgrounds Haskins Laboratories. A speech and reading laboratory in New Haven,
and groups have taken a leadership position in emphasizing CT, that maintains a Web site at http://www.haskins.yale.edu.
the importance of phonemic awareness (see phonologi- Hockett, Charles Francis. 1955. A Manual of Phonology.
cal awareness) for the acquisition of reading skills and for Baltimore: Waverly.
604
Phonetics and Phonology, Neurobiology of
IPA (International Phonetic Association). 1999. Handbook of the the microlevel are necessary for understanding how neurobio-
International Phonetic Association: A Guide to the Use of the logical methods are used to examine phonetics and phonology
International Phonetic Alphabet. Cambridge: Cambridge University (see Kandel, Schwartz, and Jessell 2000; Shafer and Garrido-
Press. he association maintains a Web site at http://www.arts.gla. Nag 2007, for greater detail).
ac.uk/IPA/ipa.html.
First, brain function is in terms of electrochemical mes-
Johnson, Keith. 2003. Acoustic and Auditory Phonetics. 2d ed.
sages between neurons. neuroimaging methods index dif-
Oxford: Blackwell.
Labov, W., Ash, S., and C. Boberg. 2006. Atlas of North American
ferent aspects of these processes and the metabolic processes
English: Phonetics, Phonology and Sound Change. Berlin: Walter de that support these. Electrophysiological methods (electroen-
Gruyter. Available online at: http://www.langsci.ucl.ac.uk/ipa/. cephalogram [EEG], magnetoencephalogram [MEG]) record
Ladefoged, P. 2003. Phonetic Data Analysis: An Introduction to changes in electrical potential at the scalp. hese changes are the
Instrumental Phonetic Fieldwork. Oxford: Blackwell. result of the synchronous iring of large assemblies of neurons.
. 2006. A Course in Phonetics, 5th ed. Boston: homson Functional magnetic resonance imaging (fMRI) and positron
Wadsworth. emission tomography (PET) measure changes in the metabo-
Ladefoged, Peter, and Ian Maddieson. 1996. he Sounds of the Worlds lism of oxygen, and PET can also measure changes in the chemi-
Languages. Oxford: Blackwell. cal aspect of the electrochemical signals sent between neurons.
Lindblom, B. 1990. Explaining phonetic variation: A sketch of the H&H
hese changes in electrochemical and metabolic measures are
theory. In Speech Production and Speech Modeling, ed. W. Hardcastle
used to make inferences about timing and localization of neural
and A. Marchal, 40339, Dordrecht, the Netherlands: Kluwer.
Miller, J., ed. 1991. Papers in Speech Communication. A three-volume ser-
activity related to some stimulus or event.
ies published by the Acoustical Society of America (New York) through A second point is that diferent brain regions have distinctive
the American Institute of Physics. structure in terms of neurons and connectivity and that these
Pisoni, D. , and R. Remez, eds. 2005. Handbook of Speech Perception. distinctions are the basis of Korbinian Brodmanns classiication
Malden, MA: Blackwell. system. For example, primary auditory cortex (Brodmanns area
Rayner, K., B. R. Foorman, C. A. Perfetti, D. Pesetsky, and M. Seidenberg. [BA] 41) has a thick layer of neurons specialized to receive infor-
2002. How should reading be taught? Scientiic American mation from the peripheral auditory system. hese neurons then
286: 8491. send signals to other cortical regions but not directly back to the
Saltzman, E. L., and K. G. Munhall. 1989. A dynamical approach to
periphery for motor responses. Ultimately, phonological func-
gestural patterning in speech production. Ecological Psychology
tioning will need to be described in terms of connectivity at this
1: 33382.
neural level for a complete understanding of the brainbehavior
Stevens, K. 1989. On the quantal nature of speech. Journal of Phonetics
17: 345 relationship.
Stevens, K. 1998. Acoustic Phonetics. Cambridge, MA: MIT Press. At the macrolevel, neurobiology of phonetics/phonology is
described in terms of the activated brain regions and the timing
of activation of these regions in perception or production (see
speech perception and speech production). hese brain
PHONETICS AND PHONOLOGY, NEUROBIOLOGY OF regions are referred to by Brodmanns areas, by names describing
he study of the neurobiology of phonetics and phonol- function (e.g., primary auditory cortex), by the scientist involved
ogy focuses on the brain mechanisms that support percep- in identifying the regions (e.g., brocas area), or by some term
tion and production of linguistic phonological forms. his entry describing an attribute of the regions (e.g., Greek hippocampus
describes the neural structures and processes underlying pho- for a region that is shaped like a seahorse).
netic and phonological processing and briely discusses four he principal brain structures involved in phonetic/phono-
current theoretical controversies that neurophysiological data logical perception are found in the perisylvian cortex and
can help address. First, are there invariant relationships between include primary (BA 41) and secondary (BA 42) auditory cortex
acoustic properties and phonological categories? Second, for processing the acoustic-phonetic aspects of speech (Scott and
does speech have some special status apart from other acoustic Wise 2004) (see Color Plate 10). Sound in general (e.g., noise)
information? hird, is there a critical period for language- activates bilateral regions of the dorsal plane of the superior
speciic learning? Fourth, to what degree does biology constrain temporal gyrus (STG) and regions of the lateral STG. In contrast
the nature of phonological systems? with noise, temporally complex signals, including speech, more
strongly activate the dorsal region of STG, and the lateral STG
Neurobiological Underpinnings activation extends more ventrally. Auditory information identi-
he physiology (or function) of phonological processing is ied as speech compared to non-speech leads to increased acti-
described in terms of the structures (anatomy) activated in vation of regions of the STG and superior temporal sulcus (STS)
processing and the function of these structures. A compre- that are more anterior and ventral (inferior). he left STS appears
hensive understanding of the physiology requires explication to be active in mapping speech onto lexical-semantic repre-
at the micro- and macrolevels of processing. he microlevel sentation. In contrast, the right STS shows sensitivity to melodic
describes the microstructures and their processing (neuron, features. he left planum temporale (PT, in superior posterior
axon, synaptic potential) and are general to brain function, temporal cortex) is believed to have a special role in phonetic/
whereas the macrolevel focuses on larger-scale structures and phonological processing and appears to support a motor/sen-
processes speciic to a particular motor, sensory, or cognitive sory interface for acoustic information. A left-greater-than-right
process (e.g., phonetic processing). Several points concerning asymmetry is generally stronger for speech than non-speech (see
605
left hemisphere and right hemisphere). Anterior regions appear to index acoustic levels of processing. To date, there is
are also activated in speech perception. he left prefrontal cor- no clear evidence that language experience at the phonological
tex (BA 46) is activated in phonological processing in access- level directly afects processing in the time range of the P1 and
ing, sequencing, and monitoring phonemes and processing N1 components.
transitions from consonants to vowels or vowels to consonants. ERP components occurring later in time are related to higher-
Articulation of phonetic information is supported by motor (BA level cognitive processes. hose showing modulation by phono-
4) and premotor/Brocas area cortex (BA 6, BA 44/BA 45). logical experience include mismatch negativity (MMN), N2b,
Recent models have organized these observations into a sim- P3b, and N400 (Ntnen 2001; Kujala et al. 2004). Listeners
ple framework in which the more dorsal regions (i.e., posterior show more robust MMNs (peaking between 100 and 300 ms and
and superior) are active in auditory-motor integration during indicating preattentive, automatic processing) in discriminating
speech perception and the more ventral regions (anterior and pairs of sounds with which they have had experience (Ntnen
inferior) are more involved in the speech-meaning interface. 2001). Speciically, the MMN is smaller or later to a contrast in
his indicates that the phonetic aspects of processing, which speech sounds if the speech sounds are assimilated into one pho-
are independent of meaning, will be carried out in more dorsal nological category for listeners (e.g., Japanese listeners percep-
regions of the auditory and motor cortex, whereas the phonolog- tion of English [l] vs. [r]) or if the speech sounds are assimilated
ical aspects, which are the basis of meaningful distinctions, are into two categories, but one or both sounds are poor exemplars
processed in more ventral areas of the auditory cortex. he exact of these categories (e.g., English listeners perception of Hindi
roles of STG, STS, and the two hemispheres in phonetic and pho- retrolex [Da] versus [ba]; Shafer, Schwartz, and Kurtzberg 2004).
nological processing have not been deinitively established yet, he later components, N2b, P3b, and N400, are observed when
but it is known that these areas are all important in speech pro- a participant is asked to actively discriminate a speech contrast.
cessing (Poeppel and Hickok 2004). No discernible N2b, P3b, or N400 is observed if discrimination
Anterior and posterior brain regions involved in phonetic is very diicult (chance performance). If discrimination is bet-
and phonological processing communicate directly via bundles ter than chance but more diicult than for native listeners, then
of ibers (axons), such as the arcuate fasciculus, but also via more these components are later and larger than those found for the
indirect routes, including the basal ganglia, thalamus, and native group. For example, English speakers showed reasonably
cerebellum. hese additional structures are involved in gen- good discrimination of Japanese (JP) vowel duration (taado
eral functions related to information processing, motor plan- versus tado), but a later and larger P3b component compared
ning, and coordination and will not be discussed further here. to native Japanese listeners (Hisagi 2007).
he timing of activation of levels of phonetic and phonological Integrating the knowledge of location obtained from fMRI/
processing has largely been provided by EEG and MEG mea- PET and timing obtained from EEG/MEG indicates that acoustic-
sures. he timing of auditory processes can be roughly related phonetic processing occurs in primary and secondary auditory
to levels of processing in the primary and secondary auditory cortical regions between 10 and 100 ms, followed by phonolog-
cortex and to the timing of more basic (e.g., signal detection) ical aspects of processing, presumably in more ventral regions,
versus higher-level cognitive processes (phonological discrimin- between 100 and 400 ms. his model is supported by studies
ation). he principal method used to investigate these processes localizing the sources of N1 and the phonologically elicited N400
is event-related potentials (ERPs). he EEG/MEG is time-locked (Kujala et al. 2004).
to a stimulus of interest (e.g., ba), and this stimulus is deliv-
ered multiple times (anywhere from 20 to 10,000, depending on Lack of Invariance Problem
the ERP component of interest). he portion of the EEG/MEG A major theoretical debate in speech perception over the past 40
time-locked to the stimulus is averaged to remove noise (i.e., years has been the relationship between acoustic and phonolog-
activity produced by unrelated processes). ical properties. Speech with similar acoustic properties may be
ERPs are described in terms of the latency, polarity, and assigned to diferent phonological categories, and, conversely,
topography of peaks that vary with some stimulus property or speech with diferent acoustic properties is sometimes assigned
cognitive process. hese identiied peaks are often referred to as to the same phonological category. Much research focused on
components. he P and N in a component label refer to positive discovering invariant properties of speech sound categories that
and negative polarity, respectively, and the number indicates would allow for precise categorization has failed to do so.
the approximate peak latency (e.g., N400) or the position in a A recent model can be used to illustrate how neurophysi-
sequence (e.g., N2). ological data can address the lack of invariance issue. In this
Studies of auditory processing have shown that auditory model, speech is categorized and identiied by an active pro-
information enters primary cortical regions between 10 and 50 cess of hypothesis testing (e.g., Magnuson and Nusbaum 2007).
ms following contact with the outer ear and that a frontocentral Diferent types of information are used with regards to the type
positivity peaking around 50 ms (P1 component) and negativ- and amount of sensory and lexical information available. For
ity peaking around 100 ms (N1 component) index activity in the example, clear auditory-speech information and knowledge
primary and secondary auditory cortex. Neurobiological studies of the possible phoneme categories of a language lead to reli-
with animals suggest that P1 indexes input from the periphery ance on auditory information in categorization. More ambigu-
into the superior temporal plane of the auditory cortex and that ous auditory-speech information can lead to greater reliance on
N1 relects activity of neurons in the secondary auditory cortex visual information (e.g., lip closure for [p] but not [t]). In other
receiving information from other cortical regions. P1 and N1 words, there are many routes to phonological categorization.
606
If this model is viable, then neurophysiological data will show Alternatively, listeners may have diiculty refocusing their atten-
whether diferent sensory and motor cortex are activated when tion to the relevant cues needed for rapid processing of the sec-
speech is more versus less clear and when other information ond language (Strange and Shafer 2008).
(e.g., visual) is available. Several recent studies have shown more Neurophysiological data can address this question by
involvement of the motor cortex and visual sensory areas for examining where in the nervous system diferences in process-
ambiguous acoustic speech information when facial informa- ing are found for irst and second language learners. he cur-
tion is available, and less activation of these regions when only rent research has not shown diferences earlier than the MMN
the speech signal is available (e.g., Skipper, Nusbaum, and Small response. Furthermore, a recent study from our laboratory sug-
2006). gests that attention plays a role in loss of ability to learn novel
In summary, this example illustrates the importance of categories. Speciically, listeners learn to automatically attend to
neurophysiological data for addressing long-standing theoret- relevant cues in their irst language and can only overcome these
ical controversies. weightings with great attentional efort. his result suggests that
the loss of sensitivity in adjusting to novel phonological catego-
Does Speech Have Some Special Status Apart from Other ries by second language learners is not directly due to a closure
Acoustic Information? of a critical period for changing the sensitivity or resolution of the
Over the past forty years, there has been a debate regarding primary and auditory cortex; rather, it is due, at least in part, to
whether speech requires a special type of auditory processing attentional issues (Hisagi 2007).
speciic to humans. Behavioral studies have delivered mixed hese indings do not answer all the questions regarding
answers to this question. For example, studies have shown that critical and sensitive periods for setting up phonetic and pho-
speech (in particular, consonants) is perceived categorically, nological categories since second language learners acquired
rather than continuously, and used this to argue for special sta- categories for a irst language early in life. Recent research exam-
tus. On the other hand, other species (e.g., chinchillas) are shown ining the neurophysiological and behavioral consequences of
to perceive speech categorically, and complex non-speech audi- deprivation of hearing, which is reversed by cochlear implants,
tory sounds can be categorically perceived. will have much to contribute for addressing this question. Recent
Neurophysiological data can help examine this question by advances have led to implantation at earlier ages, which is allow-
determining whether the same structures and processes sup- ing researchers to compare the quality of phonological process-
port processing of speech and non-speech. he current available ing across diferent ages of irst exposure to speech information.
data suggest that in one sense, speech and non-speech are simi- Improvements in these implanted devices will also allow exami-
lar. he same auditory cortical regions are activated in process- nation of how the quality of auditory-speech input impacts pho-
ing speech and non-speech, as described previously (also see netic and phonological systems. his emerging area of research
Dehaene-Lambertz and Gigla 2004). Furthermore, the sensory- is likely to provide less ambiguous evidence regarding a critical
motor links found for speech are similar to those seen for other or sensitive period for speech.
sensory-motor links (e.g., tool manipulation using visual and
motor regions) and seen in other species (see Skipper, Nusbaum, To What Degree Does Biology Constrain the Nature of
and Small 2006). Phonological Systems?
In another sense, the neurophysiological data suggest A classic debate in linguistics concerns the extent to which lan-
that the processing of speech difers from that of non-speech. guage is innate. A more useful way to ask this question is what
Speciically, as described previously, more ventral areas (lateral biological constraints are placed on the nature of phonological
and anterior superior temporal gyrus) become involved in pho- systems and how environmental input contributes to construct-
nological processing of speech sounds because these sounds ing these systems. Across languages, there are common patterns.
are relevant for making meaning contrasts. It is possible that For example, all languages contrast /i/ (heep), /u/ (hoop), and
humans are the only species that fractionate sound symbols into /a/ (hop) (although there can be slight variations in the actual
subcomponents (phonemes) that can be manipulated to create production of these sounds), and some languages only contrast
novel symbols, and, in this way, speech is special. these three vowels. However, there is no existing language that
only contrasts i (in bead), I (in bid), and e (in bed) without
Is There a Critical Period for Language-Speciic Learning? also contrasting /i/, /u/, and /a/. It is possible that these univer-
Researchers have long noted that learning a second language sal patterns are due to biological constraints. On the other hand,
late in life typically results in a stronger non-native accent and they may be attributed to environmental factors. Examination of
poorer speech perception in the second language (see Strange the evidence suggests that the system is constrained by an inter-
and Shafer 8; see also second language acquisition). One action of biological and environmental constraints. For example,
explanation for this pattern is that there is a critical or sensi- /i/, /u/, and /a/ are perceptually more distinct than i (in bead),
tive period in which phonological information must be learned I (in bid), and e (in bed), and this is a property of the auditory
in order to lead to native-like performance (see phonology, system; the environment (input) leads to less salient distinctions
acquisition of). Some research suggests a gradual loss of abil- included in some languages, but many possible distinctions are
ity to alter phonological categories up to puberty. never found in languages.
he reason for this change in ability is unknown. It could be Neurobiological data will aid in further elucidating how devel-
that the auditory cortex is altered at an early level so that it loses opment of phonological systems is constrained by instructions
the ability (or resolution) to respond to non-native contrasts. from the genetic code and emerges from patterns in the input. In
607
Phonetics and Phonology, Neurobiology of Phonological Awareness
particular, examination of the way that genetic variation afects Skipper, Jeremy I., Howard C. Nusbaum, and Steven L. Small. 2006.
the development of speech processing and its neurophysiologi- Lending a helping hand to hearing: Another motor theory of speech
cal substrate will help us understand the contributions of biology perception. In Action to Language via the Mirror Neuron System, ed.
and the environment. For example, studies of congenitally deaf Michael A. Arbib, 25084. Cambridge: Cambridge University Press.
Strange, W., and V. L. Shafer. 2008. Speech perception in second language
populations have revealed that some brain regions that are typi-
learners: he re-education of selective perception. In Phonology and
cally specialized for audition (e.g., regions of secondary auditory
Second Language Acquisition, ed. M. Zampini and J. Hansen, 15392.
cortex) are used in visual processing and thus are highly sensi- Cambridge: Cambridge University Press.
tive to input. In contrast, more primary regions specialized for
audition (primary auditory cortex and subcortical areas) do not
PHONOLOGICAL AWARENESS
reorganize to take on nonauditory functions and are thus less
sensitive to input. An understanding of the relationship among Phonological awareness encompasses the broad class of abili-
the genetic code, neural connectivity, and plasticity of auditory ties that enable one to attend to, isolate, identify, and manipulate
and language-association brain regions will help to create real- the speech sounds in spoken words. he domain of phonologi-
istic models of phonetic/phonological development and pro- cal awareness abilities can be subdivided into two levels. he
cessing, which in turn will help to answer how biology and the irst, phonological sensitivity, pertains to conscious aware-
environment contribute to the development of this system. ness of larger, more salient sound structures within words,
including rhymes and syllable structures (i.e., syllables and
Conclusion subsyllabic units) (Scarborough and Brady 2002). (Rhymes,
his entry illustrated the importance of neurobiological data in deined at the word level, consist of the stressed vowel and
addressing signiicant questions concerning phonetic and pho- what follows [e.g., be/we; feather/weather]; subsyllabic units
nological processing. In particular, an understanding of the neu- include onsets, i.e., the portion of each syllable preceding the
robiology supporting phonetic and phonological processing will vowel [e.g., be; spot; magnet], and rimes, i.e., the remaining por-
allow researchers to construct better models of processing and to tion [e.g., be; spot; magnet]). he second level of phonological
address questions related to irst and second language learning awareness, phoneme awareness, refers to explicit awareness of
and disorders (such as dyslexia and aphasia) attributable to the individual phonemes making up words. Generally, children
deicits in phonological processing. acquire at least some degree of phonological sensitivity prior
to phoneme awareness (see phonology, acquisition of ).
Valerie Shafer However, questions remain as to whether attainment of pho-
nological sensitivity is a necessary prerequisite for the devel-
WORKS CITED AND SUGGESTIONS FOR FURTHER READING opment of phoneme awareness (Gillon 2005). When children
Dehaene-Lambertz, G., and T. Gigla. 2004. Common neural basis for begin to acquire phoneme awareness, they usually irst are able
phoneme processing in infants and adults. Journal of Cognitive to isolate and identify the external phonemes (i.e., the begin-
Neuroscience 16: 137587. ning and/or inal phonemes in words). Ultimately, proiciency
Hisagi, M. 2007. Perception of Japanese temporally-cued phonetic con- in phoneme awareness entails the ability to segment, identify,
trasts by Japanese and American English listeners: Behavioral and elec- and blend all of the individual phonemes, including those
trophysiological measures. Ph.D. diss., City University of New York. within consonant clusters (e.g., in words such as blast).
Kandel, E. , J. Schwartz, and T. Jessell. 2000. Principles of Neural Science. he signiicance of phoneme awareness stems from its role
New York: William Heinemann and Harvard University Press. in reading acquisition (see writing and reading, acqui-
Kujala, A., K. Alho, E. Service, R. J. Ilmoniemi, and J. F. Connolly. 2004. sition of). Understanding that spoken words are made up of
Activation in the anterior left auditory cortex associated with pho- individual speech sounds provides a conceptual foundation for
nological analysis of speech input: Localization of the phonological
understanding the alphabetic principle (i.e., that letters cor-
mismatch negativity response with MEG. Cognitive Brain Research
respond with phonemes). his awareness, in turn, facilitates
21: 10613.
Magnuson, J. S., and H. C. Nusbaum. 2007. Acoustic diferences, lis-
learning to read and spell. he relationship between phoneme
tener expectations, and the perceptual accommodation of talker vari- awareness and literacy development is reciprocal: With
ability. Journal of Experimental Psychology, Human Perception and some emergent awareness of phonemes, the student can start
Performance 33: 391409. to acquire lettersound knowledge. In turn, awareness of pho-
Ntnen, Risto 2001. he perception of speech sounds by the human nemes is heightened by experience with print.
brain as relected by the mismatch negativity (MMN) and its magnetic Since the concept of phoneme awareness was established in
equivalent (MMNm). Psychophysiology 38: 121. the 1970s (e.g., Liberman 1971), evidence for the signiicance of
Poeppel, David, and Gregory Hickok. 2004. Towards a new functional phoneme awareness for reading achievement has accrued from
anatomy of language. Cognition 92: 112. correlational, prediction, and training studies. At all ages, includ-
Scott, S., and R. Wise. 2004. he functional neuroanatomy of prelexical
ing adulthood, less-skilled readers demonstrate weaker perfor-
processing in speech perception. Cognition 92: 1345.
mance on phoneme awareness measures than better-reading
Shafer, V. L., and K. Garrido-Nag. 2007. he neurodevelopmental bases
of language. In he Handbook of Language Development, ed. M. Shatz
peers, whether the same age or younger reading-age controls.
and E. Hof, 2145. Oxford: Blackwell. Prediction studies with kindergarten students document that
Shafer, V. L., R. G. Schwartz, and D. Kurtzberg. 2004, Language-speciic phoneme awareness performance is one of the strongest pre-
memory traces of consonants in the brain. Cognitive Brain Research dictors of their subsequent reading achievement, particularly
18: 24254. for decoding and word recognition skills, but also for reading
608
Phonology
comprehension (see teaching reading). Most compelling, considerable diversity among these views, it is fair to say that
intervention studies conirm a causal link between instruction in by and large, they focused on the elucidation of the contrastive
phoneme awareness and increased success at learning to read, properties of elements of surface phonetic form to the exclusion
with greater beneits when discovery of phonemes is linked with of other aspects of sound structure.
letter knowledge (Ehri et al. 2001).
The Development of Modern Phonology
Susan A. Brady
Poststructuralist theories fall broadly within the tradition of gen-
erative phonology, associated in its origins with Noam Chomsky
and Morris Halle (1968). he distinguishing character of this
Ehri, L. C., S. R. Nunes, D. M. Willows, B. Schuster, Z. Yaghoub-Zadeh, view was its attention not simply to surface contrasts but also to
and T. Shanahan. 2001. Phonemic awareness instruction helps patterns of alternation in shape, and its positing of an abstract
children learn to read: Evidence from the National Reading Panels underlying representation (where contrasts among elements
meta-analysis. Reading Research Quarterly 36.3: 25087.
are characterized) that is related to surface phonetic form by a
Gillon, G. 2005. Phonological Awareness: From Research to Practice.
system of rewriting rules. Each of these rules represents a single
New York: Guilford.
generalization about the realization of phonological elements
Liberman, I. Y. 1971. Basic research in speech and lateralization of
language: Some implications for reading disability. Bulletin of the (e.g., Vowels are long before voiced obstruents). Much of the
Orton Society 21: 7187. theoretical discussion in the 1960s and early 1970s concerned
Scarborough, H. S., and S. A. Brady. 2002. Toward a common terminol- the role of an explicit formalism for these rules.
ogy for talking about speech and reading: A glossary of the phon words he rules were presumed to apply in a sequence, with each
and some related terms. Journal of Literacy Research 34: 299334. applying to the result of all previous rules. As a consequence,
some of the generalizations represented by individual rules may
only be valid at an abstract level and not true of all surface forms
PHONOLOGY
to the extent subsequent changes obscure the conditioning fac-
As opposed to phonetics, which deals with the properties tors of a rule or its efects, leading to the opacity of the rule in
of sounds from a language-independent point of view, pho- question. For example, in many varieties of American English,
nology constitutes the study of the sound structure of units the medial consonants of words like ladder and latter are both
(morphemes, words, phrases, utterances) within individual pronounced as the same voiced lap [D]. he vowels of the ini-
languages. Its goal is to elucidate the system of distinctions in tial syllables of such words continue to difer in length, however,
sound that diferentiate such units within a particular language, relecting the abstract diference in voicing between /d/ and /t/,
and the range of realizations of a given units sound structure as even though that diference is obscured by the (subsequent)
a function of the shape of other units in its context. hese two application of a rule of lapping that renders the vowel-length
goals the study of invariants of sound structure and of the vari- rule opaque. Much attention was paid in this period to the theo-
ation shown by these elements in combination are obviously ries of rule ordering necessary for describing such phenomena.
closely related, but attention has tended to shift between them In the years immediately following the publication of Chomsky
over time. and Halle (1968), a number of scholars reacted strongly to the
Late nineteenth- and early twentieth-century study of sound perceived abstractness of the underlying phonological repre-
structure focused on the details of sound production. As these sentations to which it appeared to lead. Various proposals that
studies (in both articulatory phonetics and acoustic intended to restrain this aspect of the theory appeared, some of
phonetics) became more sophisticated, however, it was them based on the idea that if the rules themselves could be con-
increasingly apparent that the resulting explosion of data about strained so as to permit only highly natural ones, drawn from
sound properties was obscuring, rather than enhancing, schol- some substantively constrained universal set, the underlying
ars understanding of the way sound is organized for linguistic representations would thereby be forced to be closer to surface
purposes. Much that is measurable in the speech signal is pre- forms. Others proposed to constrain the relation between pho-
dictable, internal to the system of a given language, even though nological and phonetic representation directly (again, often in
exactly comparable properties may serve to distinguish items the name of naturalness).
from one another in a diferent language. In general, these attempts to limit the power of phonological
Vowels in English, for example, are relatively longer before systems by iat ran into apparent counterexamples that deprived
certain consonants than before others, but the diference in the them of their appeal. Other developments in phonological the-
vowels of, for example, cod and cot is entirely predictable from orizing shifted scholars attention away from this issue while also
this principle alone. By contrast, an exactly parallel diference leading (as somewhat unintentional by-products) to a general
between the vowels of kaade dip and kade envious in Finnish reduction in the degree of abstractness of representation. Some
serves as the sole diference between these words. A focus on of these elaborations and reorientations of the program of gen-
phonetic features alone fails to reveal the role played by sound erative phonology are sketched here.
properties within a language.
he result of this insight was the development within vari- AUTOSEGMENTAL PHONOLOGY. he bulk of research during the
ous theories of structuralism of attempts (Anderson 1985) classical period of generative phonology was concerned with
to deine the phoneme, a presumed minimal unit of contrast segmental phenomena (although the main goal of Chomsky
within the sound system of a single language. While there is and Halle 1968 was an account of English stress). In the early
609
Phonology
1970s, attempts to describe the phonology of tonal systems led of features) were thereby characterized as similar to one another,
to important changes in assumptions about representations and and thus predicted to behave in the same way in rules.
a concurrent shift of attention on the part of phonologists. It soon became apparent, however, that feature analysis
he classical theory had assumed that phonological (and by itself does not exhaust this matter. When nasal consonants
phonetic) representations were given in the form of a simple assimilate in place of articulation to a following obstruent, for
matrix, where each row represented a phonological distinctive instance, each individual place is speciied by a distinct feature
feature and the columns represented successive segments. Such (or set of features), and the overall unity of the process as one
a representation is based on the assumption that there is a one- applying exactly to all and only nasals, regardless of their place
to-one relation between the speciications for any given feature of articulation, is not expressed. Nothing in the notation, that is,
and those for all other features, since each column contains makes it clear that a rule assimilating labiality, coronality, and
exactly one speciication for each feature. velarity is more coherent in some sense than one assimilating
Tonal phenomena, however, made it clear that features need labiality, voicing, and nasality.
not be synchronized in this way: A given feature speciication he response to this problem was a program to treat the
might take as its scope either more or less than a single segment. features themselves as organized into a hierarchy, such that all
A classic example of this, ofered by W. Leben, is found in Mende, place-of-articulation features (for example), and no others, are
where each word bears one of a limited set of tonal patterns, daughters of a unitary node [Place]. On that approach, place
regardless of the number of syllables on which this pattern is assimilation could be viewed as a unitary association of the
realized. hus, the tone pattern high-low appears on a single [Place] node itself, rather than individually to each of its vari-
syllable in mb (and thus the low has scope over only the last half ous possible values, while no such single unit corresponds to the
of the vowel), on two in ngl, and on three in flm (where the hypothetical alternative. Attention focused on such problems of
single low of the pattern takes scope over two vowels). his led the internal geometry of the feature system generally led to the
to the development of autosegmental representations, in which assumption that the way to approach them was to assume that
feature speciications were linked by lines of association (subject the theory of rules should be limited to a very simple set of reas-
to speciic constraints), rather than all being aligned into seg- sociations and deletions within the autosegmental structure of
ments. he extension of this insight to other phenomena, and its an utterance, and that a single, universal feature hierarchy could
consolidation, essentially displaced the earlier concerns of rule be speciied on the basis of which all observed natural rules
notation and ordering in phonologists attention. (and no unnatural ones) could be formulated. Arguments
for and against speciic proposals about such a hierarchy have
METRICAL PHONOLOGY. A similar development took place in drawn considerable attention, though it is perhaps notable that
the analysis of stress and the study of the syllable. he analysis the theoretical assumptions underlying the program have been
in Chomsky and Halle (1968) treated stress as simply one more much less discussed.
phonological feature, with a value assigned to some (but not all)
of the segments in the representation of a word. his account was LEXICAL PHONOLOGY. In classical generative phonology, the
forced to attribute a number of basic properties to the feature interface between word structure and sound structure is quite
[Stress], however, that had no obvious correlates in the behavior simple. morphological elements are combined into words
of other features. in the syntax, these elements are provided with phonological
It became possible to rationalize these properties by viewing (underlying) forms, and the resulting syntactically organized
stress not as a segmental feature but as a relational property of labeled, bracketed structure serves as the input to the phonology.
the organization of syllables into larger structures. his, in turn, At least some of the phonological rules were assumed to apply
required the recognition of syllables as signiicant structural according to the principle of the cycle, based on this structure,
units: a notion that was explicitly rejected in the earlier theory in a uniform way. To the extent that morphological elements dis-
in favor of an attempt to reformulate all apparent syllable-based play diferent phonological properties in their combinations with
generalizations in terms of segmental structure alone. he orga- others, this was represented as diferences within an inventory of
nization of segments into syllables and these, in turn, into larger boundary elements separating them from adjacent items.
units called feet, which themselves are organized into phonologi- Originating from the apparent generalization that elements
cal words (and phrases, etc.), allows for the elimination of the with the same phonological behavior (hence, associated with the
anomalous character of segmentalized stress. he study within same boundary type) tend to appear adjacent to one another,
metrical phonology of these units, their internal organization, the theory of lexical phonology proposed a substantial revision to
and their relation to one another completed the enrichment of this architecture. Instead of constructing the entire representa-
the notion of phonological representation begun within autoseg- tion once and for all and then submitting it to the phonology for
mental phonology. realization, this view proposed that the lexicon of morphological
elements is divided into multiple strata or levels. Basic roots can
FEATURE GEOMETRY. A standard theme of classical generative combine with elements of the irst stratum; after each such mor-
phonology was that of natural classes of phonological segments, phological addition, the resulting form is subject to adjustment
groups of segments that function in some parallel fashion in pho- by the rules of a corresponding level of the phonology, and the
nological rules to the exclusion of others. It was originally hoped output is then eligible to serve as the input to further morpho-
that the analysis of segments into distinctive features would pro- logical elaboration. At some point, addition of elements from the
vide the solution to this issue: Segments sharing a feature (or set irst stratum is replaced by use of the morphology and phonology
610
Phonology
of the next, and, from then on, no further elements from the ini- example, when languages accommodate loan words to the sur-
tial stratum can be added. his process continues (perhaps vacu- face patterns of other words of the language, the adjustments
ously) through all of the strata of the lexicon, yielding a potential needed to achieve this may include changes that do not corre-
surface word. All of the words in a given syntactic structure are spond to any rule of the phonology of native forms. Constraints
then subject to adjustment by another set of postlexical phono- accomplish this directly and without further stipulation, whereas
logical processes. a system of rules may have to be arbitrarily extended to account
here are a number of further points that characterize this for loanword adaptation.
view, including proposed diferences in the properties of lexical On the other hand, some of the same issues that rule-based
and postlexical rules and the relations between rules on one level phonology dealt with (and at least largely resolved) have resur-
and those on the others. he central point for a broader theory faced as serious challenges to the architecture of grammar gen-
of grammar, however, is probably the replacement of a syntax- erally assumed in constraint-based theories. Most important
based (but purely phonological) notion of cyclic rule application among these is the problem of opaque generalizations. he
by a repeated cycle of morphological addition and phonologi- standard model of OT assumes that its constraints apply directly
cal adjustment. his results, for example, in the possibility that to surface forms and govern a single-stage mapping between
a phonologically derived property (on one cycle) can be relevant these and underlying phonological representations, and so
to the conditioning of a morphological operation (on a following has no place for generalizations that crucially apply to any sort
cycle), a possibility that has been shown to be quite real. of intermediate level. Nonetheless, a number of compelling
examples of such phenomena have been demonstrated, and
OPTIMALITY THEORY. In the early 1990s, a much more radical some sort of accommodation of these facts must be provided by
challenge to the classical model was presented by the develop- an adequate phonological theory.
ment of optimality theory (OT), a view of phonology based Some responses to this challenge have attempted to maintain
on a system of ranked, violable constraints on surface shape, the standard OT model by introducing new sorts of constraints.
as opposed to a system of ordered rules deriving the phonetics Mechanisms such as output-output constraints or sympathy the-
from an underlying phonological representation. hese con- ory, however, have not generally succeeded in dealing with all of
straints govern (in the standard formulation) a one-step relation the relevant phenomena and have been shown to produce new
between underlying and surface representations (cf. underly- diiculties of their own.
ing structure and surface structure), with no interme- One approach that seems promising is that of stratal OT,
diate stages of the sort produced in a rule-based description. he an architecture that grafts a constraint-based account onto the
constraints can be divided into general classes: a) markedness standard model of lexical phonology. he result is a framework
constraints, which express universally preferred conigurations, in which the phonological mapping at each stage is a one-step
and b) faithfulness constraints, requiring that contrasts present process governed by a constraint system. Since the model is
in the phonological representation be preserved in the surface built on a cyclic interaction of phonology and morphology, how-
form. In general, these are in conlict, and the ranking of the con- ever, it also provides for multiple successive stages in the overall
straints governs the resolution of those conlicts, in conformity derivation, thus accommodating opacity to the extent it can be
with general principles of grammar. related to morphological structure (as in the best-established
Initially, OT seemed to ofer its greatest promise in the analy- examples).
sis of stress, syllable structure, and related phenomena, but sub- Examples also seem to exist in which the speciic changes
sequent development has encompassed a full range of segmental through which a language achieves conformity with a gen-
and other facts. Descriptions in constraint-based terms are at eral constraint on surface forms do not follow directly from
least supericially very diferent from those couched in terms of the content of the constraint (together with other interacting
traditional rules, and theoretical discussion in phonology since generalizations). In such a case, something like a rewriting
their introduction has been largely dominated by comparisons rule might be necessary, as a supplement to the constraint
of the two frameworks. system a notion that is clearly antithetical to the basic phi-
losophy of OT.
Current Approaches to Phonology A quite diferent problem concerns the very nature of the
he central issues in phonology in the irst decade of the twen- universals of phonological structure (see phonology, uni-
ty-irst century concern the comparative merits of OT and versals of). Phonological theorizing has generally accepted
rule-based descriptions. On the one hand, constraint-based the premise that generalizations that are true of phonological
formulations seem much better equipped to describe global systems in general result from the cognitive organization of the
properties of phonological systems. It was noted in work from human language faculty and, thus, must be incorporated in some
the classical period of generative phonology that multiple dis- way into the architecture of phonological theory. Recently, how-
tinct processes in an individual language may all have the efect ever, it has been argued that at least some such typological regu-
of ensuring (or avoiding) a single characteristic property of sur- larities result not from the content of a universal grammar
face form, but no satisfactory account of the unity displayed by constraining synchronic systems but, rather, from the universals
these conspiracies was ever achieved. OT, in contrast, provides of language change (see language change, universals of)
a very direct description of such facts. governing the diachronic developments resulting in the systems
In some ways, the surface constraint approach goes beyond we observe. To the extent that this is true, it requires investiga-
anything available in principle to the rule-based theory. For tors to examine closely the arguments for incorporating any
611
Phonology, Acquisition of
particular regularity into phonological theory per se, as opposed understand the system that relates childrens stored representa-
to seeking its basis elsewhere. tions to their productions, and to formalize the developmental
paths that children follow.
Conclusion
While there have, of course, been other trends not covered here, Childrens Grammars as Possible Grammars
it seems fair to say that the bulk of the theoretical discussion in he focus of research on phonological acquisition is on the shapes
phonology from the 1960s to the present has been devoted to of early grammars in the segmental and prosodic domains; thus,
the elaboration and reinement of the generative program of it parallels research on end-state (adult) grammars. (Segmental
Chomsky and Halle (1968). he most recent developments in phonology is concerned with individual speech sounds, prosodic
that tradition, involving the wholesale replacement of rules by phonology with larger units including syllables and feet.) here is
constraints as the mechanism for expressing regularities of a lan- typically a comparison drawn between the shapes of developing
guages sound pattern, have shown great promise but cannot yet grammars and some end-state grammar. Order of emergence
be considered wholly consolidated. Apparently, some appropri- of segmental (Dinnsen 1992) and prosodic complexity (Fikkert
ate synthesis of the classical and OT models remains to be found, 1994; Levelt, Schiller, and Levelt 1999/2000), as well as error pat-
and it is that search that dominates discussion today. terns observed in the segmental and prosodic domains, whether
these patterns are expressed through rules (Smith 1973; Ingram
Stephen R. Anderson
1974), templates (Macken 1992; Fikkert 1994), or constraints
(Pater and Barlow 2003; Goad and Rose 2004), are all considered
in relation to some adult grammar.
Anderson, Stephen R. 1985. Phonology in the Twentieth Century: heories One exception is a body of research that views childrens
of Rules and heories of Representations. Chicago: University of Chicago grammars as self-contained systems subject to their own con-
Press. Describes the development of phonological theory, from its ori- straints (Stoel-Gammon and Cooper 1984; Vihman 1996). his
gins through the classical period of generative phonology. research program developed in response to the observation that
Chomsky, Noam, and Morris Halle. 1968. he Sound Pattern of English.
childrens grammars are not simply reduced versions of the
New York: Harper and Row.
target grammar; indeed, variation across learners is rampant
Gussenhovern, Carlos, and Haike Jacobs, 2005. Understanding Phonology.
(Ferguson and Farwell 1975).
2d ed. New York: Oxford University Press. Lucid elementary introduc-
tion to current phonology. While childrens grammars may be self-organizing in that
Kager, Ren 1999. Optimality heory: A Textbook. Cambridge: Cambridge they contain processes not present in the target language, they
University Press. Introduces the main ideas of optimality theory in can still be viewed as possible grammars (White 1982; Pinker
phonology and their implementation. 1984) if these processes have correlates in other adult languages.
Kenstowicz, Michael 1994. Phonology in Generative Grammar. he notion of possible grammar thus requires that, at each
Cambridge, MA: Blackwell. Provides a comprehensive description of stage, childrens grammars respect the constraints of adult gram-
the principal themes in phonology up to the introduction of optimality mars, even if they bear little resemblance to the target system. In
theory. optimality theory (OT) (Prince and Smolensky [1993] 2004),
for example, alternate routes observed across learners, as well as
stages in the development of a single learner, are viewed from
PHONOLOGY, ACQUISITION OF
the perspective of the typological options that adult languages
A diversity of issues informs work in the ield of phonological display: Both are accounted for by diferent rankings of the same
acquisition, as it encompasses both irst (L1) and second (L2) constraints.
language acquisition examined by researchers in linguistics,
psychology, speechlanguage pathology, and language edu- Markedness
cation. In L1, there are questions such as how the acquisition Although children take diferent paths to the adult grammar, early
of phonology interfaces with perceptual and motor develop- phonologies are also strikingly similar (Jakobson [1941] 1968).
ment (Locke 1993), and how an examination of disordered As Roman Jakobson emphasizes, these similarities relect cross-
development can illuminate the normally developing gram- linguistically unmarked properties. markedness constrains the
mar (Bernhardt and Stemberger 1998; Dinnsen 1999). In L2, shapes of linguistic systems such that less complex properties are
there are questions as to whether the acquisition process is favored. For example, there is a well-documented preference for
fundamentally like L1 acquisition (Flege 1995), or whether L2 consonant+vowel (CV) syllables among children (Ingram 1978;
grammars are in some sense impaired due to, for example, L1 cf. Grijzenhout and Joppen-Hellwig 2002); this is also a syllable
constraints that impede native-like attainment (see Brown 1998 shape that no end-state grammar forbids (Jakobson 1962). Since
on perception). unmarked patterns are systematically observed across learners,
Due to space constraints, this entry focuses on L1, although one might reasonably infer that they relect early grammatical
many of the same issues arise for L2. he acquisition of phonol- organization. However, markedness has not always been well
ogy is examined from the perspective of generative grammar; integrated into the theory of grammar (as part of the theory of
thus, a principal theme is to examine how acquisition research representations or formulation of rules/constraints). his begs
has used linguistic theory to inform development. his theme the question of whether markedness should instead be part of
considers the starting hypothesis to be that childrens produc- the theory of acquisition, which interfaces with, but is indepen-
tions are largely system driven: Acquisition research strives to dent of, the theory of grammar.
612
Table 1.
Ambient Stage Grammar: Stored form: Grammar: Produced

form: form:
[wei] 1 M >> F-perc, F /wei/ M >> F-perc, F

[wei]
2 F-perc >> M >> F F-perc >> M >> F
/wei/
3 F-perc, F >> M F-perc, F >> M [wei]
An advantage of OT is that the formal devices for expressing As shown in Table 1, at Stage 1, both perception-speciic
phonological generalizations include a set of markedness con- faithfulness (F-perc) and general faithfulness (F) are outranked
straints. Most researchers have proposed that learners begin by markedness (M). he result is unmarked forms stored in per-
acquisition with a ranking wherein markedness constraints dom- ception and uttered in production. In the example provided, the
inate faithfulness (which favor identity between inputs [stored ambient form [wei] away undergoes truncation of the pre-
representations] and outputs) (e.g., Demuth 1995; Gnanadesikan tonic syllable (an unstressed syllable immediately preceding a
[1995] 2004; Smolensky 1996; Pater 1997; Ota 2003; cf. Hale and stressed syllable) in both components of the childs grammar;
Reiss 1998). hroughout development, constraints are reranked accordingly, words of this shape are perceived and produced
to yield more marked outputs. However, many paths can be fol- without this syllable. At Stage 2, the childs perceptual abilities
lowed, as there are many options for what to rerank. hus, the become more target-like (i.e., he/she learns to correctly iden-
idea that grammars are initially unmarked is not inconsistent tify information in the ambient language); this indicates that
with their being self-organizing. the relevant markedness constraints have been demoted below
perception-speciic faithfulness. General faithfulness is still out-
Perception Versus Production ranked, yielding a mismatch between what the child perceives
Most work in phonological acquisition has focused on produc- and what he/she produces. At Stage 3, markedness is demoted
tion; indeed, researchers typically assume that children accu- below general faithfulness, and the form is correctly produced.
rately perceive the ambient input. his is due, in part, to the he perception-production time lag results because forms that
observation that prelinguistic infants can perceptually discrimi- are correctly perceived at Stage 2 are not correctly produced until
nate perhaps all contrasts exploited by the worlds languages Stage 3.
(Eimas et al. 1971; Werker et al. 1981). his ability largely declines
by age one (cf. Best, McRoberts, and Sithole 1988), coinciding Phonological Theory and Phonological Acquisition
with a reorganization of perceptual categories according to what As the preceding discussion reveals, research in phonological
is contrastive in the target language (Werker and Tees 1984). As acquisition has been directly impacted by thinking in generative
children start to speak around age one, it would appear that per- phonology. Modern generative phonology began with Chomsky
ception is complete by the onset of production. and Halles (1968) Sound Pattern of English (SPE). Although more
Research on phonemic perception, which requires the abil- recent work has situated the shapes of developing grammars
ity to form soundmeaning pairings, has challenged this view within the typological range manifested by adult systems, this
(Shvachkin [1948] 1973; Edwards 1974; Brown and Matthews was less the case in the SPE-based literature. Much of this work
1997). Although experiments examining minimal contrasts used SPE as a tool only, in part because, with the formal appa-
between native-language sounds have revealed that perceptual ratus employed by the theory, it was diicult to constrain what
development is mostly complete by age two, some contrasts a possible grammar is: developing or end state. And although
develop as late as three. Even age three is probably conservative the theory contained an evaluation metric to guide learners in
because, for consonant perception, this research has focused selecting the most highly valued among descriptively adequate
almost exclusively on word-initial position. Since contrasts in grammars, rules for unattested processes were as easy to formal-
other positions are harder to discriminate, many non-target ize as rules for commonly attested processes. Finally, SPE con-
patterns that childrens productions display could relect pertained no workable theory of markedness and, thus, childrens
ceptual miscoding, rather than production constraints (Macken grammars could not be considered relative to some notion of
1980). optimal.
If perception and production both relect aspects of childrens To facilitate a comparison between SPE and later theories,
competence, both must be included in the grammar (cf. Hale we draw on truncation, further exempliied in (1) from Amahl,
and Reiss 1998). However, the time lag observed (production age 2.60 (Smith 1973; [b,g] are voiceless unaspirated lenis stops).
trails perception) has suggested to some researchers that they (he discussion focuses on the stage when perception is target-
form independent (interacting) grammatical modules (see Menn like and truncation is restricted to production.)
and Matthei 1992). his approach, though, cannot predict that
(1) [gep] escape
perception and production abilities develop in a similar order.
[an] banana
he latter favors the postulation of a single grammar if the time
lag can be built in. In Pater (2004), this is accomplished by intro- In SPE, every deviation from adult forms required one or more
ducing perception-speciic faithfulness constraints into OT. rules, and so there was little in common between the rule sets
613
for developing and target systems. To capture truncation, Neil Table 2.

Smith (1973) provides the rules below, neither of which operates
in the adult grammar: ParseSyll Max[lab]-IO Max-IO I-Contig
(2) R14: V / # (C) ______ C V a. [b(nan)Ft]Wd *!
[-stress] [+stress] b. [(nan)Ft]Wd *! **
R16: [+sonorant] / [+consonantal] ______
c. [(ban)Ft]Wd ** *
R14 deletes initial vowels in words like escape. For consonant-
initial forms like banana, the result is [bnan], which then under-
goes R16, yielding [ban].
To illustrate, concerning truncation in Table 2, the con-
Since SPE employed linear representations, the theory did
straint ParseSyllable (syllables are parsed into feet), along
not ofer any insight into why pretonic rather than posttonic syl-
with other markedness constraints, must be satisfied at the
lables delete (escape [gep], but tiger [aig], *[ai()]).
expense of the lower-ranked faithfulness constraint Max-IO
he development of nonlinear phonology (see Goldsmith 1985
(every segment in the input has a correspondent in the out-
for an overview), notably the move to highly articulated prosodic
put). Fully faithful (a) is thus eliminated because the initial syl-
representations, led to signiicant breakthroughs in understand-
lable is unfooted. Concerning onset selection, Max[labial]-IO
ing this asymmetry. In trochaic languages, where the foot (the
(every [labial] in the input has a correspondent in the out-
rhythmic unit in which stress is assigned) is left-headed (stress-
put) must be ranked over I-Contiguity (the portion of the
initial), escape cannot form a single foot, [s(kip)Ft]Wd, whereas
input standing in correspondence forms a contiguous string).
tiger can, [(tig)Ft]Wd.
Preservation of [labial] in banana will thus be favored, (c),
Much work in nonlinear phonology has explored the idea
even though the result violates I-Contig through morpheme-
that prosodically deined templates constrain output shape
internal segment deletion.
(McCarthy and Prince 1995). Paula Fikkert (1994) proposes that
OT has had a major impact on acquisition research.
templates, which at early developmental stages relect what
Phonological processes are now generally expressed through
is unmarked, are responsible for truncation. If the childs pro-
constraints, rather than rules, as this provides a better concep-
ductions are limited to one foot, circumscribed from the adult
tualization of the observation that markedness shapes early
output, this template will determine which material is preserved
grammars. As discussed, childrens productions become more
from the adult form and which is deleted:
target-like when markedness constraints are demoted below
A dult output: Wd faithfulness. A similar idea, that development is best viewed as
the gradual relaxing of constraints, had been proposed earlier
Ft (Stampe 1969; Menn 1980), but it was diicult to formally imple-
keip ment it in the rule-based frameworks of the time.
s
e
OT seems to provide an appealing view of the initial state and
Ft of development; researchers can address important questions,
such as how the theory may restrict what a possible develop-
Child output: Wd ing grammar is, and how, in turn, data from development may
inform the theory. However, this is not to say that OT has solved
In contrast to SPE, nonlinear phonology reveals the rela-
all problems in phonological acquisition. One understudied
tionship between target and truncated forms, and the role that
problem is rogue behavior. We have been assuming that chil-
markedness plays in shaping outputs. he material inside the
drens grammars are possible grammars, thereby ignoring the
foot survives, as syllables organized by feet ([keip]) are less
fact that some commonly attested processes, notably conso-
marked than those linking directly to the word ([s]). One prob-
nant harmony (CH), have no adult analogs (Drachman 1978). In
lem with the templatic approach, however, is that it is too rigid: If
CH, consonants share place over vowels of any quality (Vihman
the segments predicted to survive are precisely those delimited
1978), as seen in (3) for Amahl, age 2.60 (Smith 1973):
by the constituent that serves to organize them in the adult form,
it becomes diicult to capture the observation that material from (3) [aig] tiger
the truncated syllable can also survive. For example, in Amahls [ok] stroke
pronunciation of banana in (1), onset selection favors [b], replac-
Some recent accounts of CH (Goad 1997; Rose 2000) incor-
ing [n] from the stressed syllable; that is, his production is [an]
rectly predict that the process should be attested in adult gram-
not *[nan] as expected from adult [b(nan)Ft]Wd (see Kehoe
mars; others (Pater 1997) appeal to child-speciic constraints,
and Stoel-Gammon 1997 for other problems with the templatic
thereby challenging the notion that childrens grammars are pos-
approach).
sible grammars. Neither of these approaches questions whether
his problem is rectiied in OT. First, there are no templates;
CH is truly grammar-driven nor addresses, more generally, the
templatic efects arise from the interaction of markedness con-
criteria that should factor into the determination concerning
straints. Second, segmental content (e.g., labial preservation)
what is grammar-driven and what is not. I leave these questions
is the responsibility of faithfulness constraints. Finally, all con-
to future work.
straints are interranked; thus, the co-occurrence of truncation
and onset selection is not unexpected (see Pater 1997). Heather Goad
614
WORKS CITED AND SUGGESTIONS FOR FURTHER READING Ingram, David. 1974. Phonological rules in young children. Journal of
Bernhardt, Barbara, and Joe Stemberger. 1998. Handbook of Phonological
. 1978. he role of the syllable in phonological development.
Development from the Perspective of Constraint-Based Nonlinear
In Syllables and Segments, ed. Alan Bell and Joan B. Hooper, 14355.
Phonology. San Diego, CA: Academic Press.
Amsterdam: North-Holland.
Best, Catherine, Gerald McRoberts, and Nomathemba Sithole. 1988.
Jakobson, Roman. [1941] 1968. Child Language, Aphasia and Phonological
Examination of perceptual reorganization for nonnative speech
Universals. Trans. Allan Keiler. he Hague: Mouton.
contrasts: Zulu click discrimination by English-speaking adults and
. 1962. Selected Writings. Vol. 1. Phonological Studies. he
infants. Journal of Experimental Psychology: Human Perception and
Hague: Mouton.
Performance 14: 34560.
Kehoe, Margaret, and Carol Stoel-Gammon. 1997. he acquisition of
Brown, Cindy. 1998. he role of the L1 grammar in the L2 acquisition of
prosodic structure: An investigation of current accounts of childrens
segmental structure. Second Language Research 14: 13693.
prosodic development. Language 73: 11344.
Brown, Cindy, and John Matthews. 1997. he role of feature geometry
Levelt, Clara, Niels Schiller, and Willem Levelt. 1999/2000. he acquisi-
in the development of phonetic contrasts. In Focus on Phonological
tion of syllable types. Language Acquisition 8: 23764.
Acquisition, ed. S. J. Hannahs and Martha Young-Scholten, 67112.
Locke, John. 1993. he Childs Path to Spoken Language.
Amsterdam: Benjamins.
Chomsky, Noam, and Morris Halle. 1968. he Sound Pattern of English.
Macken, Marlys. 1980. he childs lexical representation: he puzzle-
New York: Harper & Row.
puddle-pickle evidence. Journal of Linguistics 16: 117.
Demuth, Katherine. 1995. Markedness and the development of prosodic
. 1992. Wheres phonology? In Phonological Development: Models,
structure. Proceedings of NELS 25: 1325.
Research, Implications, ed. Charles Ferguson, Lise Menn, and Carol
Dinnsen, Daniel. 1992. Variation in developing and fully developed pho-
Stoel-Gammon, 24969. Timonium, MD: York.
netic inventories. In Phonological Development: Models, Research,
McCarthy, John, and Alan Prince. 1995. Prosodic morphology. In
Implications, ed. Charles Ferguson, Lise Menn, and Carol Stoel-
Goldsmith 1995, 31866.
Gammon, 191210. Timonium, MD: York.
Menn, Lise. 1980. Child phonology and phonological theory. In Child
. 1999. Some empirical and theoretical issues in disordered child
Phonology. Vol. 1: Production. Ed. Grace Yeni-Komshian, James
phonology. In Handbook of Child Language Acquisition, ed. William
Kavanaugh, and Charles Ferguson, 2342. New York: Academic Press.
Ritchie and Tej Bhatia, 647704. San Diego, CA: Academic Press.
Menn, Lise, and Edward Matthei. 1992. he two-lexicon account to
Drachman, Gaberell. 1978. Child language and language change: A con-
child phonology. In Phonological Development: Models, Research,
jecture and some refutations. In Recent Developments in Historical
Implications, ed. Charles Ferguson, Lise Menn, and Carol Stoel-
Phonology, ed. Jacek Fisiak, 12344. he Hague: Mouton.
Gammon, 21147. Timonium MD: York.
Edwards, Mary Louise. 1974. Perception and production in child pho-
Ota, Mits. 2003. he Development of Prosodic Structure in Early Words.
nology: he testing of four hypotheses. Journal of Child Language
1: 20519.
Pater, Joe. 1997. Minimal violation and phonological development.
Eimas, Peter, Einar Siqueland, Peter Jusczyk, and James Vigorito. 1971.
Language Acquisition 6: 20153.
Speech perception in infants. Science 171: 3036.
. 2004. Bridging the gap between receptive and productive
Ferguson, Charles, and Carol Farwell. 1975. Words and sounds in early
development with minimally violable constraints. In Constraints
language acquisition. Language 51: 41939.
in Phonological Development, ed. Ren Kager, Joe Pater, and Wim
Fikkert, Paula. 1994. On the Acquisition of Prosodic Structure. he
Zonneveld, 21944. Cambridge: Cambridge University Press.
Hague: Holland Academic Graphics.
Pater, Joe, and Jessica Barlow. 2003. Constraint conlict in cluster reduc-
Flege, James. 1995. Second language speech learning: heory,
tion. Journal of Child Language 30: 487526.
indings, and problems. In Speech Perception and Linguistic
Pinker, Steven. 1984. Language Learnability and Language Development.
Experience: heoretical and Methodological Issues, ed. Winifred
Strange, 23377. Timonium, MD: York.
Prince, Alan, and Paul Smolensky. [1993] 2004. Optimality
Gnanadesikan, Amalia. [1995] 2004. Markedness and faithfulness
heory: Constraint Interaction in Generative Grammar.
constraints in child phonology. In Constraints in Phonological
Oxford: Blackwell.
Development, ed. Ren Kager, Joe Pater, and Wim Zonneveld, 73108.
Rose, Yvan. 2000. Headedness and Prosodic Licensing in the L1 Acquisition
of Phonology. Ph.D. diss., McGill University.
Goad, Heather. 1997. Consonant harmony in child language: An
Shvachkin, N. Kh. [1948] 1973. he development of phonemic speech
optimality-theoretic account. In Focus on Phonological
perception in early childhood. In Studies of Child Language
Acquisition, ed. S. J. Hannahs and Martha Young-Scholten, 11342.
Development, ed. Charles Ferguson and Dan Slobin, 91127. New
York: Holt, Rinehart and Winston.
Goad, Heather, and Yvan Rose. 2004. Input elaboration, head faith-
Smith, Neil. 1973. he Acquisition of Phonology: A Case Study.
fulness and evidence for representation in the acquisition of left-
edge clusters in West Germanic. In Constraints in Phonological
Smolensky, Paul. 1996. On the comprehension/production dilemma in
Development, ed. Ren Kager, Joe Pater, and Wim Zonneveld, 10957.
child language. Linguistic Inquiry 27: 72031.
Stampe, David. 1969. he acquisition of phonetic representation.
Goldsmith, John, ed. 1995. he Handbook of Phonological heory.
Chicago Linguistic Society 5: 43344.
Oxford: Blackwell.
Stoel-Gammon, Carol, and Judith Cooper. 1984. Patterns of early lex-
Grijzenhout, Janet, and Sandra Joppen-Hellwig. 2002. he lack of onsets
ical and phonological development. Journal of Child Language
in German child phonology. In he Process of Language Acquisition,
11: 24771.
ed. Ingeborg Lasser, 31939. Frankfurt am Main: Peter Lang.
Vihman, Marilyn. 1978. Consonant harmony: Its scope and function in
Hale, Mark, and Charles Reiss. 1998. Formal and empirical arguments
child language. In Universals of Human Language. Vol. 2: Phonology.
concerning phonological acquisition. Linguistic Inquiry 29: 65683.
Ed. Charles Ferguson, 281334. Stanford, CA: Stanford University Press.
615
Phonology, Evolution of
. 1996. Phonological Development: he Origins of Language in the Communication during this period was based on gestures.
Child. Oxford: Blackwell. Spoken language emerged in the second transition. In Donalds
Werker, Janet, John Gilbert, Keith Humphrey, and Richard Tees. 1981. scenario, it takes until the end of the period associated with
Developmental aspects of cross-language speech perception. Child
archaic Homo sapiens (45,000 years ago) for spoken language to
appear.
Werker, Janet, and Richard Tees. 1984. Cross-language speech percep-
If mimesis was a basically gestural mode of communication,
tion: Evidence for perceptual reorganization during the irst year of
life. Infant Behavior and Development 7: 4963. would it not imply a proto-language that was signed, rather than
White, Lydia. 1982. Grammatical heory and Language Acquisition. spoken (cf. Arbib 2005)? Donald assumes that as mimetic mes-
Dordrecht, the Netherlands: Foris. sages grew more elaborate, they eventually reached a complex-
ity that favored faster and more precise ways of communicating.
he vocal/auditory modality ofered an independent, omnidir-
PHONOLOGY, EVOLUTION OF
ectional channel useful at a distance and in the dark. It did not
True Phonology: How Did It Evolve? impede locomotion, gestures, or manual work. he vocal system
phonology is the study of how languages use segmental and came to be exploited more and more and further adaptations
prosodic categories to build spoken words and signal their dif- occurred: irst lexical invention and high-speed phonological
ferences in meaning. Many animals that communicate vocally speech, syntax later.
use distinct sound patterns to signal diferent meanings, but
their repertoires are typically small and closed sets. By contrast, Specializations of the Vocal/Auditory Modality
humans have large vocabularies and learn new words all of A number of comparative studies have been undertaken in
their lives. From infancy to adolescence, children acquire lexi- attempts to evaluate the adaptive signiicance of novel fea-
cal entries at a remarkably fast rate. he vocabulary size of high tures of human anatomy (see speech anatomy, evolution
school students has been conservatively estimated at 60,000 root of): for example, disappearance of air sacs (Hewitt, MacLarnon,
forms a number that implies an average acquisition rate of and Jones 2002), bigger hypoglossal and vertebral canals, smaller
more than 10 words per day (Miller 1991). masticatory muscle mass, genetic changes, and uniqueness of
his diference is linked to the uniquely human method of craniofacial sensorimotor system (Fitch 2000; Kent 2004).
coding information: the combinatorial use of discrete entities. Perhaps the most conclusive example of a speech-related
Combinatorial structure, the hallmark of true language, creates adaptation is the descent of the larynx, which makes swallowing
the conditions for open-ended lexical and syntactic systems more hazardous but expands the space of possible sound quali-
that provide the foundation for the singular expressive power of ties (Lieberman 1991; Carr, Lindblom, and MacNeilage 1995).
human languages. How did it evolve? A human ability, central to language but curiously absent
We focus on two areas of empirical research. One is the study in primates, is vocal imitation. A beginning of a neural account
of human cognition. he other is the investigation of the phonetic of imitation was suggested by the discovery of mirror neurons.
signal space from which all phonological patterns are drawn. First identiied in the macaques premotor cortex, these neurons
he irst theme highlights mans rich semantic abilities. he discharge when the monkey manipulates objects and when it
second looks for phenomena that presage combinatorial sound observes other monkeys or humans do the same. Neurons that
structure. respond to sound and to communicative or ingestive actions
have also been identiied. Although there is no direct evidence
Cognitive Growth for a human mirror system, brain stimulation and imaging stud-
he virtually ininite set of meanings encodable by language ies indicate increased activity in speech muscles when subjects
raises the question of how mans cognitive capacity evolved listen to speech (Hurley and Chater 2005; see mirror systems,
from skills not unlike those of present-day apes. How do we pic- imitation, and language).
ture the transition from a nonhuman to a human primate mind?
What was the role of language? Signals for Speaker, Listener, and Learner
According to Merlin Donalds synthesis of neurobiologi- In technical jargon, phonology has been characterized as provid-
cal, psychological, archeological, and anthropological evidence ing an impedance match between semantics and phonetics in
(1991), our ancestors broke away from the stimulus-driven the sense that it succeeds in coding a large number of meanings
behavior of apes in two steps. First, during the period of Homo despite its use of only a small set of phonetic dimensions (Bellugi
erectus (from 1.5 million years ago), an adaptation called mimesis and Studdert-Kennedy 1980). How was this match achieved?
occurred, a communicative culture allowing individuals to share
mental states and begin to represent reality in new and expand- GESTURES AS BASIC UNITS. One answer is that the building
ing ways. blocks of speech are phonetic gestures, units corresponding to
Mimetic behavior is an ability to voluntarily access and the discrete articulators. he argument is that, evolutionarily,
retrieve motor memories and to rehearse and model them for as holistic utterances were processed by the mirror system, they
communication with others. he whole body is used as a rep- came to be parsed into the basic articulators of the vocal tract
resentational device, as in imitating vocal, manual, and postural and their preferred, natural motions. Data from early speech
movements for a communicative purpose. Mimesis involved have been used to argue that these units, when properly timed
major changes of motor and memory mechanisms based on and modulated in amplitude, produce the vowels and conso-
existing capacities. nants of the ambient adult input (Studdert-Kennedy 2005).
616
Phonology, Evolution of
PHYLOGENY OF THE SYLLABLE. he frame/content theory why phonologies do not recruit more of a humans total sound-
(MacNeilage 2008) ofers an evolutionary account of the syllable. making capabilities (e.g., mouth sounds and other non-speech
Syllables are universally associated with openclose alternations vocalizations; Catford 1982) but prefer practically the same small
of the mandible, vowels being open and consonants closed artic- set of phonetic properties. However, the study of these constraints
ulations. his movement has a parallel in childrens babbles, only partially illuminates the roots of combinatorial coding. his
which resemble consonant-vowel sequences such as [bababa], topic has been explicitly addressed in computer modeling exper-
but are in no way organized in terms of discrete segments. Rather, iments. One such study shows how discrete phonetic targets and
their syllabic and segmental character arises fortuitously from reuse can emerge from a dynamic systems network of agents
adding phonation to the openclose jaw motion. his rhythmic- (speaker/listener models) whose vocalizations initially ran-
ally repeated up-and-down movement is also found in so-called domly distributed in phonetic space tend to converge (driven
lipsmacks, a facio-visual behavior in primates often combined by a magnet-like dominance of the patterns heard most often) on
with phonation during grooming. a few targets (Oudeyer 2006).
Accordingly, the evolutionary path to the syllable began in
deep prehistoric time when mammal biomechanics evolved for TARGETS AND MOTOR EQUIVALENCE. Traditionally, the basic units
feeding. A second stage was the use of this machinery in primate of speech have been assumed to be targets, the intertarget transi-
communication. In a third step, this primate mechanism was co- tions being primarily determined by the response characteristics
opted for speech by scafolding early phonology on its pseudo- of the production system. Speech, like other movements, exhib-
syllables and pseudo-segments. its motor equivalence: the ability of motor systems to compensate
and reach a given goal irrespective of initial conditions. his view
QUANTAL THEORY. he acoustic consequences of a continuous implies that the end state of phonetic learning is a set of context-
articulatory movement are often noncontinuous, as illus- independent targets and a system capable of motor equivalence.
trated by the pseudo-segmental character of babbling. In the It moreover suggests that once a target has been learned in one
babble example, the jaw moves continuously but the acoustics context, it can immediately be reused in other contexts, since the
shows an abrupt change from a vowel-like to a stoplike pattern. motor equivalence capability handles the new trajectory. Also, it
his quantal jump illustrates a general fact about the phonetic means that, developmentally, discrete segments derive from the
space. he mapping of articulation onto acoustic parameters emergent targets and recombination from motor equivalence.
creates a set of acoustic patterns that forms a number of disjoint A further relevant observation on the target hypothesis is that
subspaces, rather than a single continuous, coherent space. linguistic systems with phonemically coded vocabularies would
Within each such subregion, sound quality is homogeneous. be learned faster, more easily, and in an open-ended manner
Voiced and voiceless sounds, as well as diferent manners of than repertoires based on holistic forms (Lindblom 2007).
articulation (e.g., stops, nasals, fricatives, trills), exemplify such
distinct subspaces (Stevens 1989).
Conclusion
Where does combinatorial structure come from? From prespeci-
USER-BASED CONSTRAINTS: ON-LINE SPEECH. he human voice ications in our genetic endowment? Or from a modality-inde-
is an expressive instrument that undergoes moment-to-moment pendent principle shared by sign and speech and perhaps also
retuning by many nonlinguistic factors. Consequently, the pho- operating in genetics and chemistry (cf. the particulate principle
netic patterns conveying linguistically the same utterance exhibit [Abler 1989])? Or from a mutually reinforcing interplay between
great variability. However, the need for messages to be both intel- cognitive growth and a suite of conditions entailed by communi-
ligible and pronounceable imposes a systematic distribution on cating by vocal sounds?
phonetic variations, placing them between clear hyperforms and In view of the materials reviewed here, a positive treatment of
reduced hypoforms. his view portrays speakerlistener interac- the last possibility appears within reach. More lexical inventions
tions as a tug-of-war between the listeners need for comprehen- imply an increasing number of soundmeaning pairs. he link-
sion and the speakers tendency to simplify. here is a great deal of ing of phonetic shapes with distinct meanings would be subject
experimental evidence for this view of speech (Lindblom 1990). to numerous user-based constraints and processes shaping the
instrinsic content of lexical entries, fractionating them into dis-
USER-BASED CONSTRAINTS: PHONOLOGY. hese user-based concrete units and facilitating unit recombination. Sound structure
straints also leave their mark on phonology, as is evident from could, thus, plausibly have evolved in response to the expressive
typological data on strengthening and weakening processes in needs associated with growing semantic abilities and as a pro-
phonological rules and sound changes (Kiparsky 1988) and from cess of phonetically biased scaling, self-organizing without
attempts to simulate segment inventories. hese studies indicate any formal a priori or modality-independent blueprint.
that systemic selections have been favored that simultaneously Bjrn Lindblom
optimize distinctiveness and articulatory ease. An example of
the efect of these conditions is the size principle: he larger the WORKS CITED AND SUGGESTIONS FOR FURTHER READING
system, the greater the proportion of articulatorily complex seg- Abler, William L. 1989. On the particulate principle of self-diversifying
ments (Lindblom and Maddieson 1988) systems. Journal of Social and Biological Structures 12: 113.
Arbib, Michael. 2005. he mirror system hypothesis: How did proto-
SELF-ORGANIZATION. hese user-based constraints in conjunc- language evolve? In Language Origins: Perspectives on Evolution, ed.
tion with the quantal nature of the signal space help explain Maggie Tallerman, 2147. New York: Oxford University Press.
617
Phonology, Universals of
Bellugi, U., And M. Studdert-Kennedy. 1980. Signed and Spoken supposed human innate capacity for language. here is a sizable
Language: Constraints on Linguistic Form. (Dahlem Konferenzen.) literature on phonological universals (e.g., Greenberg, Ferguson,
Weinheim, Germany: Verlag Chemie GmbH. and Moravcsik 1978; Maddieson 1984), and it will not be possible
Carr, Ren, B. Lindblom, and P. MacNeilage. 1995. Rle de lacoustique in this limited space to discuss and exemplify more than a few
dans lvolution du conduit vocal humain. Comptes Rendus de
of those that have been discovered. What is more important and
lAcadmie des Sciences (Paris) t 30, srie Iib: 47176.
what will be emphasized here is a consideration of the explana-
Catford, John C. 1982. Fundamental Problems in Phonetics.
Bloomington: Indiana University Press.
tion for phonological universals. he best evidence presented so
Donald, Merlin. 1991. Origins of the Modern Mind. Cambridge: Harvard far points to their phonetic origin.
University Press A caveat: Phonological universals as with any other phono-
Fitch, W. Tecumseh. 2000. he evolution of speech: A comparative logical generalization are inevitably stated in terms of a tradi-
review. Trends in Cognitive Science 4.3: 25867. tional pretheoretic taxonomy. One should always be alert to the
Hewitt, G., A. MacLarnon, and K. E. Jones. 2002. he functions of laryn- possibility that the taxonomic terms devised for purely practical
geal air sacs in primates: A new hypothesis. Folia Primatol 73: 7094. and descriptive purposes may not conform to the true essence
Hurley, Susan, and Nick Chater. 2005. Perspectives on Imitation: From of speech, just as, for example, a pretheoretic category for living
Neuroscience to Social Science. Vols. 1, 2. Cambridge, MA: MIT Press. animals of those that ly would result in a heterogeneous class
Kent, Ray D. 2004. Development, pathology and remediation of
that included birds, bats, lying ish, and winged insects to the
speech. In From Sound to Sense: 50+ Years of Discoveries in Speech
exclusion of penguins, ostriches, emus, and kiwis.
Communication, ed J. Slifka et al. Cambridge, MA: Research
Laboratories of Electronics, MIT.
Kiparsky, Paul. 1988. Phonological change. In Linguistics: he Cambridge Universals Deriving from Speech Aerodynamics
Survey. Vol. 1. Ed. F. J. Newmeyer, 363415. Cambridge: Cambridge All languages have consonants and vowels. Among conso-
University Press. nants, all languages employ stops. Among stops, voiceless stops
Lieberman, Philip. 1991. Uniquely Human. Cambridge: Harvard are the default; that is, if a language employs voiced stops it
University Press. will also have voiceless stops, but not the reverse. his can be
Lindblom, Bjrn. 1990. Explaining phonetic variation: A sketch of the H&H explained by the aerodynamic voicing constraint (AVC) (Ohala
theory. In Speech Production and Speech Modeling, ed. W. Hardcastle 1983): Voicing requires air low through the approximated vocal
and A. Marchal, 40339. Dordrecht, the Netherlands: Kluwer.
cords, and this requires a positive pressure diferential between
. 2007. he target hypothesis, dynamic speciication and seg-
the subglottal and the oral air pressures. During obstruents, the
mental independence. In Syllable Development: he Frame/Content
lowing air is blocked by the consonantal closure so that air accu-
heory and Beyond, ed. B. Davis and K. Zajd. Hillsdale, NJ: Lawrence
Erlbaum. mulates in the oral cavity, thus increasing the oral air pressure
Lindblom, B., and I. Maddieson. 1988. Phonetic universals in conso- above the glottis such that eventually the required pressure dif-
nant systems. Language, Speech and Mind, ed. Larry M. Hyman and ferential diminishes, thereby reducing transglottal airlow below
C. N. Li, 6278. London and New York: Routledge. the level needed for vocal cord vibration. Another universal pat-
MacNeilage, Peter F. 2008. he Origin of Speech. New York: Oxford tern explained in part by the AVC is that among languages that
University Press. do have voiced stops, it is often the case that the back-articulated
Miller, George A. 1991. he Science of Words. New York: Freeman. stop is missing, for example, as in Dutch and hai. his is because
Oudeyer, Pierre-Yves. 2006. Self-Organization in the Evolution of Speech. insofar as the AVC can be ameliorated, it is due to the compli-
New York: Oxford University Press.
ance of the surfaces of the vocal tract to the impinging oral air
Stevens, Kenneth N. 1989. On the quantal nature of speech. J Phonetics
pressure. he magnitude of this compliance is greatest for labial
17: 346.
obstruents (due to expandability of the cheeks), less for apicals,
Studdert-Kennedy, Michael. 2005. How did language go discrete? In
Language Origins: Perspectives on Evolution, ed. Maggie Tallerman, and least for velars, which have the least surface area exposed
4867. New York: Oxford University Press. to the oral pressure. hese factors also help to explain the kind
of sound change that occurred in Nubian, now manifested as
a morphophonemic alternation, whereby geminated voiced
PHONOLOGY, UNIVERSALS OF stops become voiceless at all places of articulation except labial
Phonological universals are those aspects of languages sound (Table 1; data from Bell 1971).
system that are found either in every or most human lan- Among languages that have both voiced and voiceless stops,
guages or in diverse languages where their presence cannot be there are many that have only voiceless fricatives (e.g., hai,
accounted for by inheritance from a common parent language, Galician, Taba). Again, the AVC is part of the explanation: As
geographical proximity, or borrowing. hey are often referred
to as unmarked or default conditions in languages phonologies
Table 1
when these terms imply reference to common cross-language
patterns. here are universal patterns in languages 1) sound Noun stem Stem + and English gloss
inventories, including their prosodies, 2) sequential constraints
/fab/ /fab:n/ Father
(how sounds are sequenced), and 3) sound changes and the
phonological alternations they create within a given language. /sgd/ /sgt:n/ Scorpion
hey are of interest because they give insight into the physical /ka/ /ka: n/ Donkey
factors that shape human speech, help to elucidate mechanisms
/mg/ /mk:n/ Dog
of sound change, and, perhaps, suggest something about the
618
Table 2. Table 3.
Language Voiceless and voiced Voiced only a. Sundanese:
Awadi i, u, e a, o n to wet
Campa i o, e, a bhr to be rich
Chatino i, u o, e, a hkn to inform
Dagur i, u, e o, a msih to love
Huichol i, , e u, a
b. Tereno:
Serbo-Croatian i, u e, o, a
1st person 3rd person
Tadjik i, u, a e, o, u mbiho
piho I/he went
Tunica u i, e, , a, , o
ahjaao anaao I/he desire(s)
Uzbek i, u e, , o, a
iso nzo I/he hoed
owoku wgu my/his house
mentioned, optimal conditions for voicing require oral pressure ajo jo my/his brother
as low as possible (with respect to subglottal pressure), but opti- emou m my/his word
mal conditions for generating frication (turbulence) at an oral
iha nza my/his name
constriction requires the oral pressure as high as possible (with
respect to atmospheric pressure). hese conditions are con- Sources: For Sudanese: Robins 1957; for Tereno: Bendor-Samuel 1960, 1966.
tradictory. hus, voiced fricatives are less common than voice-
less ones. Phonetically, in languages that have both voiced and
voiceless fricatives (e.g., English, French, Italian, etc.), the frica- glides following, unless blocked by a buccal obstruent (that is,
tion noise of voiced fricatives is always less than that for voiceless one made in the oral cavity from the uvular-velar region to the
fricatives. lips). Nonbuccal obstruents such as the glottal fricative [h] or
Although all languages have voiced vowels, some languages the glottal stop [] do not block it. his follows from a straightfor-
feature voiceless vowels as well, though often these are con- ward physiological constraint: Buccal obstruents, insofar as they
textually determined, for example, word inally or in the envi- require the buildup of oral pressure, cannot tolerate venting of
ronment of voiceless consonants. In any case, it seems to be this pressure via an open velic port. he nonbuccal obstruents
always the case that a voiceless vowel has a voiced counterpart. require a pressure buildup in a cavity that does not access the
J. H. Greenberg (1969) provided a survey of the incidence of velic port, and so whether the velic port is open or closed is irrel-
voiceless vowels in several languages. He found a virtually uni- evant to their production.
form pattern: Voiceless vowels appear as the counterparts to Among fricatives, the most common are the apical s-like fric-
vowels higher in the vowel space. (See Table 2.) he explanation atives (Maddieson 1984). his stems from a combination of aero-
for this also requires reference to the AVC. Among vowels, high, dynamic and anatomical factors. Apical fricatives have relatively
close vowels like [i] and [u] are almost obstruents. If articulated long and intense noise in the high frequencies (3 to 8 kHz) and
suiciently close, they impede the exiting airlow almost as much are thus easily detected and are distinct from all other speech
as fricatives. his, in combination with other factors that could sounds. his is due to the fact that the approximation of the
create a slightly open glottis via coarticulation, such as appear- tongue apex at or near the alveolar ridge enables the generation
ing in word- (and thus utterance-) inal position or near voice- of a relatively focused high-velocity air jet, which itself generates
less obstruents, can lead to the vowel being voiceless. he same noise, but the air jet is also directed at the incisors, which act as a
factors apply to glides (approximants) that are high, close like bale and cause the generation of more high-frequency noise as
[j], [w], and [] and account for the frequent devoicing and frica- the air hits the teeth surface (this is why s sounds are impaired
tivization that gives rise through sound change to cases like the in the speech of juveniles when they lose their primary teeth and

dialectal alternations in English as Tuesday [thjuzdi] ~ [th uzdi], before the growth of their permanent teeth). Additionally the

and lieutenant [lwtnnt] ~ [lftnnt] and truck [t k]~[th k]
h small space between the tongue apex and the lips constitutes a
(and similar patterns in many other languages). he same factors resonator that reinforces high frequencies.
frequently lead to the africation of stops before high close vow- he existence and properties of a resonator downstream of
els or glides as in Japanese, for example, or the sound change that the point where turbulent noise is generated underlies another
converted Benjamin Franklins natural [ntjul] to the modern marked asymmetry in the incidence of stop types. We saw previ-
pronunciation [n t l]. ously that in languages that have both voiced and voiceless stops,
Aerodynamic factors also explain patterns of nasal prosody the voiced velar stop [g] is often missing. Among voiceless stops,
in languages as diverse as Sundanese (spoken in the Indonesian the bilabial [p] is often missing (Sherman 1975), for example, in
archipelago) and Tereno (spoken in the Mato Grosso, Brazil). Arabic and in Aleut (except for loanwords) and in Proto-Celtic.
As shown in Table 3, in these (and other) languages, the pres- Noise generated by the air turbulence at the lips have no down-
ence of a nasal consonant induces nasalization on all vowels and stream resonator to amplify the noise.
619
Table 4.
Kpelle: [w] patterns with velars in nasal assimilation:
Indeinte Deinite
`mi wax
lu `nui fog, mist
l `il dog
we `wei white clay
Notes: Melanesian: m > / __w:

Common Melanesian /limwa/ hand ~ Fijian /linga/ (= phonetic [liwa])
/mala/ ~ /mwala/ ~ /wala/ (name of the Mala Island in different dialects of
the island)
Sources: For Kpelle: Welmers 1962; for Melanesian: Ivens 1931.
Figure 1. A schematic representation of the resonating cavities during the

production of different nasal consonants. The solid line demarcates the continuant) + glide + vowel and at syllable ofset, the reverse.
main pharyngeal-nasal cavity, which is the same for all such nasals. What he English words swamp and tryst, the French words plume
differentiates one nasal from another is the effect produced by the oral [plym] and soir [swa], and the Czech Psov name of a city
resonator, which branches off this main cavity. Even though a labial velar [psf] would thus adhere to this generalization. But there are
consonant has two main constrictions, it is only the rearmost, the velar reasons to be skeptical of the sonority hierarchy. First, there is
constriction, that matters and thus sounds similar to the velar nasal []. no empirical content to the term sonority; it has never been ade-
quately deined. Second, it ignores such very common clusters
as /sp/, /st/, and so on in syllable initial position and /ps/, /ts/,
Virtually all languages employ nasal consonants (Ferguson and so on in syllable inal position. hird, it ignores cross-lan-
1963). However, there are never more place distinctions among guage prohibitions of onset sequences like /tl/, /dl/, /ji/, /wu/,
nasals than there are obstruents, and there are often fewer. he /twu/, and /bji/, that is, sequences that have similar elements.
acoustics of nasals probably account for this. All nasal conso- John Ohala and H. Kawasaki-Fukumori (1997) suggest replac-
nants have in common the pharynx-plus-nasal air space. What ing a) the one-dimensional concept of sonority with a multidi-
diferentiates one nasal consonant from another is the efect of mensional measure, where similarity of sounds is a function of
the oral cavity, which branches of the nasal-pharyngeal cavity acoustic amplitude, formant frequencies and spectral shape in
(see Figure 1). hus, although nasals are highly distinct as a class general, degree of periodicity (whether from fricatives or stop
from non-nasals, they are auditorily very similar to one another. bursts), and even fundamental frequency; and b) the notion of
his also partly accounts for the frequent pattern whereby nasals the ixed hierarchy with a measure of the degree of similarity of
assimilate in place to a following stop, for example, English sounds according to (a). he more two sounds are similar, the
incredible [ikhbl]< in (neg. preix) + credible; Latin qunctus less common would such sequences be found; the greater the
> quntus (where original n = [] > [n] / __t). diference in sounds, the more common. By this criterion, initial
An interesting cross-language pattern is the character of nasal sequences like /sp/and inal sequences like /-kst/ (in English
assimilation to labial velar consonants such as [kp], [ b] and text), and so forth are normal, and initials /tl/, /ji/, /wu/, and so
[w], that is, segments that have equal constrictions in the labial forth are less preferred.
and velar region. he nasal that appears before such segments
is invariably a velar [] not the labial [m], for example, Kpelle Sound Changes
and Melanesian (Table 4). he explanation for this pattern can Of the thousands of regular sound changes that have been
be seen in Figure 1. What matters for the place of articulation of identiied using the comparative method in historical pho-
a nasal consonant is the irst buccal constriction encountered nology, certain ones are recognized as showing independent
from the nasal pharyngeal cavity. In a labial velar, this is the velar cross-language incidence. One such is velar palatalization, k >
constriction; the labial constriction, being beyond that, is acous- t, t, s, /__i (j), (and similar changes involving the voiced velar
tically largely irrelevant (Ohala and Ohala 1993). /g/), for example, English cheese [thiz]] from Latin caseus (cf.
Dutch kaas); Ikalanga [ti-ledu] chin < Proto-Bantu *ki-dedu.
Phonotactics Traditionally, the causes of sound change were attributed to two
he conventional view of common cross-language sound opposite tendencies: speakers striving for ease of articulation,
sequencing or phonotactics is couched in terms of whats which would lead to assimilations and reductions, and speak-
called the sonority hierarchy (attributed to E. Sievers and O. ers striving to speak more clearly, which would lead to exag-
Jespersen), whereby the favored pattern at syllable onset shows geration of articulation and augmentation of pronunciation.
sounds sequenced in the following order (where omissions here is no doubt that speakers do alter their pronunciation
are possible): stop + fricative + nasal + liquid (i.e., non-nasal in these manners, but it may be seriously questioned whether
620
Phonology, Universals of Phrase Structure
these changed forms replace previous norms of pronunciation. S. Eliasson and E. H. Jahr, 34365. Trends in Linguistics: Studies and
here is no evidence for this. here is, however, an alternative Monographs, Vol. 100. Berlin: Mouton de Gruyter.
scenario of sound change that does have empirical support: lis- Ohala, J. J., and J. Lorentz. 1977. he story of [w]: An exercise in the
teners errors. here have been numerous speech-perception phonetic explanation for sound patterns. Berkeley Linguistics Society,
Proceedings, Annual Meeting 3: 57799.
experiments, some involving natural speech, which revealed
Ohala, J. J., and M. Ohala. 1993. he phonetics of nasal phonol-
errors that mirrored sound change; for example, in a study in
ogy: heorems and data. In Nasals, Nasalization, and the Velum, ed.
Winitz Scheib, and Reeds (1972) where listeners heard a frag- M. K. Hufman and R. A. Krakow, 22549. San Diego, CA: Academic
ment of consonant/vowel (CV) syllables, [khi] was misidentiied Press.
as [thi] 47 percent of the time, paralleling the change in place Pertz, D. L., and T. G. Bever. 1975. Sensitivity to phonological universals
found in velar palatalization. Ohala (1981) has elaborated a in children and adolescents. Language 39: 34770.
theory of sound change based on listeners misperception or Robins, R. H. 1957. Vowel nasality in Sundanese. In Studies in Linguistic
misparsing of the speech signal. Such common sound change Analysis, 87103. Oxford: Blackwell.
includes VN > V , for example, Sanskrit dant tooth > Hindi / Sherman, D. 1975. Stop and fricative systems: A discussion of paradig-
dt/, Latin bon- good > French /b/, and the assimilation of matic gaps and the question of language sampling. Stanford Working
place in C1C2 consonant clusters, Latin scriptu > Italian scritto, Papers in Language Universals 17: 131.
Welmers, W. E. 1962. he phonology of Kpelle. Journal of African
English congress [khagrs] < (ultimately) Latin com- together
Languages 1: 6993.
+ gradi to walk.
Winitz, H., M. E. Scheib, and J. A. Reeds. 1972. Identiication of stops
and vowels for the burst portion of /p,t,k/ isolated from conversation
Phonological Universals and Universal Grammar
speech. Journal of the Acoustical Society of America 51.4: 130917.
It has also been proposed that phonological universals arise from
humans genetic endowment in the form of whats called univer-
sal grammar (Pertz and Bever 1975). Such claims have been dis- PHRASE STRUCTURE
puted by those who ind phonological universals rooted in the
It is an ancient observation that natural language syntax is hier-
physical and physiological attributes of all human speakers and
archically organized. As can be seen from a variety of diagnostics,
hearers. he dust has not settled on this issue as yet.
the words comprising a sentence do not behave as beads on
John Ohala a string but group into successively larger units, or constituents.
Phrase structure (PS) is a formal representation of this constit-
WORKS CITED AND SUGGESTIONS FOR FURTHER READING uent structure. PS is typically depicted as a tree-structured
Bell, H. 1971. he phonology of Nobiin Nubian. African Language graph (Figure 1), which encodes three sorts of structural informa-
Review 9: 11559. tion: i) dominance, specifying the words and constituents that a
Bendor-Samuel, J. T. 1960. Some problems of segmentation in the pho- constituent contains within it (e.g., as shown by vertical placement
nological analysis of Tereno. Word 16: 34855. in the igure, prepositional phrase (PP) dominates on and televi-
. 1966. Some prosodic features in Terena. In In Memory of J. R. sion); ii) precedence, specifying the temporal orderings among the
Firth, ed. C. E. Bazell, J. C. Catford, M. A. K. Halliday, and R. H. Robins, words and constituents (e.g., as shown by horizontal position, the
309. London: Longmans. constituent most fans precedes the constituent watched the game
Ferguson. C. A. 1963. Some assumptions about nasals. In Universals of on television); and iii) labeling, specifying the grammatical cate-
Language, ed. J. H. Greenberg, 427. Cambridge, MA: MIT Press.
gory of each word and constituent (e.g., the constituent the game
Greenberg, J. H. 1969. Some methods of dynamic comparison in linguis-
is a noun phrase (NP)). In PS-based approaches, this structural
tics. In Substance and Structure of Language, ed. J. Puhvel, 147203.
information plays an important role in deining the conditions
Berkeley and Los Angeles: University of California Press.
. 1970. Some generalizations concerning glottalic consonants, under which grammatical dependencies may obtain (see agree-
especially implosives. International Journal of American Linguistics ment, anaphora, binding, and case), and PS is often taken to
36: 12345. be the input to transformational operations (see movement and
Greenberg, J. H, C. A. Ferguson, and E. A. Moravcsik, eds. 1978. Universals transformational grammar). Further, PS representations
of Human Language. Vol. 2. Phonology. Stanford, CA: Stanford serve as the interface between syntax and semantics, as they pro-
University Press. vide the structural information necessary for interpretation (see
Ivens, W. G. 1931. A grammar of the language of KwaraAe, North Mala, compositionality, thematic roles, and logical form).
Solomon Islands. Bulletin of the School of Oriental Studies 6: 679700. A fundamental question concerns how the range of possible
Maddieson, I. 1984. Patterns of Sounds. Cambridge: Cambridge University
PS is speciied in a grammar. he earliest answer comes from
Press.
Noam Chomsky (1957), who suggests that PS is generated by a
Ohala, J. J. 1981. he listener as a source of sound change. In Papers
set of phrase structure rules, like the following:
from the Parasession on Language and Behavior, ed. C. S. Masek,
R. A. Hendrick, and M. F. Miller, 178203. Chicago: Chicago Linguistics 1. S NP VP
Society.
. 1983. he origin of sound patterns in vocal tract constraints.
2. NP N
In he production of Speech, ed. P. F. MacNeilage, 189216. New 3. NP Det N
York: Springer-Verlag. 4. VP V NP
Ohala, J. J., and H. Kawasaki-Fukumori. 1997. Alternatives to the sonor-
ity hierarchy for explaining segmental sequential constraints. In 5. VP VP PP
Language And Its Ecology: Essays In Memory Of Einar Haugen, ed. 6. PP P NP
621
Pidgins
(Baker and Muhlhausler 1990). Pidgins used widely for trading

purposes (but not limited to such functions) include Russenorsk,
Chinese Pidgin English, and Chinook Jargon, once widely used in
the American Northwest. Other pidgins arose where large num-
bers of slaves and/or indentured laborers had to work together
on colonial plantations. Such pidgins were usually short-lived,
evolving into creole languages; contrary to some claims, careful
examination reveals manifold signs of their pidgin ancestry, and
the earliest attestations of some (Baker and Corne 1982; van den
Berg 2000) show pidgin-like structures. Although the reality of this
pidgin-to-creole cycle has been denied (see entry on creoles),
massive empirical evidence exists in Hawaii, as described by S. J.
Roberts (1995, 1998); there is also evidence of prior pidginization
in other creoles, such as fossilized sequence markers and marked
Figure 1. Phrase structure representation for Most fans watched the game vocabulary mixture (see the following examples).
on television. As compared with natural languages (including creoles), all
pidgins are severely impoverished, with sharply reduced vocab-
ularies, few structural consistencies, and few if any inlectional
In these rules, a symbol appearing to the left of the arrow can be
aixes; complex sentences very seldom occur. Function words
rewritten as the sequence of symbols to the right of the arrow. he
are rare, if not completely absent; categories normally expressed
process of PS generation begins with a distinguished start sym-
via auxiliary verbs of tense, mood and aspect are indicated, if at
bol S and successively rewrites the symbols in the string using
all, by two adverbial forms meaning roughly soon or inish
the rules of the grammar until no rewritable symbols remain.
that are attached, not adjacent to the verb as in natural lan-
An example of this process follows, with the number above each
guages generally, but clause-inally or clause-initially. We ind,
arrow indicating the rewriting rule used:
1 3 5 4 3
for example, baimbai (English by-and-by) and pau (Hawaian
S NP VP Det N VP Det N VP PP Det N V NP PP inished), baimbai and pinis in Tokpisin; also a number of sim-
6 2
ilar pidgin fossils in creoles (sometimes inside, sometimes still
Det N V Det N PP Det N V Det N P NP Det N V Det N P N outside the verb phrase), such as in(i), inish in French-related
creoles, done in English-related creoles, or (ka)ba (Portuguese
he PS in Figure 1 can be understood as a history of this acabar inish) in Portuguese-related creoles.
derivation: he children of a node correspond to the sequence If a pidgin persists in a relatively stable population (one not
of symbols into which that node is rewritten. Some recent subject to the rapid expansion and turnover that typically char-
approaches have maintained rewriting as part of the grammar acterize creole societies) and is widely used over a long enough
but have questioned the nature of the rewrite rules employed period, it may acquire a more stable (although still limited)
in this system, generalizing and modifying them in a variety structure. However, pidgins still sufer from widespread misun-
of respects (see x-bar theory and minimalism ). Other derstanding of the linguistic mechanisms through which they
approaches have abandoned rewriting, taking well-formed arise. According to many writers (e.g., Bakker 1995; Manessy
PS representations to be those that best satisfy a set of gram- 1995) they are reduced or simpliied versions of preexisting
matical constraints (see head-driven phrase structure languages, or failed attempts by speakers with inadequate access
grammar, lexical-functional grammar , and opti- to acquire the locally dominant language a view reinforced by
mality theory ). standard usage of expressions such as Pidgin English, Pidgin
French, and so on.
Robert Frank Pidgins do not derive from processes applied to any preexist-
ing natural language, however, but (as is clear both from histori-
WORKS CITED AND SUGGESTIONS FOR FURTHER READING. cal data in Hawaii and reminiscences of older residents; see, e.g.,
Baltin, Mark, and Anthony Kroch, eds. 1989. Alternative Conceptions of Bickerton 1981, 11) arise naturally from strategies employed by
Phrase Structure. Chicago: University of Chicago Press. individuals of any ethnic background in a multilingual situation
Chomsky, Noam. 1957. Syntactic Structures. he Hague: Mouton. where no single existing language is both viable and accessible.
Speakers seek to communicate by any means possible, using iso-
PIDGINS lated words from their own language, from their interlocutors
language (if they know any), and from any third or fourth lan-
Pidgins are the worlds only non-native languages. hey are guage that they may happen to have picked up.
typically acquired by adults, after the critical period for language hese words are seldom assembled in the way words are
acquisition has passed. hey normally arise wherever suicient assembled in modern human languages, that is, hierarchically.
speakers of mutually incomprehensible languages must inter- Except for occasional rote-learned phrases, words are attached
act with one another. Some pidgins arose through and for trade; sequentially, like beads on a string. Consequently, no true gram-
the most plausible derivation ofered for the origin of the name matical relations exist, limiting utterances to brief strings of a few
pidgin attributes it to the Chinese pronunciation of business words without embedding.
622
Pidgins Pitch
he degree to which pidgins (and subsequent creoles) show Bakker, P. 1995. Pidgins. In Pidgins and Creoles: An Introduction, ed. J.
lexical mixture has been underestimated in the literature. Arends, P. Muysken, and N. Smith, 2539. Amsterdam: Benjamins.
Noteworthy are Russenorsk (with roughly equal quantities of Bickerton, D. 1981. Roots of Language. Ann Arbor, MI: Karoma.
Norwegian and Russian words, but also 14% of its vocabulary Broch, I., and E. H. Jahr. 1984. Russenorsk: A new look at the
Russo-Norwegian pidgin in northern Norway. In Scandinavian
drawn from other languages; Broch and Jahr 1984) and Chinook
Language Contacts, ed. P. S. Ureland and I. Clarkson, 2164.
Jargon (only 41% from Chinook, with at least 11 other languages,
European and non-European, contributing to the remainder; Gibbs, G. 1863. A Dictionary of the Chinook Trade Jargon or Trade
Gibbs 1863). he baragouin that preceded the formation of the Language of Oregon. New York: Gramoisy.
Lesser Antillean French Creoles showed a similar mixture (Wylie Manessy, G. 1995. Croles, Pidgins, Varits Vehiculaires. Paris: CNRS
1995). Editions.
Evidence from creoles suggests that the pidgins they evolved Price, R. 1976. he Guiana Maroons: A Historical and Bibliographical
from had equally mixed vocabularies. Berbice Dutch draws 27 Introduction. Baltimore: Johns Hopkins University Press.
percent of its vocabulary from one African language, Ijaw (Smith, Reinecke, J. E. 1969. Language and Dialect in Hawaii. Honolulu: University
Robertson, and Williamson 1987). Saramaccan may have as of Hawaii Press.
many as 50 percent African words (Price 1976). Comparison of Roberts, S. J. 1995. Pidgin Hawaiian: A sociohistorical study. Journal of
Pidgin and Creole Languages 10: 156.
Saramaccan and Sranan vocabularies shows that these creoles,
. 1998. he role of difusion in the genesis of Hawaiian Creole.
both derived from the same pidgin, difer in perhaps as many as
Language 74: 139.
75 percent of their vocabulary items; contra most sources, rela- . 2005. he Emergence of Hawaii Creole English in the Early 20th
tively few of these diferences involve a Portuguese/English con- Century: he Sociohistorical Context of Creole Genesis. Ph.D. diss.,
trast, strongly suggesting an antecedent macaronic pidgin that Stanford University.
drew on English, Dutch, and Portuguese, as well as a variety of Smith, N., I. Robertson, and K. Williamson. 1987. he Ijaw element in
African and Amerindian languages, and from which Sranan and Berbice Dutch. Language in Society 16: 4990.
Saramaccan each made a diferent selection. van den Berg, Margot. 2000, Mi no sal tron tongo: Early Sranan in Court
Why pidgins have so often been regarded as simpliica- Records, 16671767. Unpublished Masters thesis, Radboud University
tions of particular (almost invariably, European) languages is Nijmegen.
Wiley, J. 1995. he origin of Lesser Antillean French Creole: Some lit-
revealed by the massive database of contemporary citations
erary and lexical evidence. Journal of Pidgin and Creole Languages
gathered by Roberts (summarized in Roberts 1995, 1998. 2005,
10: 71126.
but not yet published in its entirety). From this data, it is clear
that pidgin descriptions have been shaped by observer bias.
Most citations from English-language sources contain a pre- PITCH
ponderance of English words, showing why J. E. Reinecke (1969)
and others characterized the lingua franca of early Hawaii as a When an object vibrates, its movement produces changes in
predominantly English pidgin with a sprinkling of Hawaiian air pressure that radiate like waves from the source. If the fre-
words. However, the abundant Hawaiian-language sources quencies of the vibrations are roughly between 20 and 20,000
reverse this picture, presenting a predominantly Hawaiian cycles per second, or Hertz (Hz), ideally they can be heard by a
vocabulary with a sprinkling of English words, while the much young, healthy human listener. he range of hearing frequen-
sparser Japanese-, Chinese-, and Portuguese-language sources cies declines with age. he physical characteristic of the vibrat-
each contain a higher admixture of their own languages (brief ing body, frequency, produces a psychological experience called
sentences containing words from three diferent languages are pitch. In general, a low frequency produces the sensation of a low
by no means uncommon). Clearly, in a pidgin situation, observ- pitch (for example, the 60 Hz hum produced by electrical power
ers record what they best understand and downplay or ignore in a poorly grounded radio), with the pitch increasing as the fre-
the rest. quency increases (a male voice at 100 Hz, a female voice at 200
he well-attested existence of a pidgin phase in the life cycle Hz, a childs voice at 300 Hz). Because there is a close corres-
of creoles also helps to explain the strong structural similari- pondence between frequency and pitch, people frequently use
ties that hold between creoles of widely diferent provenance. the terms interchangeably. However, in addition to frequency,
For such similarities to arise, input had irst to be reduced to an the sensation of pitch is inluenced by an interaction between
abnormally low level of structure, forcing children to draw on the amplitude of the vibration and the range of the frequency.
their innate language faculty for the systematic structures that a Pitch is also inluenced by the complexity of the vibration and its
pidgin can manage without, but that are essential for any natural corresponding wave form.
language. A vibrating body oscillates as a single entity, producing a fre-
quency referred to as the fundamental frequency. So, when the
Derek Bickerton key for A above middle C is played on a piano, a string vibrates
at 440 Hz. Vibrating bodies are not perfectly rigid, though, and
WORKS CITED AND SUGGESTIONS FOR FURTHER READING the string also vibrates in parts as if it is two strings (producing
a frequency of 880 Hz), and three strings (1320 Hz), etc. hus, a
Baker, P., and C. Corne. 1982. Isle de France Creole. Ann Arbor,
MI: Karoma. vibrating body produces a series of frequencies beginning with
Baker, P., and P. Muhlhausler. 1990. From business to pidgin. Journal the fundamental frequency (f0) and including its harmonics,
of Asian Paciic Communication 1: 87115. which are multiples of the f0. he distribution of acoustic energy
623
Pitch Poetic Form, Universals of
across the harmonic series contributes to the quality or timbre In sum, the perception of pitch can play linguistic and para-
of the sound. In addition to sound quality, the harmonics con- linguistic roles at the suprasegmental and segmental levels of
tribute signiicantly to the perception of pitch. he fundamental utterances. Pitch is closely related to the physical stimulus fre-
frequency of a harmonic series can be artiicially removed with- quency, but as a psychological event, it is inluenced by the com-
out changing the pitch, a demonstration referred to as the miss- plexity, frequency range, and loudness of the tone. Pitch can be
ing fundamental. In speech, the harmonic series is a function of processed in a low-resolution mode at many levels in the ner-
the complex wave produced by the glottal source. Formants are vous system or at high-resolution mode in specialized areas of
bands of resonance that concentrate the acoustic energy pro- the cerebral cortex of the brain in the right temporal lobe. Pitch
duced by the glottal source as a function of the vocal tract conig- can also be processed in linguistic and nonlinguistic modes by
uration and have center frequencies that relect the vocal tract, the left and right temporal lobes of the brain, respectively. he
rather than the harmonic series. variation of pitch during luent speech can be considered a truly
Pitch can be experienced from pure tones (f0 alone), which integrative process that conveys both linguistic and paralinguis-
do not occur naturally, as well as from complex tones (f0 + har- tic information.
monic series), but the pitch of complex tones is a stronger per-
John Sidtis
cept, allowing iner discriminations of f0 frequency diferences.
he processing of pitch from a determination of the f0 versus
the pattern recognition of a harmonic series relies on difer-
ent neurological systems. Simple frequency determination can Crystal, David. 1987. he Cambridge Encyclopedia of Language. New
occur at multiple levels of the nervous system, but complex pitch York: Cambridge University Press. Presents multiple brief discussions
processing occurs in auditory areas in the right cerebral hemi- of pitch in diferent linguistic roles.
Sidtis, John J. 1980. On the nature of the cortical function underlying right
sphere that complement speech and language areas in the left
hemisphere auditory perception. Neuropsychologia 18.3: 32130.
cerebral hemisphere (Sidtis 1980; see also right hemisphere
Sidtis, John J., and D. Van Lancker Sidtis. 2003. A neurobehav-
language processing and left hemisphere language ioral approach to dysprosody. Seminars in Speech and Language
processing). 24.2: 93105. Describes how diferent aspects of prosody rely on dif-
For practical purposes, pitch in language can be viewed as a ferent brain structures.
direct function of f0. Unlike the pitch distinctions made in music, Van Lancker, Diana, and V. A. Fromkin. 1973. Hemispheric specializa-
linguistic pitch distinctions are comparatively coarse. Whereas tion for pitch and tone: Evidence from hai. Journal of Phonetics
a musical octave can be divided into 12 semitones, and vibrato 1: 1019.
in experienced singers can be consistently less than a semitone,
most linguistic communicative situations only require distinc-
POETIC FORM, UNIVERSALS OF
tions of three semitones or more. Further, the pitch distinctions
in language are relative, allowing men, women, and children to A poetic universal is manifested by a feature that is found very
make the same linguistic and paralinguistic distinctions despite widely (for example, rhyme; see rhyme and assonance) or by
diferent vocal f0s (see paralanguage), whereas pitch distinc- a relation between features that is found very widely (for exam-
tions in music reference speciic frequencies (e.g., a musical ple, rhyme is generally found in verse, not in prose). Like linguis-
scale tuned to 440 Hz). tic universals, poetic universals might be studied by comparative
A number of linguistic and paralinguistic phenomena are work (often depending on fairly salient features) or by focused
provided by pitch. At the suprasegmental level, pitch produces work on the abstract forms (hypothesized abstract universals)
the melodic line of an utterance to convey linguistic intona- underlying the surface poetic forms in a particular language.
tion (e.g., declination efect: falling pitch anticipating the end A theory of universals can be formulated in terms of universal
of a statement, rising pitch indicating a question), sociolin- parameters (sets of related formal options) from which a speciic
guistic information (e.g., uptalk, rising pitch at the end of a poetic tradition makes speciic choices. Unlike a language, where
statement, falling pitch as a cue for turn-taking), and paralin- only one choice can be made, a literary tradition can divide into
guistic information (e.g., emotion, attitude). Pitch can also be subtraditions, each making a diferent choice (thus, for example,
used with loudness to provide syllable accent at the segmen- classical Sanskrit literature includes quite diferent kinds of met-
tal level. rical verse). here is no presupposition that diferent modalities
Pitch also has a lexical role in tone and pitch accent lan- will throw up signiicantly diferent universals; thus, the gen-
guages. Tone languages may have many tone patterns (esti- eral assumption is that written, oral, and signed literatures will
mates vary, but the numbers are fewer than the 12 notes in the have similar characteristics (see oral composition and sign
musical octave), and they tend to fall in relative categories like languages). he term literature is here used interchangeably
high, medium, and low, further distinguished by rising and fall- with verbal art and should not be used to imply a special status
ing patterns. Because such distinctions are relative, the listener is for written literature.
required to perform a tone normalization to identify a speakers With the exception of folklore studies (which, however, tend
lexical tones. Just as simple and complex pitch perception rely to have a narrow areal range), no discipline or subdiscipline
on diferent brain mechanisms, the processing of pitch for lin- takes as its responsibility the investigation of poetic universals.
guistic and nonlinguistic purposes engage diferent neurological Some researchers are actively hostile to universals in favor of
systems, principally the temporal lobes in the left and right an alternative emphasis on the special characteristics of each
cerebral hemispheres (Van Lancker and Fromkin 1973). tradition, and some ieldworkers ignore verbal arts when they
624
Poetic Form, Universals of
gather information about a language; some missionary linguists his generalization also draws attention to the fact that literary
have even been known to displace indigenous verbal arts with practices can have features in common because of imitation
hymns or Bible stories, with the odd result that it is these, rather of an admired writer or foreign tradition. Finally, generaliza-
than indigenous texts, that are gathered in grammars and other tion 5 is not true of all ways of writing verse, but we might ask
reports. hus, there has been relatively little work, either descrip- whether the wide acceptance of this practice tells us some-
tive or theoretical, on poetic universals. thing about the cognitive status of lines (e.g., that we cognize
he universal of poetic form that is most widely manifested, each line as a separate, isolated unit). hese are the kinds of
and may indeed be found everywhere, is the possibility of verse, questions we might ask in exploring the possibility of univer-
as a way of organizing language. A text that is in verse is a text sals of poetic form.
cut into a sequence of lines (= verse lines). A line is a section of Relative to the line, six categories of poetic form might be iden-
text that supports two or more generalizations, and the investiga- tiied, which follow; there may be others (such as the tendency
tion of universals of poetic form is largely an investigation of the to use speciic words at line edges), and the grouping in this list
generalizations formulated in terms of the line. depends on theoretical assumptions and is not simply given to
For example, Miltons poem Paradise Lost is in lines, and us by the data. For each of the categories, we might explore its
so is verse. (his example is chosen [see Fabb 2002] because status as a universal. None of these kinds of form is required in a
in the eighteenth century some critics claimed that it was not verse tradition. Most verse traditions are either metrical (i.e., they
verse.) Here are ive generalizations that are supported by the involve the counting of syllables) or parallelistic, which is itself
line in this poem: 1) here are 10 syllables in each line; 2) the an interesting universal. Either metrical or parallelistic verse can
end of the line coincides with the end of a word; 3) while there also have rhyme and alliteration, though rhythm and word
is a tendency for stressed syllables to be in even-numbered boundary rules are usually found only in metrical verse. here
positions, the line-initial position is often also occupied by a are some verse traditions, such as modern free verse, that do not
stressed syllable; 4) the word of is found with greater than consistently manifest any of these categories of poetic form (but,
expected frequency as the irst word in a line, as seen, for exam- as noted, they may manifest other categories of poetic form, such
ple, in the irst two lines of the poem; and 5) the printed form of as the tendency to use particular words or particular syntactic
the text arrays lines in a vertical sequence. he study of poetic structures at line edges).
universals seeks to establish the distribution of each of these 1. he counting of syllables: In all metrical verse, the
kinds of form (i.e., each of these generalizations) and then to line has a speciic number of syllables (Irish deibhidhe has 7,
understand whether these distributions imply anything about Icelandic drttkvaett has 6) or a deined range of possible num-
poetic universals. bers of syllables (English iambic pentameter is normatively 10
What would the study of poetic universals make of these but permits 911, French alexandrin 1213, Homeric dactylic
generalizations? Generalization 1 can be understood more hexameter 1317, Japanese haiku 35 or 47 in diferent lines,
abstractly as the line contains a speciic number of syllables, etc.). Some theories of meter suggest that units other than syl-
and this is deinitely a universal, in the sense that it is true of lables can also be counted, such as morae (subsyllabic units)
many verse traditions; however, we might also ask whether the or larger groupings of syllables. Across literary traditions, we
fact that there are speciically 10 syllables in the line also consti- ind that not all syllables in the line are counted for metrical
tutes a (more narrowly manifested) universal. Generalization purposes; in particular, when a vowel-inal syllable precedes a
2 is also very widespread; while lineation does not necessarily vowel-initial syllable, many traditions permit or require these
respect phrase or sentence boundaries, word boundaries to count as a single metrical syllable. Various other generaliza-
are usually respected (and this connects with the fact dis- tions can be made about the counting of syllables, which may
cussed later that metrical rules control for word boundaries be the source of universals (this is the basic claim of Fabb and
but not for phrase or sentence boundaries); hence, there is an Halle 2008).
interesting potentially universal relation between the line that 2. he patterning of syllables, requiring a division of sylla-
is a nonlinguistic section of text and the word that is a linguis- bles into two classes for metrical purposes: Accentual rhythms
tic constituent. Generalization 3 in its speciic formulation is manifest this type of patterning, where syllables are distin-
generally true of English, but more abstractly the possibility of guished into two classes as stressed versus unstressed and pat-
relaxing a rule at the beginning of a line is found widely; for terned on this basis, for example, into triplets where every third
example, Greek verse lines often begin with a syllable whose syllable is stressed. More generally, most kinds of metrical verse
weight is uncontrolled (anceps). Generalization 4 holds true divide syllables into two classes, on the basis of stress, syllable
also of eighteenth- and nineteenth-century verse after Milton; weight, lexical tone, or whether they alliterate and on possibly
is it telling us something signiicant about of or about the natu- other characteristics yet undiscovered); the class membership
ralness of beginning a line with a preposition phrase? On the of a syllable then admits it to speciic positions within the verse
one hand, we might say that post-Miltonic verse is just imitat- line. In some cases, the distribution of the two types of syllable is
ing a kind of form that Milton may have invented, but we might periodic (e.g., a regular recurrence as in an iambic rhythm) and
also note that much twentieth-century free verse also favors in other cases partially periodic or apparently nonperiodic (as
preposition-initial lines. It is worth noting that the Greek early in the supericially aperiodic sequences of heavy and light syl-
elegiac poets tended to begin and end lines mainly with words lables required in Classical Sanskrit verse). An interesting rhyth-
used previously by Homer, suggesting again that choice of par- mic universal is that syllables are divided into just two classes
ticular words at line edges has the potential to be a universal. for metrical purposes, even when there would be a basis in the
625
Poetic Form, Universals of
language for more than two classes. For example, Vietnamese intervening lines) between related elements in diferent lines.
has six types of lexical tone, which are grouped into just two (Bruce Hayes and Margaret MacEachern 1998 discuss universals
tonal classes for the purposes of metrical regulation. It has been in stanza structure.)
claimed that another rhythmic universal is based on the (opti- Are there any kinds of poetic form that are unrelated to the
mality-theoretic) phonological notion of the moraic line? In part, this is a matter of deinition (i.e., of whether we
trochee as a basic rhythmic unit (see, for example, Golston and intend poetic to mean verse). Clearly, various types of igure
Riad 2005). Robbins Burling (1966) claimed that a certain com- and trope are widely found in the worlds literatures, and not only
bination of meter and rhythm is found universally in childrens in verse (though these may be better understood as linguistic or
verse. pragmatic universals, rather than poetic universals; see prag-
3. Word-boundary rules: In the metrical line, two adjacent matics, universals in). And there are possible universals of
syllables can be required to be in separate words that is, a narrative form (see narrative universals) that might also be
word boundary must intervene (by a caesura rule); or they can thought of as poetic, some of which may in fact be related to
be required to be in the same word that is, a word boundary universals of verse form. It is possible that there are universals
must not intervene (by a bridge rule). Thus, for example, the that relate verbal art to counting (perhaps via the aesthetic,
sixth syllable in a French 12-syllable alexandrin must be word- and perhaps extending beyond verbal art). Metricality is based
final. Word-boundary rules are widespread, and this suggests on counting, as are the kinds of form closely related to metrical-
an underlying universal. In particular, the word seems to ity, such as rhythm and word-boundary placement. Parallelism
have a special status in meter: Metrical rules do not control may be based on counting of a diferent kind (a tally or one-to-
for phrase or sentence boundaries, and this also points to a one alignment). Narratives seem to involve counting at various
universal. levels, including Dell Hymess (1992) suggestion that narratives
4. Rhyme understood as the repetition of the end of the are structured around pattern numbers, with narrative units
syllable (usually including its nucleus): Rhyme is very wide- organized in two and four or in three and ive in a particular
spread, including in nonmetrical verse: Parallelistic verse can tradition.
have rhyme, and we may even ind rhyme of a kind in prose. What is the relation between poetic form and linguistic
Rhyme would seem to manifest a universal. Furthermore, it is form? One widely held view (associated, for example, with
cross-linguistically true that sound sequences can be counted Jakobson) is that the forms of poetry in a particular language
as rhyme that are phonetically dissimilar but share underlying are dependent on the linguistic form of that language; the ixing
similarities; this possibility, and perhaps the way in which dis- of a choice from a poetic parameter is thus dependent on the
similar phonetic sequences are admitted as equivalent, may ixing of a choice from a linguistic parameter. hus, for exam-
manifest universals. ple, the claim might be that some languages are better suited to
5. Alliteration: Understood as the repetition of the beginning quantitative meters (where the distinction between heavy and
of the syllable (sometimes including its nucleus), alliteration is light syllables is criterial) and others better suited to accentual
much rarer than rhyme, which may itself tell us something about meters (where the distinction is instead between stressed and
poetic universals. he fact that words beginning with dissimilar unstressed syllables); English, for one, has successful accentual
vowels are considered to alliterate in separate traditions (e.g., verse, but neither nonaccentual syllable counting nor quanti-
Old English and Somali) may suggest a universal. Alliteration tative meters have taken hold in the poetic tradition, despite
also appears to be subject to locality constraints that do not hold attempts to introduce them. Kristin Hanson and Paul Kiparsky
for rhyme; thus, alliteration tends to be line-internal or between (1996) propose a theory of poetic universals that has a param-
adjacent lines and does not interlace as rhyme does in ABAB eter ofering a range of diferently sized phonological units that
structures (Fabb 1999). can match metrical positions; in a speciic tradition, a speciic
6. Parallelism: his formal property is very widespread in the size of phonological unit matches the metrical position. hus,
literatures of the world, and Roman Jakobson (1960) thought of for example, Chinese and Japanese verse both have ive- and
it as a deining formal characteristic of poetry because it draws seven-unit lines, whose positions are illed by syllables in the
attention to form by repeating it (he included meter, rhythm, former and subsyllabic morae in the latter. An alternative posi-
rhyme, and alliteration as types of parallelism). here are dif- tion is taken by Nigel Fabb and Morris Halle (2008), who argue
ferent kinds of parallelism, all quite widely distributed, includ- that poetic form and linguistic form have systemic subcom-
ing parallelism of sound sequences, parallelism of words, and ponents in common, including some parameters (and indeed
parallelism of syntactic structures. Universals have yet to be share subcomponents also with music); however, there is no
established. necessary relation between the poetic form and the linguistic
Lines may be organized into larger units, such as stanzas. he form of a particular language.
possibility of organizing lines into stanzas is suiciently wide- Is verbal art itself a universal? hat is, is there any single way
spread as to count as a universal. Stanzas have characteristics, in which it can be characterized, and distinguished from gen-
such as having a speciic number of lines, lines of the same or eral verbal behavior? he most commonly given answer to this
varying lengths, or rhyme. While there is clearly much variety, question is yes: that verbal art is distinguished from verbal
universals may be discovered, perhaps involving the way lines behavior because it draws attention to its own form. his is the
are counted in a stanza, or the possible ways in which rhyme basis of Jakobsons (1960) projection principle, of Nelsons
patterns can be structured. For example, there may be locality Goodmans (1978) notion of style as exempliied by a text, or
efects, such as limits on the possible distance (e.g., number of of Richard Baumans (1984) notion of verbal art as a text that is
626
Poetic Language, Neurobiology of
fully performed. All of these answers assume that the question In he Origins of Consciousness in the Breakdown of the
is not about the work itself, which cannot categorically be said Bicameral Mind ([1978] 1990), psychologist Julian Jaynes
to be either verbal art (literature) or not verbal art (not litera- claimed that Homers Iliad and the oldest books of the Hebrew
ture); instead, works are more or less verbal art to the extent that Old Testament portrayed human beings in a twilight state of
they carry the distinguishing characteristics of verbal art (or as awareness. According to Jaynes, the characters in the worlds
Goodman would say, works may carry symptoms of verbal art). most ancient poetry take action not as the result of personal
Being verbal art is thus a matter of degree. All of these answers thought and conscious decision but because they hear the voice
also imply that verbal art should be universal and that all users of of a god ordering them to do so. He hypothesized that the voices
language should be able to have a literature because literature is of the gods were actually auditory hallucinations produced in the
just a particular and always-possible way of using language. brains right temporal lobe. Transmitted to the left temporal
lobe, seat of left hemisphere language processing, they
Nigel Fabb
were perceived as coming from outside the self. Pointing to the
metered verse spoken by Greek oracles, the language of Hebrew
prophets, and the god-dictated Vedas of India as evidence of
Bauman, Richard. 1984. Verbal Art as Performance. Prospect Heights, the link between poetry and god-speech, Jaynes asserted that
IL: Waveland. the god-voices spoke in verse. Beginning around 1000 b.c.e., he
Burling, Robbins. 1966. he metrics of childrens verse: A cross-linguis- believed, the discovery and spread of writing brought about a
tic study. American Anthropologist 68: 141841.
breakdown in the functioning of the bicameral mind, although
Edmonson, Murray S. 1971. Lore: An Introduction to the Science of Folklore
the auditory hallucinations of modern-day schizophrenics fur-
and Literature. New York: Holt, Rinehart, and Winston. A survey of the
worlds literatures, with an interest in universals.
nish evidence that contemporary consciousness can revert to its
Fabb, Nigel. 1997. Linguistics and Literature: Language in the Verbal Arts earlier state. While the book was a inalist for the National Book
of the World. Oxford: Blackwell. A survey of linguistic work on litera- Award, his theory has generated controversy.
ture, drawing out potential universals. Homers dactylic hexameter verse line is among those
. 1999. Verse constituency and the locality of alliteration. Lingua surveyed by literary scholar Frederick Turner and psychophysi-
108: 22345. cist Ernst Pppel in their essay, he neural lyre: Poetic meter,
. 2002. Language and Literary Structure: he Linguistic Analysis of the brain, and time (1989). Comparing the metrical verse line
Form in Verse and Narrative. Cambridge: Cambridge University Press. lengths of various language cultures, Turner and Pppel found
Fabb, Nigel, and Morris Halle. 2008. Meter in Poetry: A New heory. that almost all of the lines took two to four seconds to recite,
Cambridge: Cambridge University Press. he irst comprehensive the-
with distribution peaking in the range of 2.5 to 3.5 seconds. he
ory of the worlds meters.
authors suggested that their indings might relect a constant in
Golston, Chris, and Tomas Riad. 2005. he phonology of Greek lyric
meter. Journal of Linguistics 41: 77115.
human neural processing: a human present moment or infor-
Goodman, Nelson. 1978. Ways of Worldmaking. Indianapolis: Hackett. mation bufer averaging about three seconds in length, subject
Hanson, Kristin, and Paul Kiparsky. 1996. A parametric theory of poetic to variation due to cultural factors. Literary critics have targeted
meter. Language 72: 287335. the essays biological reductionism and its underlying politics,
Hayes, Bruce, and Margaret MacEachern. 1998. Quatrain form in English as the authors view free verse as an historical anomaly compat-
folk verse. Language 64: 473507. ible with bureaucratic or even totalitarian modes of cognition. To
Hymes, Dell. 1992 Use all there is to use. In On the Translation of date, their thesis has not been subjected to empirical scientiic
Native American Literatures, ed. B. Swann, 83124. Washington, testing.
DC: Smithsonian Institution Press. Initial data on the neurobiology of poetic language came from
Jakobson, Roman. 1960. Linguistics and poetics. In Style in Language,
studies of subjects who had sustained brain damage or under-
ed. T. Sebeok, 35077. Cambridge, MA: MIT Press.
gone commissurotomy, surgical severing of the corpus cal-
Preminger, A., and T. V. F. Brogan, eds. 1993. he New Princeton
Encyclopedia of Poetry and Poetics. Princeton, NJ: Princeton University
losum. hose indings suggested that comprehension of many
Press. he best source of information about the worlds poetic poetic devices involved right hemisphere language pro-
traditions. cessing, even though the left hemisphere was known to control
language in most persons: Verbal intelligence tests of the isolated
left hemispheres of commissurotomy subjects fell in the normal
POETIC LANGUAGE, NEUROBIOLOGY OF
range, while subjects experienced aphasia after left (but rarely
Not unlike the elephant approached by a delegation of blind right) hemisphere damage. However, over time, tests of right-
men, each of whom investigated a body part seemingly unre- hemisphere-damaged (RHD) subjects revealed subtle linguistic
lated to the others, the neurobiology of poetic language has deicits in comprehending poetic devices such as metaphor or
been approached from such widely varying perspectives that connotation, while other studies showed that the isolated right
the results hardly seem to share tusks and a tail. One can peer hemisphere recognized certain concrete nouns (i.e., images),
into ancient poetry in search of evidence that consciousness has vowel sounds (i.e., assonance), and emotional prosody in spo-
changed over time, or explore cross-cultural poetics for clues ken or written language all important for understanding poetry
to common neural processing mechanisms. One can map the (Kane 2004).
regions of the brain involved in the processing of poetic devices, For example, Ellen Winner and Howard Gardner (1977) had
or pursue the question of how poets and nonpoets may difer in left-hemisphere-damaged (LHD), RHD, and control subjects
their neural functioning. match a spoken expression such as He has a heavy heart to one
627
Poetic Language, Neurobiology of
of four pictures, with the correct response being metaphoric. To to normal subjects, and from behavioral tests to technology-as-
their surprise, RHD patients performed poorly, often selecting sisted observations. In the 1970s and 1980s, RHD subjects per-
the literal match for example, an illustration of someone carry- formed poorly on connotative word meaning tests, while LHD
ing a giant heart. Similar results were obtained from metaphoric subjects experienced problems with denotation (Gardner and
word-matching studies. hen G. Bottini and colleagues (1994) Denes 1973; Brownell, Potter, and Michelow 1984; Drews 1987).
used PET (positron emission tomography; see neuroimag- One might have assumed that the left hemisphere processed
ing) to scan normal brains processing literal and metaphoric denotation and the right, connotation, but over time, a more
sentences; blood low (signaling brain activation) increased in complex picture emerged. Christine Chiarello and others estab-
six regions of the RH when metaphoric but not literal sentences lished, using visual-ield testing, that primary and subordinate
were being processed. he right hemispheres role in controlling word meanings are initially activated in both hemispheres, but
metaphor seemed obvious or was it? that subordinate meanings are quickly suppressed in the left
As advances in technology have made fMRI (functional mag- hemisphere, resulting in a more eicient processing time for the
netic resonance imaging) studies of normal linguistic processing dominant meaning not unlike Gioras graded salience model,
possible, the results have raised as well as answered questions. where the most salient meaning of an expression, metaphoric or
It is now known that conventional or frozen metaphors are not, gets processed irst and faster than a less commonly occur-
processed much like ordinary denotative language, primarily ring meaning (Chiarello and Maxield 1995).
in the left hemisphere, whereas novel metaphors as well as Finally, the neurobiology of poets may play a signiicant role
ironies and the literal meanings of idioms light up additional in the neurobiology of poetic language. Poets are known to sufer
regions of the right hemisphere (Giora et al. 2000; Mashal, Faust, from afective disorders in particular, hypomania and bipolar
and Hendler 2005; Sotillo et al. 2005; Eviatar and Just 2006, Faust illness at rates far exceeding those of the general population
and Mashal 2007). hanks to fMRI, the precise brain regions or other categories of writers (Andreasen 1987; Jamison 1989;
involved in novel metaphoric processing can be pinpointed: the Ludwig 1994; Post 1996). Feeling negative emotion strongly,
right homologue of wernickes area, right and left premotor being introspective, and spending time alone are traits associ-
areas, right and left insula, and brocas area (Mashal, Faust, ated with expressive writing as well as mental dysfunction, and
and Hendler 2005). Of course, novel and not conventional met- mentally ill persons may feel drawn to express their anguish in
aphors are the stuf of poetic language, unless ones deinition writing; James Kaufman and John Baer (2002) propose these
of poetry extends to greeting card verse, and so the role of the and other behavioral explanations for the poetry/afective dis-
right hemisphere remains signiicant. It was at irst assumed that order connection. Taking a neurobiological approach, Felix
the right hemispheres increased involvement in novel meta- Post (1996) suggests that the intensive intellectual and emo-
phoric processing corresponded to visuospatial processing of tional efort involved in writing poetry may trigger overactivation
evoked imagery, whereas conventional metaphors were unlikely of neural networks and, thus, cause mental illness. Julie Kane
to evoke pictures in the mind. However, Rachel Gioras graded (2004) suggests the opposite, that overactivation may precede
salience hypothesis (1997; Giora et al. 2000), which assumes that poetic output: Pointing to substantial evidence that handed-
the most common or salient meaning of an expression is pro- ness and dominance for language can shift temporarily from the
cessed irst, regardless of whether it is literal or metaphoric, and left to right hemisphere during manic episodes, she proposes
that right hemisphere language processing regions get recruited that abnormal mood elevation may activate right-brain regions
only when secondary meanings must be accessed, provides an involved in processing poetic language. Recently, too, Dawn
alternate explanation. Blasko and Victoria Kazmerski (2006) have shown that poets and
Concrete nouns are the building blocks of poetic images, and nonpoets difer in the brain regions that they activate while read-
preliminary studies of commissurotomy patients led by Michael ing poems.
Gazzaniga showed that the isolated right hemisphere was capa- here is a vast amount of territory yet to be covered in explor-
ble of recognizing simple nouns. Subsequent tests of normal sub- ing the elephant of poetic language, complicated by the fact
jects, isolating either the right or left visual ield, suggested that that new research indings often seem to challenge the old. But
the left hemisphere excelled at processing abstract nouns and as neuroimaging techniques become more precise and less inva-
low-imagery nouns, adjectives, and verbs, while the right per- sive, illuminating features that could only be guessed at before,
formed as well as the left in processing high-imagery nouns and one thing becomes increasingly clear: he neurobiology of poetic
adjectives. Once again, neural-imaging studies have revealed a language is not the same animal as the neurobiology of ordinary
more nuanced model than the simple association of left with language.
words and right with pictures (Kiehl et al. 1999). Marcel Just
Julie Kane
and his colleagues (1996) and Jean Franois Demonet, Guillaume
hierry, and Dominique Cardebat (2005) suggest that as cognitive
processing increases in complexity, right-hemispheric regions WORKS CITED AND SUGGESTIONS FOR FURTHER READING
get recruited to handle the additional demand. hat hypothesis Andreasen, Nancy. 1987. Creativity and mental illness: Prevalence
does not necessarily conlict with behavioral data showing the rates in writers and their irst-degree relatives. American Journal of
right hemisphere to be poor at processing abstractions but good Psychiatry 144: 128892.
at processing concrete nouns on its own. Blasko, Dawn, and Victoria Kazmerski. 2006. ERP correlates of individual
Studies of connotation, another essential element of poetic diferences in the comprehension of nonliteral language. Metaphor
language, have followed a similar trajectory from brain-damaged and Symbol 21.4: 26784.
628
Poetic Language, Neurobiology of Poetic Metaphor
Bottini, G., R. Corcoran, R. Sterzi, E. Paulesu, P. Schenone, P. Scarpa, R. Post, Felix. 1996. Verbal creativity, depression and alcoholism: An inves-
Frackowiak, and C. Freith. 1994. he role of the right hemisphere in tigation of one hundred American and British writers. British Journal
the interpretation of igurative aspects of language: A positron emis- of Psychiatry 168: 54555.
sion tomography study. Brain 117: 124153. Sotillo, Maria, Luis Carreti, Jos Hinojosa, Manuel Tapia, Francisco
Brownell, Hiram, Heather Potter, and Diane Michelow. 1984. Sensitivity Mercado, Sara Lpez-Mrtin, and Jacobo Albert. 2005. Neural activ-
to lexical denotation and connotation in brain-damaged patients: A ity associated with metaphor comprehension: Spatial analysis.
double dissociation. Brain and Language 22: 25365. Neuroscience Letters 373: 59.
Chiarello, Christine, and Lisa Maxield. 1995. Initial right hemispheric Turner, Frederick, and Ernst Pppel. 1989. he neural lyre: Poetic meter,
activation of subordinate word meanings is not due to homotopic cal- the brain, and time. In Expansive Poetry: Essays on the New Narrative
losal inhibition. Psychonomic Bulletin and Review 2: 37580. and the New Formalism, ed. Frederick Feinstein, 20954. Santa Cruz,
Demonet, Jean-Franois, Guillaume hierry, and Dominique Cardebat. CA: Story Line.
2005. Renewal of the neurophysiology of language: Functional neu- Winner, Ellen, and Howard Gardner. 1977. he comprehension of met-
roimaging. Physiological Reviews 85: 4995. aphor in brain-damaged patients. Brain 100: 71729.
Drews, Etta. 1987. Qualitatively diferent organizational structures of
lexical knowledge in the left and right hemispheres. Neuropsychologia
25: 41927.
POETIC METAPHOR
Eviatar, Zohar, and Marcel Just. 2006. Brain correlates of discourse pro- Since Aristotles irst articulation of a comparative theory of
cessing: An fMRI investigation of irony and conventional metaphor metaphor, metaphor studies in literary and ordinary language
comprehension. Neuropsychologia 44.12: 234859.
have proceeded without interruption in philosophy, rhetoric,
Faust, Miriam, and Nira Mashal. 2007. The role of the right cere-
linguistics, and literary criticism. Two traditions have emerged
bral hemisphere in processing novel metaphoric expressions
in metaphor theory: conceptual and linguistic traditions. he
taken from poetry: A divided visual field study. Neuropsychologia
45.4: 86070. conceptual view emphasizes metaphors fundamental role in
Gardner, Howard, and Gianfranco Denes. 1973. Connotative judge- everyday thought and language; the linguistic tradition lim-
ments by aphasic patients on a pictorial adaptation of the semantic its the range of metaphor to local pragmatic and aesthetic
diferential. Cortex 9: 18396. functions (Ortony 1993). he range of accounts within both
Giora, Rachel. 1997. Understanding igurative and literal language: he traditions is variegated and well beyond the scope of this entry.
graded salience hypothesis. Cognitive Linguistics 7.1: 183206. Nonetheless, for the language sciences, it seems that the con-
Giora, Rachel, ed. 2007. Is metaphor unique? Neural correlates of nonlit- ceptualist tradition has dominated in recent years. he present
eral language. Brain and Language 100.2 (Special Issue). discussion assumes a conceptual view of metaphor as under-
Giora, Rachel, Ofer Fein, Ann Kronrod, Idit Elnatar, Noa Shuval, and
stood through the frameworks of conceptual metaphor
Adi Zur. 2004. Weapons of mass distraction: Optimal innovation and
theory (CMT) and conceptual blending theory (CBT),
pleasure ratings. Metaphor and Symbol 19: 11541.
Giora, Rachel, Eran Zaidel, Nachum Soroker, Gila Batori, and Asa
where poetic metaphor is regarded as a special case of these
Kasher. 2000. Diferential efects of right- and left-hemisphere dam- underlying conceptual operations. At present, poetic or literary
age on understanding sarcasm and metaphor. Metaphor and Symbol metaphor cannot be easily extracted from the central questions
15: 6383. of metaphor theory in general, namely: What is metaphor, and
Jamison, Kay Redield. 1989. Mood disorders and patterns of creativity what is metaphor for? he present discussion merely touches on
in British writers and artists. Psychiatry 52: 12534. the irst question, in favor of a more elaborate treatment of the
Jaynes, Julian. [1978] 1990. he Origins of Consciousness in the Breakdown second question.
of the Bicameral Mind. 2d ed. Boston: Houghton. he question of what metaphor is and how poetic metaphor
Just, Marcel, Patricia Carpenter, Timothy Keller, William Eddy, and Keith can help language sciences understand the everyday mind and
hulborn. 1996. Brain activation modulated by sentence comprehen-
language is addressed in the irst section, where I compare and
sion. Science 274.5284: 11416.
contrast the two models of metaphor. he question of what met-
Kane, Julie. 2004. Poetry as right-hemispheric language. Journal of
Consciousness Studies 11.5/6: 2159.
aphor is for is addressed in the second section.
Katz, Albert, ed. 2006. Metaphor and Symbol 21.4. Special issue on neural
processing of nonliteral language. What Is Metaphor?
Kaufman, James, and John Baer. 2002. I bask in dreams of sui- CMT purports to unearth the systematic correlations of experi-
cide: Mental illness, poetry, and women. Review of General Psychology ence and meaning. Meaning arises from everyday experience.
6.3: 27186. Abstract notions such as time, causation, states, change, and
Kiehl, Kent, Peter Liddle, Andra Smith, Adrianna Mendrek, Bruce Forster, purposes depend on a rich system of metaphors. Metaphor is
and Robert Hare. 1999. Neural pathways involved in the processing of the name given to the process of conceptual mappings from
concrete and abstract words. Human Brain Mapping 7: 22533. source to target domains (see source and target). he latest
Ludwig, Arnold. 1994. Mental illness and creative activity in women
incarnation of CMT (Lakof and Johnson 1999) builds on Joseph
writers. American Journal of Psychiatry 151: 16506.
Gradys (1997) theory of primary metaphor, in which the onto-
Mashal, Nira, Miriam Faust, and Talma Hendler. 2005. he role of
genetically basic process of domain correlation constitutes the
the right hemisphere in processing nonsalient metaphorical mean-
ings: Application of principal components analysis to fMRI data. experiential basis of conceptual metaphors. A primary metaphor
Neuropsychologia 43.14: 2084100. is a correlation of subjective experience with a more abstract
Mashal, Nira, Miriam Faust, Talma Hendler, and Mark Jung-Beeman. concept. For instance, MORE IS UP is a primary metaphor, based
Processing salient and less-salient meanings of idioms: An fMRI on the tight ontological correlation between the accumulation of
investigation. Cortex. In press. the same entities and vertical height.
629
Poetic Metaphor
CBT, while not a theory of metaphor, accounts for metaphor (8) Who made each mast, and sail and rope;
as a species of conceptual blending that often involves the inte- (9) What anvils rang, what hammers beat;
gration of concepts that do not normally go together. CBT takes
(10) In what a forge and what a heat
a decidedly usage-based perspective to metaphor and other
phenomena, in which systematic correlations arise from con- (11) Were shaped the anchors of thy hope.
ceptual blending itself, the process of constructing new scenes (12) Fear not each sudden sound and shock,
and scenarios with speciic emergent properties from multiple (13) Tis of the wave, and not the rock;
mental models. he aim is to see how metaphors arise on the
(14) Tis but the lapping of a sail,
ly as we think and talk. CMT has as its basic unit of cognitive
structure the conceptual domain. CBT has as its basic unit of (15) And not a rent made by the gale.
organization a mental space, or scenes and scenarios set up as (16) In spite of rock and tempests roar,
we think, talk, and otherwise interact. CBT models the dynamic (17) In spite of false lights on the shore,
unfolding of a language users representations. In this respect,
(18) Sail on, nor fear to breast the sea,
CBT has developed analytic routines and modeling techniques
that capture constitutive principles and governing constraints of (19) Our hearts, our hopes are all with thee;
blending. (See Fauconnier and Turner 2002, 30952.) (20) Our hearts, our hopes, our prayers, our tears,
(21) Our faith triumphant oer our fears
What Is Metaphor For?
(22) Are all with thee, are all with thee.
his question has no straightforward answer, but Samuel Levin
provides an initial approximation by suggesting that these onto- A CMT analysis begins by positing cross-domain mappings
logically bizarre notions are constructed for the purpose of between the source domain of ships and the target domain of
conceiving what a world would have to be like were it in fact to states or nation-states. he conventional mappings between
comprise such states of afairs (1993, 121). Levin suggests that source and target domains include the following correspon-
we construct worlds in which the metaphor is literally true, but dences ofered in Grady, Oakley, and Coulson (1999, 109):
only in order to tease out inferences that guide reasoning about
the real world. Consider the opening line of John Miltons poem, Nation-State = Ship
On Time, in which the poet commands: Leader = Ships captain
Fly envious Time, till thou run out thy race, National policies/actions = Ships course
his conceit depends on the conventional metaphoric mapping National success/improvement = Forward motion of the
TIME IS A MOVER, creating a world in which time is literally an ship
intentional being running a race, the purpose of which is to focus National failures/problems = Sailing mishaps
attention on the theological implications of speeding up the pace Circumstances affecting the = Sea conditions
at which the known world ends. In the poets world, the notion nation
of time as running a race can be considered preternatural, but
the theological implication of the end of days is the great truth All these metaphoric mappings derive from the basic pri-
to be disclosed. In a similar vein, consider now the conventional mary metaphoric couplings of ACTION-AS-SELF-MOTION,
metaphor STATES ARE SHIPS. COURSES-OF-ACTION-AS-PATHS, SOCIAL-RELATIONSHIPS-AS-
he text in question is the sermon, he Negro Element in DEGREES-OF-PHYSICAL-PROXIMITY,andCIRCUMSTANCES-AS-
American Life: An Oration. Delivered by Reverend A. L. DeMond WEATHER. hese experiential correlations (and perhaps others)
on January 1, 1900, this oration illustrates the degree to which interact in a way that motivates the framing of a nation and its his-
a conventional metaphor can be extended and elaborated. he tory as a ship gliding through water.
reverend ends with a poem that makes elaborate use of the Ship- As George Lakof and Mark Turner (1989, 6772) argue, the
of-State metaphor, a potentially disastrous rhetorical maneuver, power of poetic metaphor, in particular, issues from the exten-
given the history of the forced importation of Africans. he ser- sion of these mappings for local expressive purposes. A conven-
mon ends thus: tionalized metaphor never gives you all you need, and poetic
thought is marked by its ability to stretch or extend conventional
As the old ship of State sails out into the ocean of the 20th century,
metaphors. Notice that with lines 67, DeMond extends the typi-
the Negro is on board, and he can say:
cal range of mappings to include ship building and the role
shipwright.
(1) Sail on, O ship of State,
Poems also employ expressions in which the schemas and
(2) Sail on, O Union, strong and great; domains underlying the metaphor can be elaborated in unusual
(3) Humanity, with all its fears, or novel ways. Lines 6 and 7, when understood against the con-
(4) With all the hope of future years text of the whole speech, take on unusual signiicance. he impli-
cation of line 6 is that the master shipwright is God, while the
(5) Is hanging breathless on thy fate.
workman is identiied with the Negro, echoing a consistent
(6) We know what master laid thy keel, theme of the speech the hard labor of the Negro race in build-
(7) What workman wrought thy ribs of steel; ing America.
630
Poetic Metaphor Poetics
A blending analysis helps account for ways in which the been conventionalized for just that purpose because the image
NATION-AS-SHIP metaphor is not a simple and obvious map- potential associated with building, operating, and navigating is
ping between two conceptual domains. While conceptual of rich social activities. In the blended space, however, the choice
domains name large depositories of knowledge about the physi- to sail on is framed as an all-or-nothing proposition. If the ship of
cal and social world, mental spaces comprise on-line scenes and state does not sail, it ceases to exist. In the sailing space, however,
scenarios; they are speciic and sensitive to pressures from local the ship, once built, exists whether or not the crew sails; in the
context. sailing space, a captain and crew can choose when and when not
Levins (1993) account of poetic metaphor is more com- to sail, and the crew can still be referred to as sailors whether on
pletely captured in CBT, a theoretical framework in which pre- land or on sea. In the blend, a refusal to board and sail is tanta-
ternatural scenes are constructed to reveal how to reason and mount to renouncing ones citizenship. By exploiting elements
draw inferences about something else. Conceptual blends are of the shipwright (a collective activity) and by attributing that
often richly counterfactual, but rarely do they exist for their activity to a divine creator, DeMonds version of the ship of state
own ends. In this case, the blended scenario extends and elabo- takes on the voice of a divine decree.
rates the conventional metaphor for local rhetorical purposes. A As suggested, DeMond takes considerable risk in quoting a
basic blending analysis of DeMonds introductory sentence and poem that makes extensive use of this metaphor, for members of
the irst ive lines of the poem proper would include a discourse the congregation may generate a metaphoric mapping in which
ground specifying the participants, the situation, and setting, a the cross-domain counterpart of American Negro is not passen-
mental space for Seafaring, a mental space for Nation, and the ger but cargo, destroying the political legitimacy of the image.
initial blended space for Nation-as-Ship, each of which is set up DeMond, however, assiduously avoids focusing any attention
in the very irst line of the poem. on the circumstances that brought them to America. Instead, he
Let us assume the analysis from the perspective of a worshiper picks up the story at their arrival and tells of the Negro race as
sitting in the Dexter Avenue Baptist Church in 1900. Under these those who built the nation.
conditions, the ground includes the identities of the churchgo- he present analysis presents CMT and CBT as complemen-
ers, the speaker, and the setting. Let us further assume that the tary analytic frameworks, wherein the irst focuses solely on con-
discourse participants are African Americans and that the poetic ventionalized mappings, while the latter is much more interested
persona represents them. Initially, the Seafaring and Nation in how these mappings operate in local rhetorical contexts, and
spaces project conceptual structure into the blend under the thus can point scholars in the direction of a usage-based theory
inluence of the cross-space mappings as speciied here. In the of poetic metaphor.
blend, America is a ship, Negro citizens are among its passen-
Todd Oakley
gers, the ocean is time, and the twentieth century is an unspeci-
ied landmark on open water. he blend allows the audience WORKS CITED AND SUGGESTIONS FOR FURTHER READING
to imagine temporally, causally, and spatially difuse political
events as attaining, for the moment, the look and feel of primary DeMond, A. L. 1900. he Negro element in American life: An oration.
Available online at: http://lcweb2.loc.gov/cgi-bin/query/r?ammem/
experience.
murray:@ield(DOCID+@lit(lcrbmrpt0e10div2).
Once composed, the blend and the network of mental spaces
Fauconnier, Gilles, and Mark Turner. 2002. he Way We hink. New
permit the addition of new information and relations. A notewor- York: Basic Books.
thy contribution of the blending framework here is that it ofers Grady, Joseph. 1997. Foundations of meaning: Primary metaphors and
precise ways of accounting for the elements of the Nation-as- primary scenes. Ph.D. diss., University of California, Berkeley.
Ship image that have no speciic counterparts in the target space Grady, Joseph, Todd Oakley, and Seana Coulson. 1999. Blending and
of nations and politics. Once the network is up and running, metaphor. In Metaphor in Cognitive Linguistics, ed. F. Gibbs and G.
readers can combine concepts luidly. For instance, line 2 com- Steen, 10124. Amsterdam: Benjamins.
mands, Sail on, O Union, strong and great, wherein the poet Lakof, George, and Mark Johnson. 1999. Philosophy in the Flesh. New
fuses elements from diferent mental spaces into tight syntactic York: Basic Books.
units. hus, in the blend it is perfectly natural and logical for a Lakof, George, and Mark Turner. 1989. More han Cool Reason: A Field
Guide to Poetic Metaphor. Chicago: University of Chicago Press.
union to sail. What is more, it is perfectly natural for the ship to
Levin, Samuel. 1993. Language, concepts, and worlds. In Metaphor
plot a straight course. Once the image is created, many other ele-
and hought. 2d ed. Ed. A. Ortony, 11223. Cambridge: Cambridge
ments of ships become mentally accessible. For instance, ships University Press.
must be made of particular materials in order to be seaworthy. Ortony, Andrew. 1993. Metaphor, language, and thought. In Metaphor
he phrase, ribs of steel, in line 6 satisies local formal and con- and hought. 2d ed. Ed. A. Ortony, 116. Cambridge: Cambridge
ceptual imperatives in 1) providing completion for the couplet University Press.
with line 5, and 2) suggesting that the nation is made of sturdy
material and (opportunistically) made from the very material
POETICS
that the Negro worker has been responsible for manufacturing.
Importantly, the mapping between Shipwright and Creator is In ancient Greece, as Aristotle pointed out, there was no com-
responsible for all aspects of the nation. mon name for all the diferent poetic genres (Poetics, 47b),
he goal of using this conventional metaphor is to construct including epic, tragic drama, dialogue, elegy, and poems writ-
a view of social reality for the Negro race, focusing on communal ten in various meters. Poetry in the sense of making or creation
activities and on achieving collective goals. he ship of state has became the general name for literary expressions in diverse
631
Poetics Point of View
forms, and Poetics, the term used by Aristotle for his treatise on Literary Mind and the Carving of Dragons is deservedly famous
tragedy and epic, thus represented the kind of critical and ana- as the most systematic study of the literary art in the Chinese
lytical treatment of poetry that would be called in later times lit- critical tradition. his substantial work of Chinese poetics relates
erary criticism or literary theory. literature to the cosmic tao and the exemplary classics of ancient
Aristotles Poetics ofers an important model in Western liter- sages, thereby elevating literature to a position of high social and
ary criticism, but it was not widely known in Europe in antiquity moral values. Its focus, however, is on the art of literature. he
or in medieval times, and it did not become a classic until the Literary Mind irst formulates some basic principles of the idea of
latter half of the sixteenth century. During the time that it was lost wen or literature, gives a survey of all the literary genres in clas-
in medieval Europe, however, the Poetics, along with some other sical Chinese literature, commenting on their origin and devel-
works by Aristotle, was being studied by Arabic scholars, notably opment, and then presents a highly developed theory of literary
Ibn Rushd, known in the West as Averros. But once it was redis- creation, making contributions to the important issues of the
covered and commented on by such inluential Renaissance crit- relationship between poetry and reality, the style and character-
ics as Lodovico Castelvetro (150571) and Francesco Robertello istics of a literary work, the efect of imagery and poetic imagina-
(151667), the Poetics quickly became one of the most inluential tion, and the regulations of metric composition. Since the eighth
works in Western literary criticism. Epic and tragedy discussed century in Tang China, and particularly the eleventh century in
therein became the two major classical genres before the rise the Song Dynasty, there have been numerous works in a critical
of the modern novel and, after Dante, poets of every European genre known as remarks on poetry (shihua), which often contain
nation tried to create an epic in the vernacular to mark the matu- valuable insights into the nature of poetry, the techniques of the
rity of a modern language and the establishment of a national literary art, and the principles of aesthetic appreciation. Like the
literary tradition. Aristotles philosophical treatment of plot, lan- aforementioned Indian example, the Chinese critical tradition
guage, and rhetoric of the tragic drama provides a model of criti- also ofers an alternative form of poetics outside the Aristotelian
cal analysis, and many basic concepts used in the Poetics, such and European tradition.
as imitation, recognition, the reversal of fortune, tragic hubris, In a broad sense, then, poetics can be understood as a crit-
and the catharsis of pity and fear, have all had a tremendous ical, theoretical, and more or less systematic treatment of poetry
inluence on later criticism. In our own time, Aristotles Poetics or literature in general. In such an expanded usage, what the
remains a major classic and continues to be discussed and com- term signals is a theoretical discourse on a subject in arts or lit-
mented on by important critics and theoreticians from various erature, covering a considerable range of oeuvre, and ofering
perspectives. some philosophical insights into the nature of the subject under
As the aforementioned Arabic commentaries suggest, the discussion. Poetics, therefore, becomes a general term for a sus-
systematic study of the literary art is by no means conined to tained argument or a long essay in literary and art criticism.
the European tradition. here are, for example, well-established
Zhang Longxi
traditions of sophisticated literary criticism or poetics in South
and East Asia. he earliest treatise on dance and dramatic art in
ancient India, Bharatamunis Ntyastra (ca. second cent. b.c.),
ofers a comprehensive discussion of Sanskrit drama in terms of Aristotle. 1987. Poetics with the Tractatus Coislinianus, Reconstruction
taste and emotions (rasa) and of language and bodily gestures of Poetics II, and the Fragments of the On Poets. Trans. Richard Janko.
that give expression to various emotions. In the seventh century, Indianapolis: Hackett.
Averros. 1977. hree Short Commentaries on Aristotles Topics,
Sanskrit poetics was fully established by such important theorists
Rhetoric, and Poetics. Ed. and trans. Charles E. Butterworth.
as Bhmaha and Dandin. In the ninth century, nandavardhana
Albany: State University of New York Press.
made signiicant contributions to its further development with De, S. K. 1988. History of Sanskrit Poetics. Calcutta: Firma KLM Pvt. Ltd.
discussions of the theoretical notions of rasa and dhvani, while Liu, James J. Y. 1975. Chinese heories of Literature. Chicago: University
Abhinavagupta and Kuntaka in the tenth century explored new of Chicago Press.
areas by debating on the issue of indirect and suggestive expres- . 1988. Language-Paradox-Poetics: A Chinese Perspective. Ed.
sions (vakrokti) in poetic language. Indeed, as an Indian scholar Richard John Lynn. Princeton, NJ: Princeton University Press.
remarks, A study of Sanskrit poetics from Bharata (5th century Miner, Earl. 1990. Comparative Poetics: An Intercultural Essay on heories
b.c.) to Panditarja Jaganntha (17th century a.d.) will bear wit- of Literature. Princeton, NJ: Princeton University Press.
ness to the existence of a highly developed poetics in ancient Pathak, R. S. 1998. Comparative Poetics. New Delhi: Creative Books.
India, with a rigorous scientiic method for description and anal- Rajendran, C. 2001. Studies in Comparative Poetics. Delhi: New Bharatiya
Book Co.
ysis of literature (Pathak 1998, 345).
In China, the Great Preface to the Mao edition of the Book
of Poetry (second century b.c.) articulated the Confucian ideas
POINT OF VIEW
about poetry and its functions, and laid the foundation of a poet-
ics that both acknowledges the release of emotions as the origin In narrative studies (see narratology), this term, also per-
of poetry and the eicacy of moral teaching as its ultimate justii- spective or focalization, refers to textual strategies that provide
cation. Lu Ji (261303), with his Rhyme-Prose on Literature added the reader with the illusion of seeing things through the eyes
to the critical tradition a more focused attention on the impor- of a character. hese strategies are mostly linguistic in nature,
tance of emotions (qing), and he argued for the necessity to learn ranging from deictic positioning in the characters mental here
both from nature and from the ancients. Liu Xies (465?520?) and now (see deixis) to lexical choices linking up with the
632
Point of View
characters worldview and ways of thinking and perceiving the Friedman in looking at types of narrative. Boris Uspensky ([1973]
world. Point of view, from a linguistic perspective, is therefore an 1983), too, extends the meaning of point of view under his term
important aspect of linguistic pragmatics. perspective to include a) vision (spatio-temporal perspective); b)
language (phraseological perspective); c) knowledge and feelings
Origins and History (psychological perspective); and d) ideology. Although these
he strategies of point of view narration are of fairly recent date. four types of perspective are all determinable from the language
hey came into existence as part of the shift toward increasingly of the text (the spatio-temporal perspective through deictics; the
subjective literary narratives near the end of the nineteenth cen- ideological through tell-tale phrases like tovarish [comrade]
tury, and document authors attempts to portray characters for fellow man; the psychological through the syntax and
individuality not merely in the rendering of idiosyncratic dia- lexis of emotion), it is the phraseological level of perspective that
logue (for instance, in dialect) but also in the extensive depic- is most linguistic in its deployment of register and style to
tion of characters minds or consciousness. Already in the 1790s, signal narrators or characters perspectives, for instance, in the
the Gothic novel igured the female protagonists anxious medi- citing of dialect words, hints at pronunciation typical of certain
tations, and Ann Radclife and Charles Maturin also portrayed social groups, or the contrast between high and low register in
the impact that the contemplation of sublime scenery had on heteroglossic texts (Bakhtin 1981; see dialogism and het-
their heroines. Gothic novels are, therefore, important anticipa- eroglossia). For instance, Uspensky cites Tolstoys sentence
tions of the point of view technique which, in English literature, Anna Pavlova had been coughing for the last few days: she had
came into its own in the work of George Eliot, Elizabeth Gaskell, an attack of la grippe, as she said (1983, 33) as an example of
homas Hardy, Joseph Conrad, and Henry James and in the phraseological point of view, where la grippe registers Anna
stream of consciousness novels of literary modernism (James Pavlovas class and social snobbery. Psychological perspec-
Joyce, Virginia Woolf, Katherine Mansield, D. H. Lawrence, tive can be exempliied by a sentence from Toni Morrison: He
Aldous Huxley, E. M. Forster). Since then, the point of view tech- examined the bushes, the branches, the ground for a berry, a nut,
nique has been standard in ictional narrative, especially so in anything (1977, 255; emphasis added). he sentence traces the
the short story, even though postmodernist texts of the radical order of Milkmans order of perception and the urgency of his
experimental sort do not employ it as often. quest for food. Uspensky devotes a whole chapter to the inter-
he term point of view (interchangeably with center of vision) relation of the four types of perspective in texts.
was irst used by Percy Lubbock in he Craft of Fiction (1921), Grard Genettes reconceptualization of point of view as focal-
although Henry James in his prefaces had already analyzed the ization (zero; external; internal) abides by the visual metaphor,
phenomenon under the heading of center of consciousness with focalization opposed to voice (who sees? vs. who speaks?
and the image of the house of iction having many windows [1972] 1980, 186). Genettes typology of focalization is one of
(James [18801/1908] 1975, 7). Point of view in James refers to limited perspective either no limitation of point of view (zero
the presentation of the story from the perspective, hence point of focalization) or limitation to a view on characters from outside
view, of a character, for instance, Strether in he Ambassadors: It (external focalization) or a subjective view from inside (internal
afected Strether: horrors were so little supericially at least in focalization). he narrator and narrative voice are excluded from
this robust and reasoning image. But he was none the less there the discussion, in contrast to Friedmans or Stanzels analyses.
to be veracious (James [1903] 1994, 99). However, even in James, More recent models of focalization are discussed by Manfred
the point of view technique, in the meaning of limited perspec- Jahn (2005), who has himself proposed the distinction among
tive (seeing the world through the naive, obsessed, or puzzled strict, ambient, weak, and zero focalization based on an optical
perspective of a character), is extended from the new narrative analogy.
form of the (third person) stream of consciousness novel (follow-
ing the protagonists associations in the depiction of their con- Linguistic Signals of Point of View
sciousness) to experiments with unreliable or otherwise limited he textual inscription of point of view depends on the insertion
irst person narrators, as in Jamess he Real hing or Daisy of signals of subjectivity and individual knowledge, opinion, or
Miller. In these texts, the narrator is very naive, and has a clearly worldview in the text such that they can be aligned with a char-
reduced intellectual capacity. For Lubbock (1921), Jean Pouillon acter. he same signals can also be employed to relate the sub-
(1946), and Norman Friedman (1955), the term, by contrast, jectivity or individual stance of the speaker/narrator of a text/
comprises not one technique of focusing the narrative through a utterance, and this alignment is usually discussed under the
central characters mind but a variety of three (Pouillon) to eight heading of voice and not point of view. Voice and point of view
(Friedman) alternative points of view that authors can choose. can get into conlict or overlap as in free indirect discourse, a tech-
Not only was point of view a vague term because it included nique for rendering speech or thought in which the language of
so many diferent aspects of narrative; it, moreover, was very the reported speaker/thinker (his/her point of view) is to some
limiting since it focused on the visual metaphor. As a conse- extent preserved in the report: She had never, ever told ibs, not
quence, the term point of view, though still used as a general for worlds. (Here, the syntax and vocabulary of the reported
label, became displaced in narratology by more inclusive or more speaker are integrated into the report.)
speciic terms: narrative situation, perspective, and focalization. Free indirect discourse (thought representation) is one of the
Franz Karl Stanzels ([1979] 1984) three narrative situations most common signals of point of view in literary texts since it
([1] authorial roughly: omniscient; [2] irst person; and [3] introduces a characters perspective (feelings, intentions, world-
igural the presentation through a characters mind) follows view) to the reader. Moreover, the narrative can be studded with
633
Point of View
stylistic and lexical markers relating to the characters social models of speech that are employed to create an illusion of
position, age, gender, and so on. For instance, when in Charles authenticity. Moreover, the attribution of expressivity markers
Dickenss Our Mutual Friend Mrs. Veneering remarks that to the primary frame speaker (narrator) or reported speaker
these social mysteries make one afraid of leaving Baby ([18645] (character) is frequently problematic. he mere presence of
1952, 414; emphasis added), the word Baby relates to the moth- expressivity markers does not convey a clear point of view;
erchild relationship of the reported speaker and represents her point of view needs to be constructed interpretatively by the
point of view. At the same time, the phrase these mysteries and listener or reader in the overall context of the utterance or text.
the pronoun one underline Mrs. Veneerings upper-class status. hus, though point of view can be fruitfully analyzed by linguis-
Addressee-oriented expressions like forms of address (Maam, tic means, it cannot be exhaustively described within a purely
Sir, Your Excellency, etc.) also invoke social position by linguistic formal framework. Point of view, therefore, is a pragmatic phe-
means (cf. Fillmore 1983, 1997). nomenon located on the threshold between narrative pragmat-
Most basically, deictics serve the function of positioning ics and literary narratology (see also literary character
speakers and, hence, creating point of view. For example, in and character types).
Bleak House Mr. Bucket (still grave) inquires if to-morrow morn-
Monika Fludernik
ing, now, would suit ([18523] 1962, 720; emphasis added),
in which the futurity of to-morrow relates to Mr. Buckets WORKS CITED AND SUGGESTED FURTHER READING
moment of utterance. Among linguists, Charles Fillmores work
on deixis (1983, 1997) needs to be credited with incisive insights Bakhtin, Michael M. 1981. he Dialogic Imagination: Four Essays. Ed.
Michael Holquist. Austin: University of Texas Press.
into the generation of point of view by means of deixis. From a
Bhler, Karl. 1934. Sprachtheorie. Die Darstellungsfunktionen der Sprache.
linguistic perspective, these signals are expressivity markers,
Jena: Gustav Fischer.
implying a speaking or thinking consciousness, a deictic center Dickens, Charles. [18523] 1962. Bleak House. London: Oxford University
(Bhlers [1934] origo) from which the world is being viewed. In Press.
the widest possible sense, such expressivity markers are indica- . [18645] 1952. Our Mutual Friend. London: Oxford University
tive of ideation and emotion, the latter capable of being textually Press.
suggested by syntactic means, such as intensifying repetitions Fillmore, Charles. 1983. How to know whether youre coming or going.
besides merely lexical intensiiers and emphatic vocabulary. In Essays on Deixis, ed. Gisa Rauh, 21927. Tbingen: Narr.
Evaluative point of view can be illustrated in sentences like Do . 1997. Lectures on Deixis. Stanford, CA: Center for the Study of
talk to the poor dear. Incomplete sentences (indicating hesi- Language and Information.
tation or derangement), sentence modiiers (in any case, sure Fludernik, Monika. 1993. he Fictions of Language and the Languages
of Fiction: he Linguistic Representation of Speech and Consciousness.
enough), clause-initial adjuncts (oh, well), interjections (good
London: Routledge.
grief), negative inversion (Never will he forget) or left and right
Friedman, Norman. 1955. Point of view in iction: he Development of a
dislocation are among the most common strategies used (cp. Critical Concept. PMLA 70: 116084.
Fludernik 1993, 22779). In oral discourse, moreover, expressiv- Genette, Grard. [1972] 1980. Narrative Discourse: An Essay in Method.
ity shows up in intonation and the echoing of idiosyncratic Trans. Jane E. Lewin. Ithaca, NY: Cornell University Press.
pronunciation (imitated in writing: she sho was happy). Halliday, M. A. K. 1971. Linguistic function and literary style: An inquiry
In medieval literature, point of view is often signaled by inter- into William Goldings he Inheritors. In Literary Style: A Symposium,
jections like alas or by means of repetition. Such signals of point ed. Seymour Chatman, 33065. London: Oxford University Press.
of view occur intermittently in medieval literature and early Harweg, Roland. 1968. Pronomina und Textkonstitution. Munich: Wilhelm
modern English prose but do not constitute a continuous repre- Fink.
sentation of a characters perspective as in the Gothic novel and Jahn, Manfred. 1999. More aspects of focalization: Reinements and
applications. In Recent Trends in Narratological Research, ed. John
the later stream of consciousness novel.
Pier, 21, 85110. Tours: Groupes de Recherches Anglo-Amricaines de
Like the study of narrative discourse markers, the focus on
Tours, University of Tours.
expressivity signals can help to emphasize the speciically narra- . 2005. Focalization. In Routledge Encyclopedia of Narrative
tive uses of point of view for the linguist. Point of view markers not heory, ed. David Herman, Manfred Jahn, and Marie-Laure Ryan,
only establish free indirect discourse; they are, moreover, crucial 1737. London: Routledge.
to text beginnings, where they help distinguish between narra- James, Henry. [18801/1908] 1975. he Portrait of a Lady. Ed. Robert D.
tives with a prominent speaker (= narrator) function and others Bamberg. New York: W. W. Norton.
in which the reader is eased into the story by means of a protago- . [1903] 1994. he Ambassadors. Ed. S. P. Rosenbaum . New
nists perspective. (Roland Harweg [1968] has contrasted these York: Norton.
as emic and etic text beginnings, respectively.) Peculiarities of . [1934] 1953. he Art of the Novel. Intro. R. P. Blackmur. New
York: Scribner.
thought and worldview are also constitutive of M. A. K. Hallidays
Lubbock, Percy. 1921. he Craft of Fiction. London: Jonathan Cape.
mind-style (1971) as the distinctive linguistic representation of
Morrison, Toni. 1977. Song of Solomon. New York: Signet.
individual self (Shen 2005, 312). Ultimately, an analysis of point
Pouillon, Jean. 1946. Temps et roman. Paris: Gallimard.
of view as expressivity links up with the linguistic enquiry into Shen, Dan. 2005. Mind-style. In Routledge Encyclopedia of Narrative
individual style. heory, ed. David Herman, Manfred Jahn, and Marie-Laure Ryan,
It should be noted that all of these signals of expressivity are 31112. London: Routledge.
clichs and cannot directly claim mimetic relevance (Fludernik Stanzel, Franz Karl. [1979] 1984. A heory of Narrative.
1993, 43464). On the contrary, they depend on typical recurrent Cambridge: Cambridge University Press.
634
Politeness
Uspensky, Boris. [1973] 1983. A Poetics of Composition: he Structure of the Politeness as Conversational Maxims
Artistic Text and Typology of a Compositional Form. Trans. Valentina A diferent approach understands politeness as a set of social
Zavarin and Susan Wittig. Berkeley: University of California Press.
conventions coordinate with Paul Grices (1975) coopera-
tive principle for maximally eicient information transmis-
POLITENESS sion (Make your contribution such as required by the purposes
of the conversation at the moment), with its four maxims of
Politeness is essentially a matter of taking into account the feel- quality, quantity, relevance, and manner (see conversational
ings of others as to how they should be interactionally treated, implicature). Robin Lakof (1973) argued that three rules
including behaving in a way that demonstrates appropriate of rapport underlie choices of linguistic expression, rules that
concern for interactors social status and their social relation- can account for how speakers deviate from directly expressing
ship. In this broad sense of speech oriented to an interactors meanings. Choice among the three pragmatic rules gives rise
social persona or face, politeness is ubiquitous in language use. to three distinct communicative styles: Rule 1, Dont impose,
Since taking account of peoples feelings generally involves produces a distant style; Rule 2, Give options, gives rise to a
saying things in a less straightforward or more elaborate man- deferent style; and Rule 3, Be friendly, results in a style of cama-
ner than when one is not considering such feelings, ways of raderie. Geofrey Leechs (1983) proposal is in the same vein.
being polite provide a major source of indirectness, reasons for Complementary to Grices cooperative principle, Leech postu-
not saying exactly what one means, in how people frame their lated a politeness principle, Minimize the expression of impolite
utterances. beliefs, with six maxims of tact, generosity, approbation, mod-
here are many folk notions for these kinds of attention to esty, agreement, and sympathy. As with Grices maxims, devia-
feelings, captured in terms like courtesy, tact, deference, sensibil- tions from what is expected give rise to inferences. Cross-cultural
ity, poise, rapport, and urbanity, as well as terms for the contrast- diferences derive from the diferent importance attached to par-
ing behaviors rudeness, gaucheness, social gafes and their ticular maxims.
consequences, embarrassment or humiliation. Such terms attest he conversational maxim view shares with the social norm
both to the pervasiveness of notions of politeness and to their view the emphasis on codiied social rules for minimizing friction
cultural framing. between interactors and the idea that deviations from expected
Peoples face is invested in their social status and in their levels or forms of politeness carry a message.
relationships with one another, and so indexing this relationship
appropriately is necessary for maintaining face expectations. In Politeness as Face Management
addition, one often has interactional goals that potentially con- A more sociological perspective places face work at the core of
travene face, and the expression of such communicative inten- politeness. Erving Gofman (1967) considered politeness as
tions (e.g., requests, ofers, disagreements, complaints) tends to an aspect of interpersonal rituals, central to public order. He
be mitigated by attention to face. deined face as an individuals publicly manifest self-esteem and
Politeness is crucial to the construction and maintenance proposed that social members have two kinds of face require-
of social relationships; indeed, it is probably a precondition for ments: positive face, or the want of approval from others, and
human cooperation in general. Politeness phenomena have, negative face, or the want not to ofend others. Attention to these
therefore, attracted interest in a wide range of social sciences, face requirements is a matter of orientation to Gofmans dip-
particularly linguistics, anthropology, psychology, sociology, lomatic iction of the virtual ofense, or worst possible reading
and communication. Work in these disparate ields can be char- (1971, 138 f), the working assumption that face is always poten-
acterized in terms of three main classes of theoretical approach. tially at risk, so that any interactional act with a social-relational
dimension is inherently face threatening and needs to be modi-
Politeness as Social Rules ied by appropriate forms of politeness. Deference (attention
To the layperson, politeness is a concept designating proper owed to the others face) can be distinguished from demeanor
social conduct, rules for speech and behavior stemming generally (attention owed to oneself).
from high-status individuals or groups (cf. standardization). Building on Gricean and Gofmanian approaches, Penelope
hese notions range from polite formulae like please and thank Brown and Stephen C. Levinson ([1978] 1987) introduced a
you, codiied forms of greetings and farewells, honoriic address comparative perspective by drawing attention to the detailed
forms, and so on, to more elaborate routines, for example, for parallels in the construction of polite utterances across widely
table manners or the protocol for formal events. Politeness in difering languages and cultures, arguing that universal prin-
this view is conventionally attached to certain linguistic forms ciples underlie the construction of polite utterances. he par-
and formulaic expressions, which may be very diferent in difer- allels they noted are of two sorts: how the polite expression of
ent languages and cultures. utterances is modiied in relation to social characteristics of
Some analytical approaches to politeness are formulated in the interloculors and the situation, and how polite utterances
terms of the same sorts of culture-speciic rules for doing what is are linguistically formulated. At least three social factors are
socially acceptable, for example, the work by Sachiko Ide (1989) involved in deciding how to be polite: 1) One tends to be more
and others on Japanese politeness as social indexing or discern- polite to social superiors; 2) one tends to be more polite to people
ment. In these approaches, politeness inheres in particular lin- one doesnt know. In the irst case, politeness tends to be asym-
guistic forms when used appropriately as markers of pregiven metrical (the superior is less polite to an inferior); in the second,
social categories. politeness tends to be symmetrically exchanged. In addition,
635
Politeness Politics of Language
3) in any culture there are norms and values afecting the degree Goody, Esther. 1995. Social Intelligence and Interaction.
of imposition or unwelcomeness of an utterance, and one tends Cambridge: Cambridge University Press.
to be more polite for more serious impositions. he linguistic Grice, H. Paul. 1975. Logic and conversation. In Syntax and
structures for conveying particular kinds of politeness are also Semantics. Vol. 3: Speech Acts. Ed. P. Cole and J. Morgan, 4158. New
underlyingly similar across languages, with the politeness of
Hickey, Leo, and Miranda Stewart, eds. 2005. Politeness in Europe.
solidarity (positive politeness) characterized by expressions of
Clevedon, UK: Multilingual Matters.
interest in the addressee, exaggerated expressions of approval, Ide, Sachiko. 1989. Formal forms and discernment: Two neglected
use of in-group identity markers and address forms, seeking aspects of linguistic politeness. Multilingua 8.2/3: 22338.
of agreement, and avoidance of disagreement, whereas avoid- Lakof, Robin. 1973. he logic of politeness or minding your ps and qs.
ance-based politeness (negative politeness) is characterized In Papers from the Ninth Regional Meeting of the Chicago Linguistic
by self-efacement, formality, restraint, deference, hedges, and Society, 292305.
impersonalizing mechanisms like nominalization or passive Leech, Geofrey. 1983. Principles of Pragmatics. London: Longman.
constructions. Placencia, Mara E., ed. 2006. Research on Politeness in the Spanish-
To explain these kinds of detailed parallels across languages Speaking World. Mahwah, NJ: Erlbaum.
and cultures in the minutiae of linguistic expression in socially Watts, Richard J. 2003. Politeness. Cambridge: Cambridge University
Press.
analogous contexts, Brown and Levinson proposed an abstract
Watts, Richard J., Sachiko Ide, and Konrad Ehlich, eds. 1992. Politeness in
model of politeness as strategic attention to face, deriving strat-
Language. Berlin: Mouton de Gruyter.
egies for constructing polite utterances in diferent contexts on
the basis of assessments of three social factors: the relative power
(P) of speaker and addressee, their social distance (D), and the POLITICS OF LANGUAGE
intrinsic ranking (R) of the face-threateningness of an imposition. Politics of language is not a domain or subdiscipline. It is an idea
In contrast with rule-based approaches, Brown and Levinson that puts the study of language in a perspective: the idea that
argued that politeness inheres not in words or sentences per se; language is a politically invested object and that people, con-
politeness is an implicature that may be conveyed by utterances sequently, act politically in, through, and on language. In that
spoken in context, by virtue of successful communication of a sense, the term covers an enormous range of phenomena and
polite attitude or intention. cuts across numerous disciplinarily organized practices. he
Politeness continues to be a major focus for research in many issue is one of function, and the politics of language suggests
disciplines concerned with social interaction, and the topic that political meanings and efects are among the functions of
now has its own professional journal, the Journal of Politeness language. In fact, some would emphasize that there are no non-
Research. Over the past 30 years, empirical descriptions of par- political meanings.
ticular politeness phenomena from many diferent parts of the Such political functions are metapragmatic: hey operate
world have accumulated, with the research emphasis largely through meta-discourses on language, on things people say
on cross-cultural diferences. here has been much theoretical about language in language. hus, the politics of language is a
controversy over whether, indeed, there are any universal prin- language-ideological phenomenon. Clear instances of it are
ciples of politeness and if so, what form they take. he recent widespread utterances such as English is the language of busi-
trend seems to be toward emphasizing emic rather than etic ness or Xhosa is a language for community interaction. In
approaches (cf. Watts 2003; Eelen 2005). But the importance of both instances, a particular language is deined as a language
politeness goes far beyond the ps and qs of appropriate behav- that operates with a speciic load, a speciic set of social, cul-
ior and speech in a particular cultural setting. Its wider signii- tural, economic political attributes, all of them implicitly
cance is in the interactional, communicative, day-to-day basis of articulated: Whenever I use English, my language use will be
social life and the conduct of social relationships. Recent devel- framed as business, and I will speak like a businessman. he
opments in the theory of social interaction that take account politics of language has to do with the way in which we associ-
of our common human nature (e.g., Goody 1995; Enield and ate particular varieties of language (forms) with particular nor-
Levinson 2005; see also universal pragmatics) ofer hope mative complexes, genres and topical domains, and identities
for theoretical progress in this ield. (functions). he relationship between forms and functions, thus
Penelope Brown deined and seen as relatively stable (stable enough to generate
shared meanings), is usually deined as ideology, and authors
WORKS CITED AND SUGGESTIONS FOR FURTHER READING explicitly addressing the politics of language often focus on ide-
ology, hegemony, and ideological naturalization.
Brown, Penelope, and Stephen C. Levinson. [1978] 1987. Politeness: Some
In what follows, I irst give a brief overview of some key notions
Universals in Language Use. Cambridge: Cambridge University Press.
Eelen, Gino. 2005. A Critique of Politeness heories. Manchester: St.
and authors, then engage in a brief survey of some recent work
Jerome. and focus on language ideologies as a frame for understand-
Enield, Nick, and Stephen C. Levinson, eds. 2005. Roots of Sociality. ing such political functions. I conclude with an appraisal of this
Cambridge: Cambridge University Press. work.
Gofman, Erving. 1967. he nature of deference and demeanor. In
Interaction Ritual, ed. Erving Gofman, 4795. New York: Anchor Key Notions and Authors
Books. Language has been deined as politically invested since Aristotle
. 1971. Relations in Public. New York: Harper Colophon Books. and the Sophists; it is therefore futile to attempt a historical survey.
636
Politics of Language
Rather, I would suggest we read history backwards, starting from Barthes (1957), who emphasizes the discursive routines and the
the current approaches to the politics of language and look- silences that are generated by the consumer-capitalist society. It
ing into those authors who are seen as formative now. From also underlies Foucaults (1984) notion of order of discourse,
that vantage point, two groups of authors stand out: authors and it is relected in Bourdieus (1991) notion of legitimate lan-
who developed a political view of language and authors whose guage. In each case, macrosocial order manifests itself in dis-
political-analytic work provides tools for scholars in the ield of course patterns, structures, both positively and negatively. he
language. he irst category is dominated by such scholars as M. fact that some things can only be said in some ways is an efect of
M. Bakhtin, V. N. Voloshinov, Roland Barthes, Michel Foucault, the social and political order; the fact that some things cannot be
and Pierre Bourdieu (see habitus); in the latter, Karl Marx (see said at all is an efect of the same thing (Blommaert 2005).
marxism) and Antonio Gramsci stand out. he fact is, however, that people rarely experience this shap-
his collection of authors and insights, it must be realized, ing of discourses as an efect of social and political forces. Mostly,
can only be discussed in a more or less coherent way when a we perceive these discourse routines and absences as normal,
number of conditions are met. In particular, two presuppositions as just the way things are. It is at this point that we see scholars
are required: refer to the Marxian notion of ideology an agentive notion in
which ideational complexes such as discourses have real mate-
(i) It is clear that relections of this kind are predicated on a
rial efects as well as to the Gramscian notion of hegemony.
view of language as a social object (not a mental object); such
Hegemony is ideological dominance, that is, dominance that
relections belong to the realm of a social theory of language.
is not perceived as dominance but as a neutral, normal state of
(ii) hey also are predicated on a view in which language afairs. Social and political forces operate in language through
displays intricate connections with social structure: Either hegemony, that is, through naturalized, neutralized, and nor-
language mirrors social structure (especially structures of malized perceptions and forms of behavior.
inequality) or it can become an instrument for changing hese authors all provide frequently used key notions and
social structure. insights, all of which revolve around the same central node: that
hese presuppositions ensure that the authors mentioned can language is not a neutral phenomenon but one that bears deep
become interlocutors for current practitioners in the ield, and such traces of social and political structures and processes in society.
practitioners would then be clustered in applied ields, such as he use of language, consequently, is always an activity that has
discourse analysis (both linguistic and foucaultian), social and political dimensions: It can reproduce existing struc-
sociolinguistics, and linguistic anthropology. tures or challenge them, it can empower or disempower people,
he work of Bakhtin and Voloshinov has been inluential in and it can enfranchise and disenfranchise them.
its emphasis on the social and political dimensions of a key fea-
ture of real language: its heteroglossic nature (see dialogism State of Affairs
and heteroglossia). Heteroglossia stands for the presence he political load of language is one of the central concerns
of multiple voices in an act of communication, and such for critical discourse analysis (CDA), an approach to dis-
voices are intricately related to social formations and interests. course analysis that, especially since the 1990s, explicitly focuses
Whenever we communicate, thus, we engage with existing com- on the ways in which discourse relects power and social struc-
plexes of social (and cultural) meaning, we insert ourselves in an ture and constructs it (Fairclough 1989, 1992; Blommaert 2005).
intertextual tradition in which such complexes make sense, It is CDAs stated goal to analyze opaque as well as transparent
and we articulate interests, not only (neutral, self-contained) structural relationships of dominance, discrimination, power
meanings. In addition, the articulation of such interests is not and control as manifested in language (Wodak 1995, 204) a
a unilateral and linear event. Bakhtin (1981) emphasizes the paradigmatic choice that is relected in numerous studies on rac-
importance of evaluative uptake in interaction his dialogical ism, sexism, media, and political discourse and advertisements.
principle in which every act of communication requires rati- In all of these, linguistic and textual patterns are analyzed as con-
ication by the other in order to be valid, that is, in order to be duits for hegemony and power abuse, and CDA has been inlu-
meaningful. his process of ratiication is evaluative: It is done ential in identifying registers and genres of power and control.
from within ordered complexes of forms-and-meanings in CDA clearly subscribes to a view of language as loaded (cf.
which appropriateness, social roles, luency, and other quality Bolinger 1980) and as invested with social and political interests
attributes are speciied. hus, even if I think I produce a cogent that steer discourse into particular, structural (i.e., nonarbitrary)
story, my interlocutor may judge it to be of the mark because patterns of use and abuse. he inluence of Foucault, Gramsci,
what I say and how I say it do not qualify as good enough in his/ Bourdieu, and other critical theorists is explicitly acknowledged
her evaluative framework. And evidently, such evaluative frame- in much CDA work.
works are relections and instruments of the social and political he same paradigmatic choice underlies work in what
order (Voloshinov 1973). could be called critical sociolinguistics: an approach in which
his social and political order penetrates language at a fun- the distribution of language in society is also seen as a relec-
damental level: It shapes discourses. Discourses are complexes tion of power processes, often crystallized in normative (stan-
of communicative forms (genres, styles) mapped onto thematic dards) discourses and invariably entailing judgments of users
and social domains, and what the social and political order does through judgments of language use (e.g., Milroy and Milroy
is to create spaces in which particular discourses operate while it 1985; Cameron 1995; see standardization). Variation in lan-
eliminates other such spaces. his idea is central to the work of guage speaks to variation in society, and such forms of variation
637
Politics of Language
are evaluated given diferent value. Institutionalization, such autonomous but cultural and social, and it displayed coherence
as, for example, in the education system (Rampton 1995) or in with other aspects of social and cultural patterning. In that sense,
bureaucracy (Sarangi and Slembrouck 1996) can stabilize and grammatical form responded to collective patterns that orga-
reify such evaluative patterns and use them as normative, exclu- nized social and cultural behavior, including linguistic behavior.
sive, and excluding instruments of power and control. Sensitive he full richness of Whorfs approach was established by people
social identities, such as gender and immigrant identities, can like Michael Silverstein (1979). Silverstein suggested that we
be especially vulnerable to exclusion or marginalization in such read Whorfs argument as follows: Linguistic form is indexical;
reiied normative structures. it indexes aspects of context through ideological inferences: A
Both CDA and critical sociolinguistics seek an integration of particular form stands for a particular social and cultural
the linguistic or discourse-analytic method with social theory, meaning (also Silverstein 2003). hus, in French, tu and vous
thus reversing the tendency toward autonomy and disciplinary share a great deal of linguistic meaning but are diferentiated by
recognizability that characterized earlier phases in the develop- indexical meanings; tu indexes a low second person singular
ment of these disciplines (e.g., Cameron 1992; Chouliaraki and addressee, while vous indexes a high second person singular
Fairclough 1999). his move is aimed at strengthening the funda- addressee. he one who uses tu or vous would express indexically
mental theoretical assumption: that language and social struc- his/her degree of respect and social distance toward the inter-
ture stand in an intricate relationship to each other and that one locutor, and the interlocutor would attribute conventional iden-
cannot be understood without an understanding of the other. tity features, such as polite, proper, well educated, middle
From another theoretical angle, linguistic anthropology has class, and so on to the one using these forms. hus, we select
signiicantly contributed to the study of the politics of language. linguistic (and wider semiotic) forms in relation to socially and
In contrast to the previous schools, linguistic anthropology has its culturally shared ideas about what would be appropriate, good,
roots in an integrated science of human behavior. he anthropo- useful, and salient communicative behavior in a speciic context,
logical notion of language, consequently, appears easier to inte- and our use of semiotic means creates, supports, and manipu-
grate into a mature social-theoretical framework than notions of lates contexts.
language that have their feet in twentieth-century linguistic tra- his reconstruction of Whorfs foundational insight has signif-
ditions. he fact that language forms and structures need to be icant implications. One efect is that it creates a new, but essen-
seen as relective and constructive of sociocultural and political tially inseparable, layer to language structure: a metapragmatic
realities was central to Edward Sapirs work (1921), and the post layer. Accepting that layer means that the analyst must accept
World War II reemergence of the ethnography of communication that whenever we communicate, we not only communicate in
(Gumperz and Hymes 1972; Gumperz 1982; Hymes 1996) started our communication but also communicate about our commu-
from the assumption that there is no occurrence of language that nication: We always lag socially and culturally shared (ideo-
is not drenched in social, cultural, historical, and political con- logical) indexical meanings while we talk, and these indexicals
texts and that, consequently, can be understood without atten- make others perceive our talk as serious, arrogant, funny,
tion to these contexts (Duranti 1997). It is from within linguistic or knowledgeable. he metapragmatics of language organizes
anthropology that the paradigm of language ideologies devel- its pragmatics its meaning in society. And this, then, means
oped (Schiefelin, Woolard, and Kroskrity 1998; Blommaert 1999; that approaches solely focused on a pragmatics of language risk
Kroskrity 2000; Bauman and Briggs 2003). buying into commonly shared metapragmatic frames; in other
words, a normal linguistics always risks dragging along the
Language Ideologies widespread language ideologies that dominate its object.
Language ideologies are beliefs, ideas, views, and perceptions Another efect is that the range of variability in language is
about language and communication. Such ideational complexes vastly expanded, for the metapragmatic layer also provides an
pertain to every aspect of communication: about linguistic forms enormous potential for social and cultural diferentiation (dis-
and functions, as well as about the wider behavioral frames (often tinction, to borrow Bourdieus term). In a nutshell, we can say
called nonlinguistic) in which they occur. hus, in the ield of that every possible diference in language can become a socially
language ideologies, people are seen to perform meanings, and and culturally salient and important diference and that linguis-
language in the narrow sense of the term is seen as just one mode tic diferences need not be big in order to generate important
of meaning production. People produce semiosis (meaningful social and cultural diferences.
symbolic behavior) as performance, and they do so within a
regimented field in which language ideologies produce stability Evaluation
and recognizability. Seen from that perspective, language ide- he idea that language is a politically invested object and that
ologies are of course not just ideational; they are practical in the people act politically in, through, and on language is by now a
sense of Bourdieu, referring to the Marxian praxis, rather than to well-established theoretical frame, the legitimacy of which no
the Mannheimian or Durkheimian notion of ideology. longer requires debate. One reason is the fact that the diferent
he study of language ideologies emerged out of the Whorian approaches discussed here all have very strong empirical incli-
concern with connections between language form and world nations and that studies documenting the politics of language
view (Hill and Mannheim 1992). To recap Benjamin Whorfs often manage to transcend the slogans of a committed social
basic idea, he argued that grammatical categories encoded science and bring theoretical and methodological innovation to
and thus revealed aspects of collective perceptions of reality; the ield. CDA has done much to sensitize discourse analysts at
as such, grammatical organization was not random, logical, or large about the fact that discourse matters to people because it
638
Politics of Language Possible Worlds Semantics
is invested with power and social capital; critical sociolinguis- Silverstein M. 1979. Language structure and linguistic ideology. In
tics has likewise drawn attention to the fact that sociolinguistic he Elements, ed. P. Clyne, W. Hanks, and C. Hofbauer, 193247.
distribution is not just a horizontal phenomenon but also a ver- Chicago: Chicago Linguistic Society.
tical one: Sociolinguistic diference is complemented by socio- . 2003. Indexical order and the dialectics of sociocultural life.
Language and Communication 23: 193229.
linguistic inequality. And from within linguistic anthropology,
Voloshinov, V. N. 1973. Marxism and the Philosophy of Language.
we have witnessed the emergence of a powerful ethnographic
paradigm that recovers the holistic and rich agenda developed Wodak, R. 1995. Critical linguistics and critical discourse analysis. In
earlier by the likes of Sapir and Whorf and applies these insights Handbook of Pragmatics: Manual, ed. J. Verschueren, J. O. stman,
to an expanding ield of fundamental and applied topics of lan- and J. Blommaert, 20410. Amsterdam: John Benjamins
guage in society. he language-ideological approach appears to
be the most promising one because of its compelling theoret-
ical coherence and empirical applicability, and it would beneit POSSIBLE WORLDS SEMANTICS
adjacent disciplines if the central insight that every pragmat- Possible worlds semantics is a family of semantic theories in
ics of language is accompanied by a metapragmatics would be which the truth conditions of modal concepts and other inten-
adopted. sional locutions are expressed with the help of the concept of
Jan Blommaert possible world (scenario, possible state of afairs, possible course
of events). (See modality, intension and extension.)
WORKS CITED AND SUGGESTIONS FOR FURTHER READING Human beings constantly ind themselves concerned with
what could happen or might have happened. he modal notions
Bakhtin, M. M. 1981. he Dialogic Imagination: Four Essays.
of possibility and necessity are used to cope with such situations.
Austin: University of Texas Press.
Barthes, R. 1957. Mythologies. Paris: Seuil.
Less directly, notions like knowledge, belief, obligation, permis-
Bauman, R., and C. Briggs. 2003. Voices of Modernity: Language Ideologies sion, and so on serve the same purpose. Concepts behaving in
and the Politics of Inequality. Cambridge: Cambridge University Press. essentially the same way as necessity, knowledge, and so on are
Blommaert, J. 2005. Discourse: A Critical Introduction. known as intensional concepts. Modal notions have several vari-
Cambridge: Cambridge University Press. eties, among them logical, conceptual, metaphysical, natural,
Blommaert, J., ed. 1999. Language Ideological Debates. Berlin: Mouton de nomic, and physical modalities. When the diferent possibilities
Gruyter. can be weighted, one can also evoke the concept of probability.
Bolinger, D. 1980. Language: he Loaded Weapon. London: Longman. It is nevertheless only relatively late that philosophers and
Bourdieu, P. 1991. Language and Symbolic Power. Cambridge, logicians came to think that in order to understand modal
UK: Polity.
notions (and other related notions), we have to consider unreal-
Cameron, D. 1992. Feminism and Linguistic heory. London: Macmillan.
ized courses of events or states of afairs and, hence, merely pos-
. 1995. Verbal Hygiene. London: Routledge.
Chouliaraki, L., and N. Fairclough. 1999. Discourse in
sible worlds. Earlier philosophers usually did not think in such
Late Modernity: Rethinking Critical Discourse Analysis. terms. For one thinker, Aristotle, the only reality is the succession
Edinburgh: Edinburgh University Press. of present moments outside of which there are no other possible
Duranti, A. 1997. Linguistic Anthropology. Cambridge: Cambridge courses of events. he idea of many worlds began its develop-
University Press. ment in the Middle Ages, encouraged by the famous condemna-
Fairclough, N. 1989. Language and Power. London: Longman. tion of 1277 of the view that God could not create other worlds.
. 1992. Discourse and Social Change. Cambridge, UK: Polity. he notion of possible world was put to major metaphysical uses
Foucault, M. 1984. he order of discourse. In Language and Politics, ed. by G. W. Leibniz for whom metaphysical truths are truths hold-
M. Shapiro, 10838. London: Blackwell.
ing in all possible worlds.
Gumperz, J. 1982. Discourse Strategies. Cambridge: Cambridge University
In twentieth-century philosophical logic, the notion of possi-
Press
ble world became prominent when modal logic was approached
Gumperz, J., and D. Hymes, eds. 1972. Directions in Sociolinguistics: he
Ethnography of Communication. New York: Holt, Rinehart and from a model-theoretical or semantic point of view. he use of the
Winston notion of possible world in the study of modalities is analogous
Hill, J., and B. Mannheim. 1992. Language and world view. Annual to the measure-theoretical approach to probability theory, with
Review of Anthropology 21: 381406. probability theorists sample-space points playing the same role
Hymes, D. 1996. Ethnography, Linguistics, Narrative Inequality: Toward as logicians possible worlds. One of the pioneers of the seman-
an Understanding of Voice. London: Taylor and Francis. tic study of modalities was Rudolf Carnap (1947), who explicitly
Kroskrity, P., ed. 2000. Regimes of Language. Santa Fe, NM: SAR. acknowledged the inspiration he received from Leibniz. he
Milroy, J., and L. Milroy. 1985. Authority in Language: Investigating early treatments of the logic and semantics of modalities nev-
Language Prescription and Standardisation. London: Routledge.
ertheless relied heavily on syntactical concepts and arguments.
Rampton, B. 1995. Crossing: Language and Ethnicity among Adolescents.
For instance, Carnap represented possible worlds by sets of
London: Longman.
sentences he called state-descriptions. A state-description is a
Sapir, E. 1921. Language: An Introduction to the Study of Speech. Orlando,
FL: Harcourt Brace. complete list of atomic sentences and the negations of atomic
Sarangi, S., and S. Slembrouck. 1996. Language, Bureaucracy and Social sentences that are true in some model.
Control. London: Longman. In such semisyntactical theorizing, interpretational questions
Schiefelin, B., K. Woolard, and P. Kroskrity, eds. 1998. Language were neglected, relatively speaking. Fortunately, this neglect did
Ideologies: Practice and heory. New York: Oxford University Press. not initially matter. For what is the cash value of assuming that
639
Possible Worlds Semantics
possible worlds exist? According to Van Quine, such existence automatically help the identiication of the same individual in
means that we can quantify over them (see quantification). another world. Nor does the rest of the reference system help us
he starting point of possible worlds semantics is the insight that here. here must exist principles deining what counts as a single
many modal and other intensional concepts can be construed as individual across possible worlds. heir totality can be called an
quantiiers over suitable classes of possible worlds. If NS means identiication system.
it is necessary that S, it is true if and only if S is true in all pos- he nature of such identiication has given rise to extensive
sible worlds. It is possible that S, briely PS, is true just in case discussion and controversies. he identiication system codiied
S is true in some possible world. If KaS means a knows that S, in our language is largely independent of the reference system.
it is true if and only if S is true in all the possible worlds not ruled Indeed, there are two diferent kinds of identiication actually
out by what a knows, and so on. hus, the idea of possible worlds used in our conceptual system. An identiication system can
was involved right from the beginning in the development of the be visualized as a kind of map shared by the possible worlds
semantics of modal logic, following the work of Alfred Tarski and between which the identiication is to take place. In the most
his associates. (Cf. Copeland 2002; Kanger 1957; Hintikka 1957a, common cases of identiication, the map can be thought of as a
1957b; Kripke 1959.) he irst to emphasize the role of possible kind of universal registry of the relevant population. For instance,
worlds semantics as the basis of general semantics seems to have if the iles of the Social Security System were to serve as such a
been Richard Montague (cf. Montague 1974). system, I would know who someone is if and only if I knew his or
Even though this is, for most purposes, an adequate expla- her social security number. Such identiication could be called
nation of the meaning of KaS, the characterizations of necessity public. An idea of how the criteria of public identiication could
and possibility need further speciication, namely, an indication work can be obtained by considering how we reidentify objects
of what kind of modality we are dealing with. For instance, not over time. Continuity considerations obviously play a major role,
all logically possible worlds are nomically (physically, naturally) but questions as to how objects behave over time also come into
possible. play.
We thus seem to obtain a semantically interpreted language An individuals position in someones perceptual space or
by adding to a irst-order language the operator or those opera- remembered role in someones past experiences can also serve
tors we are interested in. On the basis of this idea, we can develop as a framework of identiication. he simplest framework of this
much of a viable modal logic, epistemic logic, and so on, as well kind is someones visual space. Such forms of identiication are
as the required methods of proof. called perspectival. Among other expressions of our language,
his procedure is not suicient alone, however. For one thing, demonstratives rely on perspectival identiication. heir oper-
the possible worlds that igure in these explanations are relative ation is illustrated by Bertrand Russells onetime view that the
to the world w in which NS, PS, KaS, and so on are evaluated only logically proper names of English are this, that, and
semantically. hey will be called alternatives to w. To deal with I. he explanation is that Russell tacitly presupposed only per-
iterated or multiple modalities, we have to consider alternatives spectival identiication.
to alternative worlds, and so on. he alternativeness relation he distinction between perspectival and public identiica-
involved here is sometimes called the accessibility relation. tion gains further interest and robustness from the fact that these
his does not yet completely determine the possible worlds two systems are, in the case of visual cognition, implemented
semantics. To see what is missing, consider how the references by diferent parts of the human brain (Vaina 1990; Hintikka and
of linguistic expressions are determined. he guiding principle of Symons 2003). Since quantiiers depend on identiication, they
possible worlds semantics is that the application of a language, acquire a diferent meaning according to the kind of identiica-
including the reference of any expression e, in a given world w tion presupposed.
must depend only on that world. he way in which the reference hese observations open the doors to extensive applica-
of e in w is determined is, therefore, codiied in the function f that tions of logical languages with a possible worlds semantics. For
determines the reference of e as a function f(w, e) only. We could instance, a simple wh-statement Alonzo knows who (call him or
call the totality of these functions the reference system of the her x) is such that F[x] can be expressed by a sentence of the form
language. For instance, the reference of the 44th president of (x)KAlonzoF[x] where x ranges over persons. his shows how to
the United States is whoever wins the 2008 election. formalize simple knows who statements in general. For example,
hese ideas of reference and meaning are, in fact, the corner- (x)KAlonzo(Barbara = x) says that Alonzo knows of some particu-
stone of the version of possible worlds semantics most exten- lar individual x that Barbara is that x. his unmistakably means
sively used in linguistics, known as Montague semantics (see that Alonzo knows who Barbara is. Such statements may be con-
montague grammar). It was developed by Montague (1974) trasted to Ka(x)(b = x), which merely says that a knows that b
and applied in linguistics most vigorously by Barbara Partee exists.
(1976, 1989). his kind of variation of operator ordering K versus (x)
here are further problems in the development of possible cannot do the whole job, at least if we want to stay on the irst-
worlds semantics, however. When we use a quantiier, we con- order level. In order to do so, we have to resort to a recently
sider each of its values as being the same individual in diferent introduced idea of operator independence (Hintikka 2003).
possible worlds. But how can such identities be recognized? Since modalities are characterized by quantiication over pos-
hey cannot be established by examining the diferent possible sible worlds, the same kind of independence can obtain between
worlds in question independently of one another. For instance, modal operators and quantiiers as between quantiiers. his
a name-conferring (dubbing) ceremony in one world does not independence can be expressed by a slash. hus, we can express
640
Possible Worlds Semantics Possible Worlds Semantics and Fiction
(x)Ka(b = x) equivalently as Ka(x/Ka)(b = x), where x/Ka . 2003. A second generation epistemic logic and its general signii-
means that is independent of Ka. (Notice that by so doing, we
cance. In Knowledge Contributors, ed. by V. Hendricks, K. F. Jrgensen,
and S. A. Pedersen, 3356. Dordrecht, the Netherlands: Kluwer
can stay closer to the structure [word order] of the corresponding
Academic.
English knowledge statements.) In more complicated cases, we Hintikka, Jaakko, and John Symons. 2003. Systems of visual identiica-
can, for instance, express a knows which function g(x) is by tion and neuroscience: Lessons from epistemic logic. Philosophy of
Ka(x)(y/Ka)(g(x) = y). Science 70: 89104.
Kanger, Stig. 1957. Provability in Logic. Stockholm Studies in Philosophy.
We cannot stay on the irst-order level here without the inde- Vol. 1. Stockholm: Almqvist and Wicksell.
pendence indicator. Kripke, Saul A. 1959. A completeness theorem in modal logic. Journal
his does not clear up all interpretational problems, however. of Symbolic Logic 24: 114.
We can still ask: What are the relevant possible worlds in difer- Montague, Richard. 1974. Formal Philosophy: Selected Papers by
Richard Montague, ed. by Richmond homason. New Haven, CT: Yale
ent applications? his question is connected with the question as
University Press.
to what kinds of modalities and other intensional notions there
Partee, Barbara H. 1989. Possible worlds in model-theoretic seman-
are.
tics: A linguistic perspective. In Possible Worlds in Humanities, Arts
he characterization of possible worlds as represented by and Sciences: Proceedings of Nobel Symposium, ed. by S. Alln, 93123.
maximal consistent classes of sentences of a given language has Berlin and New York: Walter de Gruyter.
encouraged the idea that what is intended by possible worlds are Partee, Barbara H., ed. 1976. Montague Grammar. New York: Academic
indeed worlds in the sense of entire universes. However, a com- Press.
parison with probability theory shows that such grandiose inter- Vaina, Lucia. 1990. What and Where in the Human Visual System: Two
pretations are neither unavoidable nor even preferable. In most Hierarchies of Visual Modules. Synthese 83: 4991.
applications of probability theory, the possible worlds (sample
space points) are not worlds in any ordinary sense of the word.
hey usually are what might be called scenarios, namely, courses of POSSIBLE WORLDS SEMANTICS AND FICTION
events involving a small region of space-time, for example, tosses he applications of the philosophical concept of possible world
of a die. Some probability theorists speak of small worlds, and to narrative and to iction were irst developed in the late 1970s
practically all realistic applications of possible worlds semantics and early 1980s as a reaction to structuralist poetics, a
are to such small worlds. In some of his work, Montague, in fact, movement that adhered to Ferdinand de Saussures conception
operates with contexts of use, rather than possible worlds. of language as a self-enclosed system of signs. As homas Pavel
here remains the question of diferent modalities. Are they has argued, this theoretical position led to a moratorium on
all viable in the light of possible world semantics? here are no representational topics (1986, 6) and on the notion of reference
major unsolved conceptual problems about epistemic or dox- to a world external to language. In its literary applications, pos-
astic modalities or other similar intensional concepts. he class sible worlds (hence, PW) semantics is an attempt to restore the
of alternative epistemic worlds has a clear meaning or at least relevance of mimesis, reference, and the question of truth with-
as clear a meaning as our language has. Logical (conceptual)

The Cambridge Encyclopedia of Language Science

Enviado por

Dados do documento

Direitos autorais

Formatos disponíveis

Compartilhar este documento

Compartilhar ou incorporar documento

Opções de compartilhamento

Você considera este documento útil?

Este conteúdo é inapropriado?

Direitos autorais:

Formatos disponíveis

The Cambridge Encyclopedia of Language Science

Enviado por

Direitos autorais:

Formatos disponíveis

The Cambridge Encyclopedia of the Language Sciences

THE LANGUAGE SCIENCES

Cambridge University Press

Cambridge University Press 2011

his publication is in copyright. Subject to statutory exception

First published 2011

Printed in the United States of America

Library of Congress Cataloging in Publication data

ISBN 978-0-521-86689-7 Hardback

Cambridge University Press has no responsibility for the persistence or

ADVISORY EDITORIAL BOARD

Florian Coulmas Barbara Lust

William Croft Lee Osterhout

Lyle Jenkins James Pustejovsky

CONSULTING EDITORIAL BOARD

Mark Baker Howard Lasnik

Deborah Cameron Loraine Obler

Nigel Fabb William OGrady

Carol Ann Fowler Susan Pintzuk

Ronald Geluykens Eleanor Rosch

Margaret Harris Jay Rueckl

Zoltn Kvecses Mark Turner

Purua-artha-nyn gun pratiprasava kaivalya

List of Entries page xi

1 Language Structure in Its Human Context: New Directions

2 The Psychology of Linguistic Form 12

3 The Structure of Meaning 23

4 Social Practices of Speech and Writing 35

5 Explaining Language: Neuroscience, Genetics, and Evolution 46

7 Elaborating Speech and Writing: Verbal Art 65

List of Contributors 941

A Biolinguistics Computational Linguistics

Dialect Genes and Language Left Hemisphere Language Processing

Morphology, Neurobiology of Poetic Language, Neurobiology of Rhythm

Synchrony and Diachrony Transformational Grammar Verbal Reasoning

In putting together a volume on science, then, I have tried

WORDS Using the lexical entry cat as an example, imagine a connec-

h is is an example of structural entailment, because it is the

(4) a. he drought killed the crops.

Such lexical entailments involve an inference that is tied directly

(7) AGENT > RECIPIENT/BENEFACTIVE > THEME/PATIENT >

(10) [human animate]: human animate

and has stimulated much research recently (Bouillon 1997;

3 BUILDING SENTENCE MEANINGS

3.2 Quantiiers and Scope (32) a. Every woman sang a song.

(43) a. e[eat(e, m, the-soup)]

b. e[eat(e, m, the-soup) & with(e, a spoon)]

c. e[eat(e, m, the-soup) & with(e, a spoon) & in(e, the

(39) a. Mary arrived. kitchen)]

he meaning of such implicatures is not part of the asserted

THE FIELD OF LANGUAGE ACQUISITION

learning by hanging on to some aspect of the input that it can

THE LANGUAGE SCIENCES

On any of these options, the universal remains exception-

Figure 1. Waveforms, spectra, and

back 8 cm2 1 cm2 front

so its resonances must have velocity minima at both ends. heir

where Lrc is the length of the resonating cavity. he second com-

Place of Segment Proportion of oral Proximity to minimum or maximum

AGREEMENT WORKS CITED AND SUGGESTIONS FOR FURTHER READING

ANAPHORA restrictive conditions, some of these expressions may yet allow a

Figure 1. Several of the key brain regions affected

Treatment for Aphasia

Philip Lieberman BASIC LEVEL CONCEPTS

Japanese: (^ ^) [Congratulations on your comeback

(as if singing) That was good (^ ^)]